Microkernel hatred is a peculiar phenomenon. Sheltered users who have never had any background in much beyond Windows and some flavor of free monolithic Unix will, despite a general apathy toward or ignorance of the relevant subjects, have strong opinions on the allegedly dreadful performance and impracticality of “microkernels”, however they define the term (and we shall see that a lot of people have some baffling impressions of what a microkernel is supposed to be). Quite often, these negative views are the result of various remarks made by Linus Torvalds and a general hero worship of his character, a misrepresentation of an old Usenet flame war between AST and Torvalds that was somehow “won” and which supposedly proved that microkernels are nothing but a toy of ivory tower academics, or a rehash of quarter-century-old benchmarks on CMU’s Mach that were unfavorable. The presence of Linus’ character in much of this is no coincidence. It strikes me that anti-microkernel sentiment most vocally originates as a sort of tribal affiliation mechanism by Linux users to ward off insecurity.
In any event, this article will be a concise tour of microkernel myths and misconceptions throughout the ages.
I wouldn’t exactly call this article “concise”, but it’s definitely filled with valuable technical information.
Recently, research projects such as Microsoft’s Singularity O/S have demonstrated that it is possible to forgo hardware memory protection in favor of running only verifiable code that provably cannot access anything outside the set of resources made available to it.
This let their system adopt a microkernel architecture while avoiding context switches, sidestepping one legitimate concern – that context switches really do incur a performance cost, and these architectures mandate more of them.
It’s fascinating stuff, and one of the best bits of real blue-sky thinking (although, as with all good ideas, firmly standing on the shoulders of giants) in O/S design.
Here’s a link (https://en.wikipedia.org/wiki/Singularity_(operating_system)) to the Wikipedia article, which is a good overview and gives further links to the project itself.
Are context switches really that big of an issue though?
If you were just comparing uKernels to monolithic kernels I would say no. Modern ones tend to try and avoid them unless absolutely necessary, and while the results vary tremendously, L4, for example, seems to do a very good job of it. In other words, they avoid context switching to a large enough degree that its impact is mostly noise relative to comparable monolithic kernels.
But Singularity is a bit different. It doesn’t just avoid context switches when going from user mode to kernel mode, it avoids them entirely. It has a single address space, so there are never any context switches.
Papers I have seen put the cost of a full context switch at around 30us (average) on modern, high end x86 processors. A really busy machine running lots of busy processes can easily end up doing over 7.5k per second. In other words it is spending ~20% of the time in context switches.
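Spelling out the arithmetic behind that estimate (taking the 30us and 7.5k/s figures above at face value):
————
7,500 switches/s × 30 µs/switch = 225,000 µs ≈ 0.225 s out of every second ≈ 22.5% of CPU time
————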
Something that avoids them entirely and can run many, many processes performing some kind of collective work through IPC would benefit from it quite a bit. And the more processes running, the bigger the benefit.
Singularity avoids switching the MMU. But of course there are still context switches; only the context is much smaller.
If two or more tasks/processes/threads/execution entities share the same core, the OS/scheduler must still save and restore information (mostly CPU registers) about the tasks.
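A minimal sketch of what that saved per-task state might look like (field names are purely illustrative, not taken from any real kernel):
————
/* Illustrative per-task context: the registers a scheduler must save
 * and restore even when no MMU/address-space switch happens. */
struct task_context {
    unsigned long sp;                /* stack pointer               */
    unsigned long pc;                /* where to resume execution   */
    unsigned long callee_saved[6];   /* callee-saved general regs   */
    unsigned long flags;             /* CPU flags register          */
    /* FPU/SIMD state is usually much larger and often saved lazily. */
};

/* Conceptual switch: store the outgoing task's registers, load the
 * incoming task's. Real kernels implement this in assembly. */
void switch_to(struct task_context *prev, struct task_context *next);
————
That state has to be saved and restored on every switch between tasks, whether or not the address space changes.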
Yeah, I was wondering about the cost of context switches in the context (pun intended) of micro vs. macro kernels. Most of the academic literature I’ve read tended to point towards the context switch overhead induced by microkernels being a non-issue in the overall scheme of things.
Yes.
Not because microkernels are a bad idea – it is a perfectly good design goal – but it is never implemented as cleanly as it is described. These days even Linux is not cleanly a macrokernel (loadable modules, user-land services, etc.), and kernels that started out as microkernels, like modern Windows, are now more macro.
So use it where it makes sense, especially when writing a new OS, but how often do you do that?
Well, QNX is a successful microkernel, OS X is pretty much a microkernel, and the biggest GNU/Hurd problem is the lack of people working on it.
I really don’t buy that “microkernels are too difficult” mantra. Rocket science is difficult too and rockets are launched every day, ha!
IMHO the real “problem” is that macrokernels like Linux/Windows are _GOOD ENOUGH_ for 99.9% of the tasks… so there’s no real incentive to go microkernel. It’s the MP3 vs. CD thing.
I think the market (and people in general) always prefers something practical/good to something great. Microkernels are technically super great, every comp-sci student in the world knows that and We love them… but Betamax was great too (and the Amiga, and the SACD, and electric cars… and you name it). That “better is the enemy of the good” thing rules the fscking world. xD
Not really. It’s structured much like a microkernel, with the various parts of the kernel passing messages in the same way a microkernel might, but the whole thing runs in kernel mode, essentially negating all the advantages a microkernel brings to the table.
It is described as a hybrid kernel, but many think that “hybrid” is just a marketing term, and anything that runs wholly in kernel mode is a monolithic kernel.
Additionally, while IOKit allows drivers to be built to run in usermode, the majority of drivers that ship with OSX run in kernel mode, too.
https://www.google.fr/search?q=design+vs+user+experience&tbm=isch
Emulation fits so well in the microkernel world. Other than the virtual 8086 mode modifications in the kernel, it would be just another branch in the tree.
I’ve always wanted to read more about IBM’s Workplace OS (microkernel + OS personalities).
Just when I thought the new year was going to start off right.
😉
1) Microkernels suck. How badly? VERY badly when compared to monolithic ones in terms of efficiency and performance.
Not just a little, I mean a lot. No kidding. Do you think it is a giant conspiracy by OS engineers to somehow not use Micro Kernels? No, it isn’t. It’s a fact. The idea of modularizing at that level on Von Neumann machines simply sucks. If someone tells you that over the past 80 years of digital computing somehow everyone was stupid and nobody knew any better, they are lying to you.
Let’s move on, shall we?
2) Now that I’ve explained why Microkernels suck, let’s explore why they suck JUST SOOOOOOOOO badly.
First of all, to make a modular kernel/system call level actually work as efficiently as a monolithic OS, large amounts of hardware and complexity would have to be added to today’s PC architecture.
That is a problem, but one that can be overcome.
The kicker is, nobody in the academic world can agree as to exactly what a micro kernel is, where it begins and ends in the syscall level and exactly what sorts of security constraints it should have or how it should be organized.
Nope. Nobody agrees. If anybody did agree – and nobody has after 50-plus years of looking at this academic NONSENSE, which is what I call it – a micro kernel org standard would have emerged.
Just like it did for Monolithic kernels, and as a result we have dedicated hardware to accelerate the standard tasks a Monolithic kernel needs to perform.
’Cept it never happened for Micro kernels. Why is that, do you suppose?
Without everyone agreeing on what a Micro Kernel is, you can’t create a set of hardware designs that will enable a Micro kernel to work.
Anyone here know how expensive it is to lithograph and create chipsets? Let me clue you in: VERY expensive.
Which is really key here. No standard? Then no processor/chipset support for modularizing the address and communication spaces – the key parts to making a Microkernel work fast and efficiently.
Intel has, MANY TIMES, investigated designing a chipset/processor architecture for Microkernels using a Von Neumann setup. They all failed, or were labeled too costly, because the academic idiots don’t even know what they want in a Microkernel.
Hence no standard hardware can be sold to accommodate the proponents of Microkernels.
3) Since a Microkernel hardware platform/processor/chipset combo would almost CERTAINLY be more expensive than one for a Monolithic kernel (which requires far simpler hardware for some basic address translation features), it would then be an issue of: OK, who would buy it?
Especially since using a Monolithic machine is just as secure, fast and efficient?
So let me close with an illustration of how a Microkernel syscall works:
https://www.youtube.com/watch?v=Y8cuuP4Jmio
Well… go tell Cisco, who bought a microkernel OS (QNX) in order to build their Internet backbone systems on it. What would they know?
Please don’t confuse the control plane with forwarding. The actual routing engine on most large-scale Internet routers is no more powerful than a 5-year-old laptop computer, because all the heavy lifting is being done on ASICs.
IOS XR is built on QNX because of fault tolerance, NOT because of performance. As the control plane, its job isn’t to be fast, it’s to be reliable. It also needs to distribute various processes and data to other routing engines running either on other chassis or as a standby, so the microkernel way of message passing is seen as less of a hindrance for the architecture.
The crown jewel of the design was supposed to be in-service software upgrades, where portions of the OS could be upgraded without needing to reboot the control system of an Internet core router each time someone rolls out a bug fix. But they have not really been able to capitalize on this, because updates typically contain numerous changes which tend to warrant a reload of the whole thing anyway, so in practice it doesn’t really do the main thing it is supposed to do.
Even with IOS XR they use a hybrid design between QNX and Linux, so it isn’t purely just QNX.
Either way, the job of the OS in this context is to program hardware and sit back, meaning that using it as an example of a high-performance microkernel is largely bogus. My PC is significantly faster than the routing engines that run on core routers, because they only have to reliably process updates to network topology, not packets.
Definitely.
I worked on that.
I often find this is the biggest issue. When it comes to the net benefits, rarely do they pan out or turn out to be all that useful.
For example, modular upgrades. It never quite worked perfectly, but here’s the kicker. Even if it could be done, the actual people using the router wouldn’t be casually upgrading modules. They have their own test cycles and typically have failover to take over while upgrades are done. Again, this is key. Any router used in a major system is going to have HA anyway, which can theoretically handle the load. So they’re going to use that anyway.
So the use-case sounds amazing in theory. But in real life, there really wasn’t one.
This can change of course if you can deliver quality over years and people learn to trust things.
In the end, you’re basically better off focusing on quick reboot times than modular loading in these kinds of routers.
I’d love to tell Cisco a lot of things, believe me I would. Do you have a direct number to someone who cares over there?
Hi,
A system call is exactly the same for both micro-kernel and monolithic kernel.
Note that I didn’t watch the video – it’s blocked (in my country at least) due to copyright infringement.
– Brendan
Are you deliberately being this idiotic? How many system calls are there in a microkernel and how much context switching goes on? Hint: It’s a lot more than a monolithic kernel……which is why monolithic kernels have been ubiquitous and why minimising system calls, context switching and IPC is always important. Microkernel idiots have been parroting for years that faster hardware would make these concerns go away. Obviously not so.
To make a microkernel perform acceptably you’d have to implement large parts of it in hardware and so have to make assumptions about architecture.
Every man and his f–king dog pops their head up every few years, tells us how microkernels are misunderstood, attempts to debunk some ‘myths’ and claims it’s the future everywhere. Incredible stupidity is attempting the same thing over and over for decades on end and expecting a different result. Microkernels exist for reasons that exist only in the minds of certain people, such as driver instability. This really only works in specific cases and is a huge price to pay in overall performance.
Hi,
Actually; it’s a smaller price than running multiple instances of the OS inside separate virtual machines, which is something that’s become common (for servers) simply because things like “chroot jails” only solve part of the problem.
Incredible stupidity is attempting the same thing over and over for decades on end and expecting a different result; like trying to build a secure monolithic kernel. Why do you think Microsoft has mandatory device driver signing and a full certification process; and Linux developers get all whiny and scared when they see a binary blob; and both of these have > 100 critical vulnerabilities per year anyway?
– Brendan
Alas, those ‘security problems’ are very much hypothetical and not worth the cost of effectively destroying acceptable performance and making a kernel unusable. Once you get down to a kernel level you have privileged code and there is simply no getting around that. When you begin ‘untrusting’ all kernel code you end up performing the great deal of extremely sad academic mental gymnastics that microkernel people get themselves into.
I’ve had conversations with hypervisor people who are overflowing with hypothetical security problems whilst greatly increasing the complexity of their systems to support separation and also leaving them unable to reuse tried and tested code.
Hi,
Yes. That’s because most hyper-visors have a massive amount of code running at the highest privilege level (on the host), just like a monolithic kernel.
– Brendan
Agreed on all points but the performance concerns. There is a reason why all of the examples of “good” microkernels are embedded RTOS-style OSes. The lack of performance on that specific hardware only has a small cost associated with it. For those specific use cases, the reliability matters more than the extra pennies per unit.
> Which is really key here. No standard? Then no processor/chipset support for modularizing the address and communication spaces – the key parts to making a Microkernel work fast and efficiently.
Wrong. That’s exactly what IOMMU hardware does on recent x86_64. You might know it as VT-d or “that stuff I need to get GPU passthrough working right in KVM.” And hey, what do you know, giving a VM direct access to physical hardware looks a LOT like a microkernel with protected address spaces and user-mode drivers. Recent x86_64 microkernel research is leaning on the IOMMU hardware for enforcement just like VM hypervisors.
Yes, having to be implemented in hardware for performance reasons and in a specific scenario where it actually makes sense to do so.
Alas, you’d have to implement a lot more in hardware to make a microkernel realistic in terms of performance.
Sums it up quite nicely.
Couldn’t a hypervisor (when looking at it from some POVs) be considered a microkernel?
It’s more of an exokernel.
In practice, hypervisors, like Xen or ESXi, are our ghetto microkernels!!! For sure!! haha
But Xen and VMware still run using monolithic kernels based on Linux and they have exactly the same problems as Linux/Windows/Solaris.
For example: if you have to update a module/driver on an ESXi/Xen host, yeap, in most cases you have to reboot it!!! If you have a stalled LUN or a stuck multipath stack or something related to an “un-interruptable” SCSI command issue… yes, you have to reboot!!! (usually a hard reboot). So… the problems are the same.
As I said, with hypervisors We gained a lot of the microkernel’s advantages following a different path (just to avoid the fundamental problem)… but the fundamental problem is still there laughing at us: monolithic kernels suck and the OSes that We are used to using suck too, because We have exactly the same problems that We had 30 years ago.
But as you know, people like Linus Torvalds have their agendas and they will tell you that Linux is wonderful and monolithic kernels are perfectly fine (even if we try to develop technologies to avoid their problems all the time!! i.e. ksplice, i.e. hypervisors, etc.). Well, let me tell you: Windows/Linux/Solaris are not fine; in fact, ancient OSes like VMS/OpenVMS were 1000x better than the shit We are used to using today.
But hey, this is the IT world… money and mediocrity rules.
The particular problem of needing to reboot on an upgrade isn’t really that much of a problem, at least compared to the problem of coming up with a new operating system that might (only might) be able to accomplish this. Making an operating system is just the first step, there has to be a platform and community around it.
I mean, yeah, I’d rather have the ability to upgrade a module in flight rather than reboot, but it’s such an easy thing to work around that it doesn’t even register against the noise of other challenges we have in IT.
And are VMS/OpenVMS really better than what we’ve got today? Depends on your measure of better. For me, it’s can I do the things I need to do on modern systems. And the answer for that is obviously no. So no, they’re not better.
So yeah, Windows/Linux/Solaris are fine. None of them are perfect, and there’s constant evolution in them. But they’re a lot better than hypothetical, unproven implementations (and by proof, I mean running real workloads) or ancient operating systems that, while useful in their day, aren’t useful anymore. I’m all for better technology and better solutions, but purity for purity’s sake isn’t a solution.
Valuable information?
Those who can, code. Those who can’t, write about it.
Nothing new here.
Many years ago, when there were questions about whether Torvalds had copied Minix code into Linux, I pulled up old copies of both and compared the implementation code in each for one particular system call – I think it was the “open” call. The Linux code was straightforward – it consisted of a function with parameters (file name and flags) passed as arguments to the function in standard C fashion. The Minix code was also implemented as a function, but it had no arguments. How did the parameters get in? They came in through global message blocks, but the way in which they worked was a tangle of spaghetti code – opaque #defines, declarations split across multiple files and different subdirectories – yuk! No one in their right mind would attempt to copy such code. Don’t know if that has anything to do with the micro/macro kernel argument.
Hi,
For a micro-kernel, the kernel doesn’t implement “open()” at all. What it does is implement some sort of communication that is used to send something (e.g. an “open file request” message) to another process (e.g. the virtual file system layer).
This communication is used a lot, so it’s designed carefully and implemented to minimise overhead. It tends to play a significant role in scheduling (tasks blocking while waiting for a message and unblocking when they receive), and (for some micro-kernels and not others) may have a built in permission system (who can send what to who).
Mostly, if you were looking for something that resembles Linux’s “open()” you were looking in the wrong place; and shouldn’t be surprised that what you found in Minix doesn’t look anything like what you found in Linux.
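To make that concrete, here is a rough sketch of what a user-space open() wrapper could look like on such a system. Everything in it (the message layout, send_receive(), VFS_ENDPOINT) is hypothetical, not Minix’s or any real kernel’s API:
————
#include <stddef.h>
#include <string.h>

enum { MSG_OPEN_FILE = 1 };

struct open_request {
    int  type;                 /* MSG_OPEN_FILE                    */
    int  flags;                /* read/write intent, etc.          */
    char path[256];            /* file name                        */
};

struct open_reply {
    int status;                /* 0 on success, otherwise an error */
    int handle;                /* file handle granted by the VFS   */
};

/* Hypothetical kernel IPC primitive: send a request to an endpoint
 * and block until the reply arrives. */
extern int send_receive(int endpoint, const void *req, size_t req_len,
                        void *reply, size_t reply_len);

#define VFS_ENDPOINT 2         /* hypothetical ID of the VFS service */

int my_open(const char *path, int flags)
{
    struct open_request req = { .type = MSG_OPEN_FILE, .flags = flags };
    struct open_reply   rep;

    strncpy(req.path, path, sizeof(req.path) - 1);
    req.path[sizeof(req.path) - 1] = '\0';

    /* The kernel only transports the message; the VFS service (a
     * user-space process) is what actually interprets "open". */
    if (send_receive(VFS_ENDPOINT, &req, sizeof(req), &rep, sizeof(rep)) < 0)
        return -1;

    return rep.status == 0 ? rep.handle : -1;
}
————
The semantics of “open” live entirely in the VFS process; the kernel only does transport and blocking/unblocking.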
Also note that one specific micro-kernel is not all micro-kernels.
It’s entirely possible for someone to dislike Minix’s code but like micro-kernels. I am someone like this (I think synchronous message passing is inferior to asynchronous message passing, especially for modern/multi-CPU systems; I think the kernel’s messaging shouldn’t be bloated up with a permission system and it should be left to the receiver to either accept or reject; etc).
In the same way it’s possible for someone to dislike Linux’s code and still like monolithic kernels. I am someone like this too (Linux was originally a beginner/amateur’s project and has fundamental design flaws in key areas, like memory management and scheduling, that became unfixable because too much code depends on these early design flaws; and kernels like FreeBSD and Solaris are technically superior).
– Brendan
I’ll take your word for it, but here’s a few lines of Minix file “open.c”:
————
/* This file contains the procedures for creating, opening, closing, and
* seeking on files.
*/
————
PUBLIC int do_open()
{
/* Perform the open(name, mode) system call. */
————
Maybe Minix wasn’t such a “microkernel” after all.
Hi,
As far as I can tell, that’s this file: http://www.cise.ufl.edu/~cop4600/cgi-bin/lxr/http/source.cgi/fs/ope…
Note that this is from the “minix/fs” directory and not from “minix/kernel” directory. This is because you’re not even looking at the kernel’s source code to begin with. You’re looking at the “fs” service (that runs as a process in user-space).
– Brendan
Did you actually type that with a straight face?
Can you sort of understand why that would be a huge problem when people start thinking about the thorny issue of performance?
Basing your interactions on message passing means you can do cool shit along the lines of Plan 9. Even if there *is* a local performance penalty, the advantage of accessing local and remote resources using the same protocol far outweighs the cost.
Does it? People have been getting along fine without it.
Once you get into IPC mechanisms within a kernel you are sunk.
Trust me, there is a performance penalty, and a big one. I’m afraid, given how prevalent microkernels are – which is not very – that any claim to the contrary is simply not borne out by the facts.
Hi,
I understand that it can be a minor performance sacrifice (in exchange for more important advantages).
– Brendan
Excuse me, my sides have split. That performance sacrifice where any IPC is concerned (with memory copying and copying of any data structures) is absolutely enormous, and it is prohibitive enough that there are very few microkernel-based operating systems around – and they will dwindle further. That debunks everything here and cuts to the heart of the matter. Microkernels live in a bubble in academia where there are no performance penalties for any of this, or where they can be waved away nebulously as acceptable, and sadly people still live in a strange world where that is the case.
Hi,
Do you have any actual facts to back up your ignorant drivel?
Macro-benchmarks (not biased/worthless micro-benchmarks) have shown L4Linux running normal processes (e.g. compiling something large) to be between 5% and 10% slower than doing exactly the same under Linux.
Sure, 5% to 10% isn’t great, but that’s acceptable for a lot of use cases (e.g. plenty of programmers using managed languages take a 5% to 10% performance loss for the sake of safety and nobody seems to care). However; let’s put this into its proper perspective.
It’s literally a micro-kernel written on a budget by a university (and not something companies have invested billions of $$ in like Linux), running a massive pile of bloat (the full Linux kernel) in user space, and then running processes that were designed for Linux underneath that; and it’s only 5% to 10% slower.
Let’s put this another way. If you ran a copy of the Linux kernel as a process underneath another Linux kernel, how much performance would you expect to lose? Would you be surprised to learn that “Linux under Linux” (User Mode Linux) has been benchmarked and is slower than “Linux under L4” (L4Linux)? That is a fair comparison.
Now imagine what would happen to that 5% to 10% if the processes were actually designed for the micro-kernel (e.g. and were using raw message passing, and not “less well suited” APIs like POSIX that were designed for monolithic OSs), and if you disposed of that massive pile of bloated “Linux kernel middle-ware”. You’d be able to do a fair comparison between “micro-kernel running processes designed for micro-kernel” and “monolithic running processes designed for monolithic”.
Sadly; fair comparisons like this are almost impossible to find. Virtually all of the micro-kernel developers (QNX, L4, Minix, etc) take a completely inappropriate user-space and glue it underneath their kernel. It’s insane. It’d be like designing an entire user-space around message passing, running it under a monolithic OS that was never designed for it (and using something like pipes or sockets to mimic message passing), and then complaining that monolithic OS performance sucks.
– Brendan
In the case of QNX Neutrino they designed the kernel to be a good fit for the POSIX standard, and the amount of glue is tiny. By using the native IPC channels as the file descriptors, read()/write() map directly onto the internal interface. Very clean.
Hi,
This is a little complicated…
A lot of the C/POSIX functions (especially for IO) can block. For example, you call “open()” and your thread has to wait for a result. Under a micro-kernel this might mean your thread sends a message to VFS layer and blocks, then a task switch happens, then VFS layer handles the request and sends a reply that unblocks you, then another task switch happens because your thread can run now.
If you want to open 10 files you get 20 task switches; simply because “open()” (and a whole bunch of other functions) will cause the thread to block. That’s why a lot of micro-kernel research/effort has gone into making synchronous IPC and task switches fast.
What if we say “screw POSIX”?
If we want to open 10 files, we can send 10 messages to the VFS layer, then do one task switch, the VFS layer can handle all of them and send 10 replies, and then we can do one task switch back. Alternatively, you could also just send a single message saying “open this list of 10 files”. Instead of 20 tasks switches it drops to 2 task switches.
But… why stop there? If your thread is running on one CPU and has other work it can do while it’s waiting (or is just “pre-opening” the files for later); and if the VFS layer is running on a different CPU (and handling requests from other processes); then…
Zero task switches.
Of course that’s best case (e.g. it can’t happen for single-CPU) and you’re not going to be that lucky all the time; and not all of the overhead is from task switches. However, you can significantly reduce the number of task switches and significantly reduce the overhead (especially under load), as long as you’re willing to say “screw POSIX”.
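As a rough sketch of what a batched, “screw POSIX” interface might look like (all names hypothetical, continuing the same imaginary IPC primitive from before):
————
#include <stddef.h>

#define OPEN_BATCH_MAX 16

/* One request that asks the VFS service to open several files. */
struct open_batch_request {
    int  type;                       /* e.g. MSG_OPEN_BATCH               */
    int  count;                      /* how many entries are used         */
    char paths[OPEN_BATCH_MAX][256];
    int  flags[OPEN_BATCH_MAX];
};

struct open_batch_reply {
    int count;
    int handles[OPEN_BATCH_MAX];     /* handle, or negative error, per file */
};

/* Same hypothetical blocking IPC primitive as before. Opening ten files
 * is now one request/reply round trip (two task switches on a single
 * CPU) instead of ten round trips (twenty task switches). */
extern int send_receive(int endpoint, const void *req, size_t req_len,
                        void *reply, size_t reply_len);
————
With asynchronous delivery and the VFS on another CPU, even those two switches can disappear, as described above.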
– Brendan
No. Because it isn’t one – as shown by research.
Kind of interested why you think Linux memory management or scheduling is badly done and unfixable. Kernel scheduling is an interest of mine. Any further sources I could look at (aside from pointing me to OpenIndiana, FreeBSD, and Linux source code)?
Windows NT 3.5.1 was the best-designed Windows kernel ever, IMO.
If MS had stuck with the original design, we would never have had a huge majority of all the security issues we see with Windows over the past 20 years.
The irony is, all the security patches and buffer checks MS has put in their operating systems have taken a huge toll performance-wise, and if they had stuck with the original pure microkernel, we would likely have a faster, more efficient OS today.
For those who don’t know, the original Windows NT 3.5 was a pure microkernel, and then in 4.0, MS switched to a hybrid model, where most of the UI processes got moved to the kernel.
The problem with monolithic kernels is everything is in the same address space, any issue with any kernel driver or component can take down the whole system.
And all of the performance issues with the early pure microkernel OS have been resolved.
A problem Windows 3.5.1 had as well. Sure, GDI was in userspace, but that didn’t do much good when 9 out of 10 times a crash in it was unrecoverable. Yeah! The kernel survived the crash! Too bad there is no longer any way to interact with it… And it certainly didn’t help with file systems, or network drivers, or anything else really, since as far as those are concerned 3.5.1 was a monolithic kernel.
Yes and no. This argument is actually covered by the article, where he points out that a microkernel can apply methods to restart processes that have stopped functioning properly.
You already see this principle on modern Windows with (parts of) the graphics driver. I’ve several times managed to crash my Nvidia driver, which in the Windows XP days would have blue screened my computer. Since Vista this just causes a small popup to appear telling me that the graphics usermode driver was restarted.
Of course, for security bugs you’re absolutely right. Except maybe that in kernel space a bug gave you access to *everything*.
Yes. And Windows NT cannot do that with file systems, (which was my specific example). It can do that (as you mention later), to a limited degree, with the user mode portion of the video driver (at least since Vista). However, the ability to do such a thing in a specific subsystem has nothing to do with whether or not the underlying OS is microkernel based…
The kernel mode/user mode split in the WDDM driver model for video drivers is an implementation detail of the driver model, it does not rely on the services of a microkernel to exist. They just moved the parts of the driver that don’t need to be in the kernel into user space. Linux does much the same thing with X, and it certainly isn’t a microkernel.
I have no qualms with microkernels. What I am arguing about is whether or not the kernel in Windows NT was ever a microkernel. It is not, and never was. Simply based on how the kernel implements IPC, it is quite obvious that if an attempt were made to convert it into one utilizing its existing design, it simply would not work (not flexible enough and FAR too slow to be usable).
There is a reason it runs almost everything in kernel mode… It is because it was designed to. The notion of it having been a true microkernel at any point in its history is ridiculous.
I never claimed it was. My graphics driver example was just an obvious case of why it makes all the difference whether the bug appeared in userspace or kernelspace.
My original comment was only meant to contradict the belief held by the poster I was responding to that a uKernel somehow magically makes bugs disappear. All I’m saying is uKernels simply move the bug, they don’t fix it.
I concede that there is a world of difference between a bug in user mode and a bug in kernel mode. But depending on the bug, and the subsystem it affects, in a rather large number of cases such distinctions don’t really matter – because either way the result is “it don’t work”.
Being able to restart a process that doesn’t work isn’t all that useful most of the time. It can be, in specific scenarios, but it is just as likely that the result will be it just ends up getting restarted again. and again. and again. If it is a rather exotic bug that only gets hit once in a blue moon it can make a huge difference, but then again if it is say a hardware interaction bug (a much more common case) it won’t matter at all since it is just going to crash again momentarily.
Then you have the nightmare of a problem of dealing with state restoration in such cases (for example, file system resurrection in Minix)… Yes, uKernel designs make this kind of thing more manageable at least, but it isn’t without complexity costs and performance penalties.
My only point is ultimately the bug has to be fixed. It makes no difference whether it is in user space or not, its still a bug. The benefit of the kernel surviving such a bug induced crash may not always be particularly compelling for the user.
There is, in my opinion, a rather compelling argument to be made that the people working on an OS are better serving their users by finding and fixing bugs than by trying to engineer Rube Goldberg-like contraptions to make them somehow more tolerable. I don’t think all uKernels suffer from this kind of priority inversion, but Minix sure as hell does.
Sure, you could argue that adding this kind of resilience is harder than fixing the bugs. The only problem with this kind of argument is that the crashing driver has a nasty tendency to be 3rd party (like the nvidia driver). Not so trivial. Why do you think Microsoft moved the majority of the graphics driver to usermode?
dpJudas,
While it doesn’t quite “fit” with the rest of win32, MS undoubtedly chose a streaming design for DirectX over more conventional syscall banging for performance reasons. Streams are perfectly suited for microkernels, with no overhead compared to macrokernels.
A side benefit of this design is that it is what makes hardware acceleration efficient. Otherwise, using more conventional syscalls to build hardware streams in the kernel would be slow as heck.
This is true regardless of what API call you’re doing. If you perform a file read syscall for 1 byte at a time the overhead is insane. So you buffer it.
Using command queues like OpenGL/Direct3D does is naturally even more efficient, but it isn’t required any more in a microkernel than in a normal one. There is nothing that mandates that a syscall needs to be asynchronous. The microkernel could just as easily switch execution immediately to the receiving process as it can run things in kernel mode.
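To illustrate the buffering point with plain POSIX read() (nothing microkernel-specific here):
————
#include <unistd.h>

/* One syscall per byte: the user/kernel boundary crossing dominates. */
long read_bytewise(int fd, char *dst, long n)
{
    long i = 0;
    while (i < n && read(fd, dst + i, 1) == 1)   /* n round trips */
        i++;
    return i;
}

/* One syscall per big chunk: the same crossing cost is amortised over
 * thousands of bytes, which is essentially all that buffering does. */
long read_chunked(int fd, char *dst, long n)
{
    long total = 0, got;
    while (total < n && (got = read(fd, dst + total, (size_t)(n - total))) > 0)
        total += got;
    return total;
}
————
A command queue gives the same amortisation for GPU work, and batched messages give it for microkernel IPC.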
dpJudas,
Even if new APIs are not mandated, without them the full potential of microkernels won’t be unlocked. People may laugh at what I’m about to say, but mainframe developers had mastered many of these design concepts. It seems much of that has been forgotten.
It’s not so much forgotten as it is just really cumbersome. To use any of the useful mainframe design, you have to go right down to the assembler and macro libraries. You also have the benefit of dedicated IO processors for hard drives, networking, GPUs and cryptography that you can program directly if you need to.
It also helps that z/Architecture was designed from its early origins to run a hypervisor – namely z/VM, so all the issues of isolation without sacrificing performance are somewhat mitigated (and somewhat easier to implement).
kwan_e,
Sure, but don’t forget that even hardware accelerators need asynchronous/parallel loads to achieve acceleration. If hardware supports a queue depth of 64, but your load is blocked on one or two sequential operations at a time, it’s fundamentally not going to be able to max out the hardware. It’s good that we’re learning to build multithreaded programs to achieve higher degrees of parallelism, but I still think there would be benefit in relearning some of the tricks of the past.
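For example, keeping a deep queue of reads in flight rather than blocking on one at a time (POSIX AIO is used here purely to illustrate the idea; any async submission interface would do):
————
#include <aio.h>
#include <errno.h>
#include <string.h>
#include <sys/types.h>

#define QUEUE_DEPTH 64
#define BLOCK_SIZE  4096

/* Submit QUEUE_DEPTH reads before waiting on any of them, so the
 * device's command queue can stay full. */
void read_with_deep_queue(int fd, char bufs[QUEUE_DEPTH][BLOCK_SIZE])
{
    struct aiocb cbs[QUEUE_DEPTH];
    const struct aiocb *list[QUEUE_DEPTH];

    for (int i = 0; i < QUEUE_DEPTH; i++) {
        memset(&cbs[i], 0, sizeof cbs[i]);
        cbs[i].aio_fildes = fd;
        cbs[i].aio_buf    = bufs[i];
        cbs[i].aio_nbytes = BLOCK_SIZE;
        cbs[i].aio_offset = (off_t)i * BLOCK_SIZE;
        aio_read(&cbs[i]);                 /* submit without blocking */
        list[i] = &cbs[i];
    }

    /* Only now wait; completions may arrive in any order. */
    for (int i = 0; i < QUEUE_DEPTH; i++)
        while (aio_error(&cbs[i]) == EINPROGRESS)
            aio_suspend(list, QUEUE_DEPTH, NULL);
}
————
Blocking after each individual read instead would leave the hardware queue mostly empty, which is the problem described above.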
I actually agree with you completely. I just have a pet peeve about how terms are thrown about. “Pure microkernel” means a very specific thing in academia, and the NT kernel is (and always was) about as far from one as you can get.
That doesn’t in any way make it “bad”. It just isn’t a microkernel. There is far more similarity between NT and Linux when it comes to kernel design than say NT and L4, or NT and QNX.
At least with something like XNU in OSX, you could make the argument that it started out as a uKernel (Mach), but memory isolation and IPC semantics were selectively broken down and removed from most subsystems for the sake of performance. Mach had so many performance problems though that such capitulation for the sake of performance was basically a prerequisite to it being usable – it never really “worked” as a uKernel.
NT is different. It never had such isolation or internal IPC to begin with, and is fundamentally incapable of it (even slowly). It wasn’t ever designed with that usage model in mind. It was originally intended to (and did to a degree) run “personality” processes to implement foreign OS subsystems (OS/2, posix), but it did this in a very crude and incomplete way. Again, Linux can run L4/Fiasco in userspace too, but that doesn’t make Linux a uKernel OS…
You can always move things out of the kernel selectively and gain some robustness advantages. Linux has FUSE file systems, and there is your example with OpenGL drivers. You could certainly call things like this “inspired” by microkernel designs, but they don’t in and of themselves turn your kernel into one…
I’m just saying it is important to call things what they actually are. Words matter. The Windows NT kernel is no closer to being a uKernel than the Linux kernel is, because both contain significant trade offs in their designs which disqualify them from carrying that label.
Actually the NT kernel does communicate with itself through messages:
http://blogs.msdn.com/b/ntdebugging/archive/2007/07/26/lpc-local-pr…
It uses the LPC/ALPC API, which is currently undocumented, though I’m starting to use it in an app we’re developing here.
The NT LPC (local procedure call) messaging system is very similar to Mach messages, which is how the XNU kernel communicates with itself.
The NT implementation is extremely good; based on my initial testing, it’s faster than Mach messages.
The point here is that the NT kernel communicates with itself through messages, and though much of it may be in the same address space, it logically operates very similarly to microkernels, so it is actually most similar to OS X’s XNU kernel; both of them are hybrid kernels.
The Linux kernel is based more on standard function calls, which is one of the reasons why it’s a classic monolithic kernel.
I really wish MS would document the NT LPC/ALPC messaging system; it is used in a large number of their applications such as Skype, and it’s a damned good system.
Yes, it has varying forms of IPC. But none of them are performant enough to facilitate the pervasive use of it as you would find in a uKernel.
I’m not saying it doesn’t use messaging within the kernel. I’m saying it doesn’t have a performant mechanism to do so between kernel mode and user mode. Passing such messages, even using (A)LPC, requires buffer copying, as you said, much the same as Mach. That is fundamentally what is wrong with Mach, and why it never worked as a uKernel. Buffering is simply too slow. Modern uKernels such as L4 and QNX go to great lengths, and make some rather fundamental tradeoffs, to avoid buffering in all but the most extraordinary cases (which is why they actually perform well enough to be competitive).
While in-kernel IPC such as NT uses can be made very fast since you can avoid the buffering, you lose the memory space isolation, which imo is the whole point of a microkernel.
While I do appreciate the other advantages of such a design (modularity, separation of concerns, avoidance of shared data structures, etc.), it still isn’t a microkernel. It’s just a better-engineered monolithic kernel (or “hybrid” if you prefer that term).
The NT kernel most definitively has many microkernel design aspects to it.
Hi,
In general; a micro-kernel sacrifices a small amount of performance (due to communication overheads) in exchange for a large amount of isolation (which allows other things, like better security, more fault tolerance, more flexibility, etc).
The first problem with micro-kernels is that it’s easy to benchmark performance (and easy to see the small disadvantage, and even easier to use micro-benchmarks that exacerbate pathological cases to fool suckers), and impossible to benchmark “isolation” (and extremely hard to see the huge advantage in practical comparisons).
The second problem is that, for any OS, the kernel is just a relatively small piece. The majority of the work is in user-space – things like GUIs, web browsers, office applications, database management engines, etc. Because of this, most “OS developers” end up being kernel developers – the entire user-space is too hard/time consuming so they recycle an existing open source user-space. Existing open source user-space is built on top of APIs (C and C++ standard library, POSIX API, etc) that were designed for monolithic kernels that are “less well suited” to micro-kernels. This makes micro-kernels seem worse (in the same way that monolithic kernels would seem worse if the APIs that user-space is built on were designed for message passing).
Finally; (especially for desktop and server) it’s extremely hard to get market share for any new OS, regardless of what it is and regardless of how awesome it is. You can’t attract many users without applications and drivers, and you can’t attract application and device driver developers without users. This means that the OS market tends to be dominated by whoever was there first, not because the older OSs are better (or because they’re monolithic), but because they’re established products.
Basically; the real problems with micro-kernels are not technical problems, they’re marketing problems. This is why micro-kernels have only been successful in areas where marketing has less influence (e.g. embedded systems) and aren’t used for desktop/server.
Ironically; this is insane/backwards. For embedded systems (where the total amount of code is small, you don’t need to care about third-party software, and performance and size matters) monolithic kernels should be superior (but mostly aren’t used); and for desktop/server (where there’s no resource constraints, third-party software is unavoidable, and all the performance is mostly wasted due to Wirth’s law anyway) micro-kernels should be superior (but mostly aren’t used).
– Brendan
yeap, correct again
No actually. ‘Security’ is brought up by microkernel proponents because it is simply all that they have. The security examples cited are always incredibly hypothetical and would basically end up in an unusable kernel.
The differences are real, and there are advantages to both. But the article really jumped on the crazy train when it started hypothesizing about a successful Hurd in the 90’s. It’s kind of like saying how great a Jedi I would be if this were actually the Star Wars universe. (BTW, I’d be one of the worst Jedi masters.)
Hurd was/is insanely more complex than just a microkernel. Which is why it’s still not really ready, and basically everyone ditched the theoretical benefits for the practical benefits of the working Linux kernel.
I see a lot of stuff suggesting that Linus got it wrong and Andrew was right all along …. just a continuation of the same discussion from 20 years ago.
It’s interesting, but that’s all.
If and when someone writes a microkernel OS that displaces Linux it’ll be significant. Until then it’s just a difference of opinion.
I’m sticking with safer topics like Politics and Religion.
I’m guessing you’re not buying the article’s premise?
I’d agree, but obviously “slow” is a relative term, and the niches that microkernels live in are different from the ones macros live in, making them difficult to compare. And really, it’s tough enough to benchmark operating systems designed for the same workloads and sharing substantial parts (Ubuntu vs RHEL, for example).
All I want is a 64-bit micro-kernel SASOS with a strong whiff of Amiga (yeah, die-hard Amiga dude) plus a versioning filesystem.
And some processor support for better sharing and protecting memory without flushing caches.
World peace might as well come sooner…
Exactly my dream as well…
Is there an elegant (fast) way to make a SASOS secure?
I think you would need MMU support for multiple IDs at once. As in, a page has one ID and a process has a number of IDs associated with it that identify which pages it can access.
A context switch changes the current ID associations and the need to flush the cache goes away.
If you can associate a process with only 1 ID then you are basically doing exactly the same as you do today (except you can stop flushing the cache).
As with so many other CS topics (“0, 1, or infinity”), it isn’t obvious how to handle this once you go past 0 or 1 IDs. A fixed number of registers will always be a headache. You end up with something like another cache handler in hw or sw.
Possibly the protection rings principle could be used, or maybe ranges could offer something.
Anyone know some other approaches to read-sharing/read-write-sharing/handover (for either pages or address ranges, and for frequent and many uses at once preferably)?
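A software model of that idea, just to pin the concept down (a thought experiment, not any real MMU’s interface):
————
#include <stdbool.h>
#include <stdint.h>

#define MAX_DOMAINS_PER_PROCESS 8

/* Every page in the single address space carries one protection-domain
 * ID; each process holds a small set of IDs it is allowed to touch. */
struct page_info {
    uint64_t base;
    uint16_t domain_id;
};

struct process {
    uint16_t domains[MAX_DOMAINS_PER_PROCESS];
    int      ndomains;
};

/* The check the MMU would perform on every access. A context switch
 * only swaps the active domain set; nothing needs to be flushed,
 * because the virtual-to-physical mappings never change. */
static bool can_access(const struct process *p, const struct page_info *pg)
{
    for (int i = 0; i < p->ndomains; i++)
        if (p->domains[i] == pg->domain_id)
            return true;
    return false;
}
————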
Itanium systems are pretty cheap on ebay nowadays =)
http://www.bottomupcs.com/hardware_support_for_virtual_memory.html
https://en.m.wikipedia.org/wiki/Quark_(kernel)
The article is full of strong opinions but also some really interesting facts. It also illustrates the apodictic statements that the best is not always the most successful and that there is no right solution for all use cases. See x86 – it’s really far away from being the best CPU architecture but managed to gain broad acceptance anyway. Sometimes the best also needs some time to gain that acceptance. See parallel programming: it needs a certain kind of thinking, and I know programmers who still aren’t able to handle concurrency.