HPC-Colony Project

Adaptive System Software For Improved Resiliency and Performance

Overview	Goals	Accomplishments	FAQ	News	Participants	Publications	Links	Internal Page

Frequently Asked Questions (FAQ)

Why Linux?

Linux offers a very complete system call set which is vital for application portability. Furthermore, as an open source operating system Linux provides easy access to source code (useful for understanding details). If necessary, open source Linux may be modified for custom needs. The Colony project is developing a Linux kernel suitable for very large node count parallel applications.

What is Parallel Aware Scheduling?

What are Virtualized Processors?

Charm++ and Adaptive MPI are based on the notion of virtual processors: progammers target their program to a large number of (logical or virtualized) "processors", independent of the number of physical processors, and the runtime assigns many virtualized processors (VPs) to physical processors. The programmer chooses the number of virtualized processors used based on application-structure considerations, while keeping the amount of work per VP above a minimum threshold to limit the effects of runtime system overhead. In this way, the programmer is free to pursue the problem independent of the number of processors and effective parallel progamming is made much simpler. Each VP may just be a C++-style object, but to program MPI and other applications, a user-level thread can be embedded in such an object. These user level threads are extremely lightweight and migratable across processors, and are called "Virtualized Processor Threads". This idea of virtualization is distinct from "Virtual OS".

What motivated the name HPC-Colony?

Colony can refer to a very large group of penguins (pride of lions, gaggle of geese, colony of penguins, ...); it was chosen as a wordplay on Linux's mascot -- Tux the penguin.