RIT DirectoriesRIT Site IndexSearch
Research Computing Banner
 
Condor1000 Project
 
 

(see article in RIT News&Events Feb. 7, 2008)

On December 21st, 2007 Research Computing achieved its goal of harnessing the power of 1000 processors by tapping into the unused computing cycles of hundreds of computers at RIT. Computing power is being utilized in the Center for Imaging Science, Computer Science, Research Computing, Information Technology Services, Information Technology, Math, Electrical Engineering, Mechanical Engineering, Manufacturing and Mechanical Engineering Technology, and Liberal Arts. And more will be joining the pool in coming weeks.

Multiplying the power of a single desktop computer by 1000 or more is a tremendous increase in personal computing capacity and extends the boundaries of research problems that can be computed. Essentially, this is a campus computational grid. In total, the current Condor system consists of more than 1.5 terabytes of memory, greater than 67 terabytes of disk space, and about .75 teraflops of computing power.


Condor Clients by Operating System

What is Condor?

Condor is a project of the University of Wisconsin. It is what is known as High Throughput Computing, that is, the goal is to run as many jobs as possible at the same time. The easiest way to do this is to tap into all the computers on campus when they aren't being used. Condor senses when the computer is free according the policy on that particular machine (the policy is set by the owner of each machine. e.g., "only use for condor jobs between 11pm and 6am", etc.). The harvesting of these unused cycles ensures that the investment in all that computer hardware is utilized to the maximum extent possible.

Efficiency is not the only benefit from Condor. Researchers benefit by getting access to a huge number of computer processors at essentially little or no cost. Certain types of computer problems are ideal for the Condor environment. These jobs are ones that can be separated into hundreds or thousands of little jobs that run independently of one another but whose results can be aggregated at the end. An example of this is image processing or video. Some analysis requires the processing of each individual pixel of an image, or of a single frame of a movie. Each of these individual jobs can be run separately and the results brought together at the conclusion of the job.

Condor at RIT - where did it come from?

In 1999 researchers in the Center for Imaging Science needed more processing power to analyze images with high resolution and hyperspectral bands using physics-based models. Rolando Raqueno of the Digital Imaging and Remote Sensing (DIRS) laboratory and Bob Krzaczek of the Laboratory for Imaging Algorithms and Systems (LIAS) began to use Condor to meet these needs. A few years later, James Craig of Computer Science started another condor cluster or "flock" and joined it to the CIS flock.

In 2006, Rick Bohn (Research Computing) started to help support the Condor flock of workstations in Imaging Science. By the summer of 2007, Paul Mezzanini (Research Computing) started seriously deploying Condor on the 100 cpu high performance cluster and the possibility of managing 1000 condor clients became an achievable goal. In the Fall of 2007, Bill Hoagland (CIS/LIAS) developed new installers and Brent Strong (ITS/DSS) and Jeremy Sieminski (ITS/DSS) installed Condor on Macintosh computers in the Math Department. And by the end of 2007, over 1000 processors were available on Windows, Macintosh, Solaris, and linux computers among all the flocks on campus spread across eight departments. Condor1000 was a reality.

According to Raqueno, "The dissertation research of recent Ph.D. graduate, Captain Michael Foster (USAF), using hyperspectral imagery fused with simulated airborne LIDAR data was a prime example of a problem that could not have been solved in time without CONDOR."

...and where is it going? Grid Computing

With the success of the Condor1000 Project, RIT can not only support more researchers on campus who will utilize the system, but RIT can now begin to experiment with connecting its system with systems at other institutions to create pools of thousands and tens of thousands of processors in size. The advantage of such a large pool is threefold - larger jobs can be run, more users can run at the same time, and backup and redundancy are available, should there be any outages on campus. This is the benefit of a grid of computers: a shared cyberinfrastructure that increases efficiency, scope and productivity. RIT's membership in the statewide NYSGrid and the worldwide Open Science Grid will help enable such a scenario.

current Condor pool stats
Condor Wiki article