<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-4890039722003127341</id><updated>2011-08-15T22:07:46.969-04:00</updated><category term='protomol'/><category term='data mining'/><category term='condor'/><category term='elections'/><category term='mpi'/><category term='fault tolerance'/><category term='eucalyptus'/><category term='wavefront'/><category term='bioinformatics'/><category term='hadoop'/><category term='green cloud'/><category term='troubleshooting'/><category term='processes'/><category term='makeflow'/><category term='amazon'/><category term='bxgrid'/><category term='grid heating'/><category term='biocompute'/><category term='parallel'/><category term='map-reduce'/><category term='classify'/><category term='work queue'/><category term='roars'/><category term='linux'/><category term='ethernet'/><category term='elastic applications'/><category term='visualizing'/><category term='threads'/><category term='cloud computing'/><category term='java'/><category term='chirp'/><category term='cyclecomputing'/><category term='log file'/><category term='genome assembly'/><category term='enavis'/><category term='multicore'/><category term='distributed computing'/><category term='weaver'/><category term='cluster computing'/><category term='parrot'/><category term='linking'/><category term='abstractions'/><category term='grid computing'/><category term='replica exchange'/><category term='turtles'/><category term='allpairs'/><category term='virtual machines'/><category term='assembly language'/><category term='lockdown'/><category term='audit trail'/><category term='web browsers'/><title type='text'>Prof. Douglas Thain</title><subtitle type='html'>Reflections on distributed computing in clusters, clouds, and grids.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>29</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-5159751502323873754</id><published>2010-11-16T13:00:00.003-05:00</published><updated>2010-11-16T14:02:38.174-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='virtual machines'/><title type='text'>The Virtualization Theorem Ignored for Three Decades</title><summary type='text'>Today, in my graduate operating systems class, we discussed what I believe is the most important result in computer science ever to be persistently ignored:Popek and Goldberg, Formal Requirements for Virtualizible Third Generation Architectures, Communications of the ACM, Volume 17, Issue 7, July 1974.This paper puts forth a very simple principle that must be observed in order for a CPU to be </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/5159751502323873754/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/11/today-in-my-graduate-operating-systems.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5159751502323873754'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5159751502323873754'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/11/today-in-my-graduate-operating-systems.html' title='The Virtualization Theorem Ignored for Three Decades'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-411520610055353975</id><published>2010-11-08T11:00:00.004-05:00</published><updated>2010-11-08T11:00:03.381-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='work queue'/><category scheme='http://www.blogger.com/atom/ns#' term='makeflow'/><category scheme='http://www.blogger.com/atom/ns#' term='bxgrid'/><category scheme='http://www.blogger.com/atom/ns#' term='weaver'/><category scheme='http://www.blogger.com/atom/ns#' term='parrot'/><category scheme='http://www.blogger.com/atom/ns#' term='chirp'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Sometimes It All Comes Together</title><summary type='text'>Most days, software engineering involves compromises and imperfect solutions. It's rare for two pieces of software to mesh perfectly -- you always have to work to overcome the limitations or assumptions present in different modules. But, every once in a while, the pieces just come together in a satisfying way.A few weeks back, we ran into a problem with BXGrid, our system for managing biometric </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/411520610055353975/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/11/sometimes-it-all-comes-together.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/411520610055353975'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/411520610055353975'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/11/sometimes-it-all-comes-together.html' title='Sometimes It All Comes Together'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/TMiMhFYdhhI/AAAAAAAAAHk/P45oG6hYAEk/s72-c/everything.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7295928271229878094</id><published>2010-11-01T11:00:00.005-04:00</published><updated>2010-11-01T11:00:01.570-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='makeflow'/><category scheme='http://www.blogger.com/atom/ns#' term='weaver'/><category scheme='http://www.blogger.com/atom/ns#' term='allpairs'/><title type='text'>Compiling Workflows with Weaver</title><summary type='text'>Over the last year, our Makeflow system has become quite popular here at Notre Dame.  Briefly, Makeflow takes a workload expressed in the plain old Make format, and executes it in a distributed system, using the dependency information to set up the appropriate remote execution environment.  It does not require a distributed filesystem, so it's easy to get your applications going on hundreds to </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7295928271229878094/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/11/compiling-workflows-with-weaver.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7295928271229878094'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7295928271229878094'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/11/compiling-workflows-with-weaver.html' title='Compiling Workflows with Weaver'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-6392473168151819932</id><published>2010-10-27T11:00:00.002-04:00</published><updated>2010-10-27T11:14:40.106-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bxgrid'/><category scheme='http://www.blogger.com/atom/ns#' term='roars'/><category scheme='http://www.blogger.com/atom/ns#' term='parrot'/><title type='text'>From Database to Filesystem and Back Again</title><summary type='text'>Hoang Bui is leading the development of ROARS: a Rich Object Archival System, which is our generalization many of the ideas expressed in the Biometrics Research Grid.  Hoang presented a paper on ROARS at the workshop on Data Intensive Distributed Computing earlier this year.What makes ROARS particularly interesting is that it combines elements of both relational databases and file systems, and </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/6392473168151819932/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/10/from-database-to-filesystem-and-back.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/6392473168151819932'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/6392473168151819932'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/10/from-database-to-filesystem-and-back.html' title='From Database to Filesystem and Back Again'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/TMg4SIAfgOI/AAAAAAAAAHc/4b8x7Mck_14/s72-c/iris.jpeg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7601453597021998855</id><published>2010-10-18T12:04:00.017-04:00</published><updated>2010-10-19T15:12:36.338-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='work queue'/><category scheme='http://www.blogger.com/atom/ns#' term='elastic applications'/><category scheme='http://www.blogger.com/atom/ns#' term='mpi'/><category scheme='http://www.blogger.com/atom/ns#' term='protomol'/><category scheme='http://www.blogger.com/atom/ns#' term='replica exchange'/><title type='text'>Summer REU: Toward Elastic Scientific Applications</title><summary type='text'>In recent months, we have been working on the problem of building elastic parallel applications that can adapt to the available resources at run-time. Much has been written about elastic internet services, but scientific applications have a ways to catch up.Traditional parallel applications are rigid: the user chooses how many nodes (or cores or CPUs) to use when the program starts. If more </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7601453597021998855/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/10/summer-reu-toward-elastic-scientific.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7601453597021998855'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7601453597021998855'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/10/summer-reu-toward-elastic-scientific.html' title='Summer REU: Toward Elastic Scientific Applications'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/TL3sUA2-M4I/AAAAAAAAAHM/q8wNh_Es1Zs/s72-c/wq.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-3974063046436912525</id><published>2010-04-05T12:45:00.000-04:00</published><updated>2010-04-05T12:47:14.415-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='java'/><category scheme='http://www.blogger.com/atom/ns#' term='virtual machines'/><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>The Forty Tribes of Linux</title><summary type='text'>As I have noted in this column before, a perennial challenge of distributed computing in the real world is dealing with the multiplicity of operating systems and related environments.  If you are dealing with an uncontrolled environment like a large university or an 'at home' computing environment, there is no telling what you are going to get.    If you have a piece of software that depends </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/3974063046436912525/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/04/forty-tribes-of-linux.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3974063046436912525'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3974063046436912525'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/04/forty-tribes-of-linux.html' title='The Forty Tribes of Linux'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-8565097235158460588</id><published>2010-01-21T13:37:00.001-05:00</published><updated>2010-01-21T13:39:06.906-05:00</updated><title type='text'>Summer REU at Notre Dame</title><summary type='text'>We invite outstanding undergraduates to apply for summer researchpositions in scientific and cloud computing at the University of Notre Dame.Students will build and operate systems that harness hundreds ofmachines at once to attack large problems in science and engineering.Research topics include:Green Cloud Computing Portals for Scientific Research Languages for Distributed Computing  More </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/8565097235158460588/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/01/we-invite-outstanding-undergraduates-to.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/8565097235158460588'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/8565097235158460588'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/01/we-invite-outstanding-undergraduates-to.html' title='Summer REU at Notre Dame'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7489342411717063766</id><published>2010-01-11T11:07:00.006-05:00</published><updated>2010-01-11T12:29:09.141-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='genome assembly'/><category scheme='http://www.blogger.com/atom/ns#' term='green cloud'/><category scheme='http://www.blogger.com/atom/ns#' term='grid heating'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Green Cloud Online</title><summary type='text'>The Green Cloud is now online!The Green Cloud is the invention of Dr. Paul Brenner at the ND Center for Research Computing.  It is a containerized data center located at the South Bend city greenhouse, stocked with used servers kindly donated by Ebay, Inc.  The first batch of machines was installed in December, and will eventually reach about 400 cores once everything is turned on.What makes the </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7489342411717063766/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2010/01/green-cloud-online.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7489342411717063766'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7489342411717063766'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2010/01/green-cloud-online.html' title='Green Cloud Online'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/S0tfbiB4qlI/AAAAAAAAAGw/mrjkO22ExM8/s72-c/greencloud.gif' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7219908666150046094</id><published>2009-10-08T09:00:00.000-04:00</published><updated>2009-10-08T09:00:04.815-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='processes'/><category scheme='http://www.blogger.com/atom/ns#' term='threads'/><category scheme='http://www.blogger.com/atom/ns#' term='web browsers'/><title type='text'>On Programming With Processes, Part II</title><summary type='text'>One of the biggest challenges in building computer systems is finding a way to make things simpler. Any propeller-head can make a piece of software more complicated. Unfortunately, our industry seems to have a way of gravitating toward the complex. Let's look at the current state of the web browsers -- pick any one -- which seem to insist upon reimplementing or otherwise abusing the operating </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7219908666150046094/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/10/on-programming-with-processes-part-ii.html#comment-form' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7219908666150046094'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7219908666150046094'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/10/on-programming-with-processes-part-ii.html' title='On Programming With Processes, Part II'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_0DkxYiRVGyA/Sl01KahQkoI/AAAAAAAAAEk/T0PkkQ0mGsQ/s72-c/tabbed-browsing.gif' height='72' width='72'/><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-1203856853642904517</id><published>2009-10-01T23:00:00.001-04:00</published><updated>2009-10-01T23:08:47.991-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='grid computing'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='parrot'/><category scheme='http://www.blogger.com/atom/ns#' term='hadoop'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Partly Cloudy with a Chance of Condor</title><summary type='text'> We have been thinking about cloud computing quite a bit over the last month. As I noted earlier, cloud computing is hardly a new idea, but it does add a few new twists on some old concepts in distributed systems. So, we are spending some time to understand how we can take our existing big applications and make them work with cloud systems and software. It should come as no surprise that there </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/1203856853642904517/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/10/partly-cloudy-with-chance-of-condor.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/1203856853642904517'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/1203856853642904517'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/10/partly-cloudy-with-chance-of-condor.html' title='Partly Cloudy with a Chance of Condor'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/SsTBDmxJ0hI/AAAAAAAAAGM/xdaL3oWeDdY/s72-c/cloud-grid1.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-493069942866277040</id><published>2009-08-03T09:00:00.001-04:00</published><updated>2009-08-03T09:00:05.393-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='makeflow'/><category scheme='http://www.blogger.com/atom/ns#' term='bxgrid'/><category scheme='http://www.blogger.com/atom/ns#' term='allpairs'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>REU Project: BXGrid</title><summary type='text'>This post continues last week's subject of summer REU projects.Rachel Witty and Kameron Srimoungchanh worked on BXGrid, our web portal and computing system for biometrics research. This project is a collaboration between the Cooperative Computing Lab and the Computer Vision Research Lab at Notre Dame. Hoang Bui is the lead graduate student on the project.  Rachel and Kameron added a bunch of new </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/493069942866277040/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/08/reu-project-bxgrid.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/493069942866277040'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/493069942866277040'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/08/reu-project-bxgrid.html' title='REU Project: BXGrid'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/Sm8lIDgQaiI/AAAAAAAAAF8/ZqScRIGpv-w/s72-c/bxgrid1.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-3776419080269520338</id><published>2009-07-28T12:01:00.011-04:00</published><updated>2009-07-29T13:21:52.091-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='makeflow'/><category scheme='http://www.blogger.com/atom/ns#' term='bxgrid'/><category scheme='http://www.blogger.com/atom/ns#' term='biocompute'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>REU Project: Biocompute</title><summary type='text'>This summer, we hosted four REU students who contributed to two web portals for distributed computing: Biocompute and BXGrid. I'll write about one this week and the other next week.REU students Ryan Jansen and Joey Rich worked with recent grad Rory Carmichael on Biocompute, our web portal and computing system for bioinformatics research. Biocompute was originally created by Patrick Braga-Henebry </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/3776419080269520338/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/07/reu-project-biocompute.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3776419080269520338'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3776419080269520338'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/07/reu-project-biocompute.html' title='REU Project: Biocompute'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/Sm8ijomwliI/AAAAAAAAAFc/xHuKqk4zlfA/s72-c/biocompute1.gif' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-798641116368187178</id><published>2009-07-03T12:55:00.006-04:00</published><updated>2009-07-03T14:32:54.923-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='makeflow'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Make as an Abstraction for Distributed Computing</title><summary type='text'>In previous articles, I have introduced the idea of abstractions for distributed computing. An abstraction is a way of specifying a large amount of work in a way that makes it possible to be distributed across a large computing system. All of the abstractions I have discussed so far have a compact, regular structure. However, many people have large workloads that do not have a regular structure. </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/798641116368187178/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/07/make-as-abstraction-for-distributed.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/798641116368187178'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/798641116368187178'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/07/make-as-abstraction-for-distributed.html' title='Make as an Abstraction for Distributed Computing'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-8677737011622119141</id><published>2009-06-02T11:33:00.002-04:00</published><updated>2009-06-02T11:38:14.576-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='grid computing'/><category scheme='http://www.blogger.com/atom/ns#' term='grid heating'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Grid Heating: Putting Data Center Heat to Productive Use</title><summary type='text'>Dr. Paul Brenner, a research scientist in the Computing Research Center at the University Notre Dame, has been advocating a novel idea called grid heating. He recently won a "Green IT Award" from the Uptime Institute for his work. Here is a short introduction  to the idea:Around the world, large data centers consume enormous amounts of power. In addition to the energy needed to spin disks and </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/8677737011622119141/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/06/grid-heating-putting-data-center-heat.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/8677737011622119141'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/8677737011622119141'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/06/grid-heating-putting-data-center-heat.html' title='Grid Heating: Putting Data Center Heat to Productive Use'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_0DkxYiRVGyA/SiVGAoK7RPI/AAAAAAAAAEM/RJGYRVYAwG0/s72-c/Picture+004.jpg' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7534852785967399482</id><published>2009-05-29T16:00:00.000-04:00</published><updated>2009-05-29T16:35:20.683-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='linking'/><title type='text'>Dynamic Linking and Distributed Computing Don't Mix</title><summary type='text'>Dynamic linking is one of the more frustrating aspects of distributed computing in the real world. It's is the sort of technology that is meant to optimize the computer's happiness at the expense of the end user's sanity. Dynamic linking should really be avoided, except in a few very specific cases outlined below.For those of you who don't remember, here is a brief primer on linking:Back in the </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7534852785967399482/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/05/dynamic-linking-and-distributed.html#comment-form' title='10 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7534852785967399482'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7534852785967399482'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/05/dynamic-linking-and-distributed.html' title='Dynamic Linking and Distributed Computing Don&apos;t Mix'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/SiBFb4gMpHI/AAAAAAAAAD0/S-lgJ6yYmuM/s72-c/static-linking.gif' height='72' width='72'/><thr:total>10</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-3973138414778176502</id><published>2009-04-14T10:16:00.013-04:00</published><updated>2009-04-14T12:19:13.872-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='genome assembly'/><category scheme='http://www.blogger.com/atom/ns#' term='bioinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><title type='text'>Distributed Genome Assembly on 1000 Computers</title><summary type='text'>Lately, my research group has been collaborating with Prof. Scott Emrich on several problems in bioinformatics. Our students Chris Moretti and Mike Olson have been building a system for carrying out whole-genome assembly problems on campus grids. They recently got it scaled up to run on nearly 1000 nodes spread across Notre Dame, Purdue, and Wisconsin, making the problem complete in a few hours </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/3973138414778176502/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/04/distributed-genome-assembly-on-1000.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3973138414778176502'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3973138414778176502'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/04/distributed-genome-assembly-on-1000.html' title='Distributed Genome Assembly on 1000 Computers'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_0DkxYiRVGyA/SeS1x4VAGNI/AAAAAAAAADs/UHBM6TL9Ql4/s72-c/assembly.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7930297254819419960</id><published>2009-02-25T09:54:00.006-05:00</published><updated>2009-03-13T11:13:13.146-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='processes'/><category scheme='http://www.blogger.com/atom/ns#' term='threads'/><category scheme='http://www.blogger.com/atom/ns#' term='parallel'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><title type='text'>On Parallel Programming with Processes</title><summary type='text'>About once a week, a well-meaning person stops by my office to ask a question like this:I need to run about 1000 simulations that take about an hour each.  I can't wait a thousand hours for the results, so I need to parallelize my simulation.  So, should I re-write my application using threads, MPI, or something else?For some reason, they are always disappointed by my response:Just run multiple </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7930297254819419960/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/02/on-parallel-programming-with-processes.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7930297254819419960'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7930297254819419960'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/02/on-parallel-programming-with-processes.html' title='On Parallel Programming with Processes'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-4767548460398677880</id><published>2009-02-21T13:41:00.011-05:00</published><updated>2009-02-24T16:53:50.833-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='ethernet'/><category scheme='http://www.blogger.com/atom/ns#' term='fault tolerance'/><title type='text'>Exponential Backoff in Distributed Systems</title><summary type='text'>In response to my previous article, a commenter asked:Why exponential backoff? To put a finer point on the question, How should I choose the parameters for my exponential backoff algorithm? I think many people choose parameters that back off too much, too fast.The idea of exponential backoff in distributed systems goes back quite a few years. An early example can be found in the Ethernet network.</summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/4767548460398677880/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/02/exponential-backoff-in-distributed.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/4767548460398677880'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/4767548460398677880'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/02/exponential-backoff-in-distributed.html' title='Exponential Backoff in Distributed Systems'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/SaIUoVsKoqI/AAAAAAAAAC0/APiyDxuzvzs/s72-c/ethernet.gif' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-2970849652351394924</id><published>2009-02-08T13:32:00.006-05:00</published><updated>2009-02-21T13:40:44.085-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='fault tolerance'/><category scheme='http://www.blogger.com/atom/ns#' term='troubleshooting'/><category scheme='http://www.blogger.com/atom/ns#' term='wavefront'/><title type='text'>Fail Fast, Fail Often</title><summary type='text'>A common misconception among programmers is that software should always attempt to hide failures in distributed systems. This idea seems sensible at first, because distributed systems are full of failures of all kinds: machines crash, software fails, and networks go down, just to name a few. If I am writing a function called transfer_file() which copies a file from one place to another, then I </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/2970849652351394924/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/02/fail-fast-fail-often.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/2970849652351394924'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/2970849652351394924'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/02/fail-fast-fail-often.html' title='Fail Fast, Fail Often'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-7200134489053686249</id><published>2009-01-14T09:19:00.013-05:00</published><updated>2009-01-14T16:33:36.656-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='troubleshooting'/><category scheme='http://www.blogger.com/atom/ns#' term='audit trail'/><category scheme='http://www.blogger.com/atom/ns#' term='elections'/><category scheme='http://www.blogger.com/atom/ns#' term='log file'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Audit Trails in Voting Machines</title><summary type='text'>Kim Zetter at Wired magazine recently wrote about the use of log files in electronic voting machines. (It actually shows snippets of the relevant data, which is a refreshing use of primary evidence in journalism.) The article illustrates an often overlooked rule of software engineering:A DEBUG FILE IS NOT THE SAME THING AS AN AUDIT TRAIL.Here is my rough guess at what happened: Political forces </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/7200134489053686249/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2009/01/audit-trails-in-election-computers.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7200134489053686249'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/7200134489053686249'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2009/01/audit-trails-in-election-computers.html' title='Audit Trails in Voting Machines'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-4641725232459931599</id><published>2008-12-29T11:55:00.015-05:00</published><updated>2009-01-05T09:34:52.820-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='bxgrid'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='allpairs'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><title type='text'>BXGrid: The Biometrics Research Grid</title><summary type='text'>One of our graduate students, Hoang Bui, presented a poster on the Biometrics Research Grid (BXGrid) at the IEEE e-Science conference in Indianapolis a few weeks ago. BXGrid is a large data repository that we have built to support both production research in biometrics and to provide a platform for research in large scale data intensive computing. It provides another example of the idea of </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/4641725232459931599/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/12/bxgrid-biometrics-research-grid.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/4641725232459931599'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/4641725232459931599'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/12/bxgrid-biometrics-research-grid.html' title='BXGrid: The Biometrics Research Grid'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_0DkxYiRVGyA/SVkGEv-N9rI/AAAAAAAAACc/bijdCogTY8Q/s72-c/bxgrid-atlas.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-5106482842130544480</id><published>2008-12-11T13:49:00.010-05:00</published><updated>2008-12-12T10:30:05.780-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='grid computing'/><category scheme='http://www.blogger.com/atom/ns#' term='cyclecomputing'/><category scheme='http://www.blogger.com/atom/ns#' term='amazon'/><category scheme='http://www.blogger.com/atom/ns#' term='eucalyptus'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Abstractions, Grids, and Clouds at IEEE e-Science 2008</title><summary type='text'>I just attended the IEEE conference on e-Science in Indianapolis, and gave this talk on harnessing distributed systems with high level abstractions.Another highlight of the conference was Rich Wolski's talk on Eucalyptus, an open source toolkit for cloud computing.   Like Nimbus, it  is API compatible with Amazon's EC2. That is, if your code works with Amazon, you can install Nimbus or Eucalyptus</summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/5106482842130544480/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/12/abstractions-grids-and-clouds-at-ieee-e.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5106482842130544480'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5106482842130544480'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/12/abstractions-grids-and-clouds-at-ieee-e.html' title='Abstractions, Grids, and Clouds at IEEE e-Science 2008'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-3132379546030546811</id><published>2008-12-09T12:13:00.009-05:00</published><updated>2008-12-09T15:08:24.508-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='map-reduce'/><category scheme='http://www.blogger.com/atom/ns#' term='visualizing'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='chirp'/><category scheme='http://www.blogger.com/atom/ns#' term='hadoop'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Visualizing Clusters in Real Time</title><summary type='text'>The end of the semester is nearing, so activity in our distributed system really shoots up as undergraduates finish their semester projects and graduate students hurry to generate those last few research results. You can see this activity reflected in a number of visual displays that we have created to track activity in the system.For example, the following is a snapshot of an applet that </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/3132379546030546811/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/12/visualizing-clusters-in-real-time.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3132379546030546811'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/3132379546030546811'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/12/visualizing-clusters-in-real-time.html' title='Visualizing Clusters in Real Time'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/ST6oKaQCUPI/AAAAAAAAABE/2mKNrFOmqkw/s72-c/viz3.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-409373884748252915</id><published>2008-11-30T17:00:00.000-05:00</published><updated>2008-12-02T14:41:43.417-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='visualizing'/><category scheme='http://www.blogger.com/atom/ns#' term='lockdown'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='enavis'/><title type='text'>Visualizing a Large Distributed System with Enavis</title><summary type='text'>Two students at Notre Dame, Qi Liao and Andrew Blaich, recently received the Best Paper award at USENIX LISA for their work on Enavis, a tool that gives a visual display of network traffic collected by the Lockdown network administration tool. Enavis gives the administrator of a large network a way to browse all of the users, programs, hosts, and network connections in a system of hundreds or </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/409373884748252915/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/11/visualizing-large-distributed-system.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/409373884748252915'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/409373884748252915'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/11/visualizing-large-distributed-system.html' title='Visualizing a Large Distributed System with Enavis'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-733698386468438871</id><published>2008-11-13T09:18:00.013-05:00</published><updated>2008-11-17T15:06:42.101-05:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='multicore'/><category scheme='http://www.blogger.com/atom/ns#' term='wavefront'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>The Wavefront Abstraction</title><summary type='text'>This is the third in a series of posts on the idea of abstractions for distributed computing on clusters, clouds, and grids. An abstraction is a simple interface that allows you to scale up well-structured problems to run on hundreds or thousands of computers at once.The Wavefront abstraction came up in a discussion with several economists.  You want to compute a recurrence relation where each </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/733698386468438871/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/11/wavefront-abstraction.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/733698386468438871'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/733698386468438871'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/11/wavefront-abstraction.html' title='The Wavefront Abstraction'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/SSHCEDk4_JI/AAAAAAAAAA8/1KTujgkWWCA/s72-c/abstr-wavefront.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-5783855652288661211</id><published>2008-10-31T16:58:00.007-04:00</published><updated>2008-10-31T17:47:16.832-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='map-reduce'/><category scheme='http://www.blogger.com/atom/ns#' term='classify'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><category scheme='http://www.blogger.com/atom/ns#' term='chirp'/><category scheme='http://www.blogger.com/atom/ns#' term='hadoop'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>An Abstraction for Ensemble Classifiers</title><summary type='text'>In the last post, I presented the idea of abstractions for distributed computing, and explained the All-Pairs abstraction, which represents a very large Cartesian product. Of course, a single abstraction is meant to address a very focused kind of workload. If you have a different category of problem, then you need another abstraction.We discovered another abstraction while working with Nitesh </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/5783855652288661211/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/10/abstraction-for-ensemble-classifiers.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5783855652288661211'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/5783855652288661211'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/10/abstraction-for-ensemble-classifiers.html' title='An Abstraction for Ensemble Classifiers'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_0DkxYiRVGyA/SQt5XwLjCoI/AAAAAAAAAAs/CqjG2RMpveM/s72-c/abstr-classify.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-318793440950081328</id><published>2008-10-22T12:37:00.013-04:00</published><updated>2008-10-23T12:04:00.792-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='assembly language'/><category scheme='http://www.blogger.com/atom/ns#' term='allpairs'/><category scheme='http://www.blogger.com/atom/ns#' term='abstractions'/><category scheme='http://www.blogger.com/atom/ns#' term='condor'/><title type='text'>Abstractions for Distributed Computing</title><summary type='text'>My current research revolves around the idea of abstractions for distributed computing. An abstraction is a way of simplifying a workload that runs on thousands of machines, in much the same way that a high level language simplifies the tiresome process of programming in assembly language. Let me explain a little more.Real assembly language has operations like this:MOV memory to registerPUSH </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/318793440950081328/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/10/abstractions-for-distributed-computing.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/318793440950081328'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/318793440950081328'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/10/abstractions-for-distributed-computing.html' title='Abstractions for Distributed Computing'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-870541340092539328</id><published>2008-10-06T10:24:00.001-04:00</published><updated>2008-10-06T16:34:23.144-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='troubleshooting'/><category scheme='http://www.blogger.com/atom/ns#' term='grid computing'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='data mining'/><title type='text'>Troubleshooting Distributed Systems via Data Mining</title><summary type='text'>One of our students, David Cieslak, just presented this paper on troubleshooting large distributed systems at the IEEE Grid Computing Conference in Japan.  Here's the situation:You have a million jobs to run.You submit them to a system of thousands of CPUs.Half of them complete, and half of them fail.Now what do you do?  It's hopeless to debug any single failure, because you don't know if it </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/870541340092539328/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/10/troubleshooting-distributed-systems-via.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/870541340092539328'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/870541340092539328'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/10/troubleshooting-distributed-systems-via.html' title='Troubleshooting Distributed Systems via Data Mining'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-4890039722003127341.post-277189465591581182</id><published>2008-10-01T16:44:00.000-04:00</published><updated>2008-10-02T21:20:32.434-04:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='grid computing'/><category scheme='http://www.blogger.com/atom/ns#' term='distributed computing'/><category scheme='http://www.blogger.com/atom/ns#' term='cluster computing'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='turtles'/><title type='text'>Clusters, Grids, and Clouds:It's Turtles All the Way Down</title><summary type='text'>In this blog, I'll discuss open problems and new developments in the field of distributed systems.A distributed system is a set of independent computers that accomplish a meaningful task by working together. The field covers a huge range of interesting systems, from sensor networks to multi-tier web server farms. For the most part, I will be writing about the sort of large computing systems used </summary><link rel='replies' type='application/atom+xml' href='http://dthain.blogspot.com/feeds/277189465591581182/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://dthain.blogspot.com/2008/10/clusters-grids-and-clouds-its-turtles.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/277189465591581182'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4890039722003127341/posts/default/277189465591581182'/><link rel='alternate' type='text/html' href='http://dthain.blogspot.com/2008/10/clusters-grids-and-clouds-its-turtles.html' title='Clusters, Grids, and Clouds:&lt;br&gt;It&apos;s Turtles All the Way Down'/><author><name>Douglas Thain</name><uri>http://www.blogger.com/profile/10046446527813216338</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
