OptorSim
OptorSim is a Grid simulator designed to test dynamic replication strategies used in optimising the efficiency of a Grid. A Java Applet version of OptorSim can be seen by clicking on the screenshot below. This demonstration shows what happens to the Grid as jobs are submitted to it and our replication algorithms control where replicas of files are created and deleted.

Screenshot of the OptorSim Java Applet
The GridPP2 (UK Grid for Particle Physics) production plan describes a future Grid infrastructure. Sample jobs based on real high energy physics analysis are simulated. A job consists of a set of files which must be read in sequential order with calculations performed on the data in each file. A single Resource Broker assigns jobs to sites based on a scheduling algorithm which takes into account the cost of accessing data for the job in terms of time and the current workload at each site.
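The scheduling idea described above can be illustrated with a minimal sketch. It is not OptorSim's actual scheduler: the `Site` class, the `estimated_cost` and `schedule` functions, the per-site bandwidth figure, and the fixed time charged per queued job are all assumptions made for illustration. The sketch only shows how a broker might combine data access time and current workload into a single cost and pick the cheapest site.

```python
from dataclasses import dataclass


@dataclass
class Site:
    name: str
    queued_jobs: int            # current workload at the site
    local_files: frozenset      # files already stored at the site
    bandwidth_gb_per_s: float   # hypothetical speed for fetching remote files


def estimated_cost(site, job_files, file_size_gb=1.0, seconds_per_queued_job=600.0):
    """Rough time cost (seconds) of running a job at `site`.

    Both terms are illustrative assumptions: remote files are charged their
    transfer time, and every job already queued adds a fixed delay.
    """
    missing = [f for f in job_files if f not in site.local_files]
    access_time = len(missing) * file_size_gb / site.bandwidth_gb_per_s
    queue_time = site.queued_jobs * seconds_per_queued_job
    return access_time + queue_time


def schedule(sites, job_files):
    """Return the site with the lowest estimated cost for this job."""
    return min(sites, key=lambda s: estimated_cost(s, job_files))


job = ["f1", "f2", "f3"]
data_site = Site("CERN", queued_jobs=0, local_files=frozenset(job), bandwidth_gb_per_s=1.0)
empty_site = Site("Glasgow", queued_jobs=0, local_files=frozenset(), bandwidth_gb_per_s=0.01)

# With empty queues, the site that already holds the files is cheapest.
first_choice = schedule([data_site, empty_site], job)

# Once a queue builds up at the data site, the empty site becomes preferable.
data_site.queued_jobs = 5
later_choice = schedule([data_site, empty_site], job)
```

With these made-up numbers the broker first picks the site holding the data and later switches to the idle site, even though that site must fetch every file remotely.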
As the job runs at a site, our replica optimisation algorithms decide whether or not it is worthwhile to create a copy, or replica, of each file it processes on local storage so that the file can be accessed faster in future. With limited storage space available it makes sense to copy only the most popular files, so this decision can be made by considering the history of files previously accessed at the site. Compared with more traditional algorithms, an economic model we have developed for trading files has been shown to be the most effective at optimising Grid resources, reducing the total time needed to process all the simulated jobs. Eventually these optimisation algorithms will be incorporated into the Replica Optimisation Service currently being developed by the EU DataGrid, which will soon be deployed on the real live Grid.
The Java Applet is a demonstration of OptorSim showing what happens when 500 jobs are submitted to the GridPP testbed while our Economic Model algorithm is in use. All the files (each of size 1GB) are initially stored at sites outwith the UK (CERN, FNAL and SLAC), and all the other sites have empty storage of between 50GB and 300GB (these numbers are scaled down from the real values, which are between 5TB and 100TB). There are 100 files for each experiment and three jobs per experiment, which require between 20 and 50 files each. The experiments GridPP is involved in are: ATLAS, CMS, LHCb (all at CERN), CDF, DZero (both at FNAL) and BaBar (at SLAC). When a site requests a file for a job it automatically creates a local replica until its local storage has filled up. From then on our algorithm decides whether it is worth deleting any files to create replicas of new files and, if so, which files to delete.
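The replication behaviour described above (replicate freely until storage is full, then use the access history to decide whether a new file is worth evicting an old one) can be sketched as follows. This is a simple frequency-based stand-in, not the Economic Model used in the applet. The `SiteStorage` class, its method names, and the rule "replace the least-accessed stored file only if the new file has been accessed more often" are assumptions chosen for illustration.

```python
from collections import Counter


class SiteStorage:
    """Toy model of one site's local storage, with capacity counted in files."""

    def __init__(self, capacity_files):
        self.capacity = capacity_files
        self.stored = set()
        self.history = Counter()   # how often each file was accessed at this site

    def access(self, file_id):
        """Record an access and decide whether to hold a local replica.

        Returns the name of the file deleted to make room, or None.
        """
        self.history[file_id] += 1
        if file_id in self.stored:
            return None
        if len(self.stored) < self.capacity:
            # Storage not yet full: always create a local replica.
            self.stored.add(file_id)
            return None
        # Storage full: find the least popular stored file (ties broken by name).
        victim = min(self.stored, key=lambda f: (self.history[f], f))
        if self.history[file_id] > self.history[victim]:
            self.stored.remove(victim)
            self.stored.add(file_id)
            return victim
        return None


site = SiteStorage(capacity_files=2)
for name in ["a", "a", "b"]:
    site.access(name)              # fills the storage with "a" and "b"

first_try = site.access("c")       # "c" is no more popular than "b": not replicated
second_try = site.access("c")      # "c" now beats "b": "b" is deleted, "c" stored
```

A counting rule like this is one of the simplest history-based strategies; it is shown here only to make the fill-then-replace behaviour concrete.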
Each section of the applet is explained in detail below:
Simulation Parameters: This table shows some of the input parameters for OptorSim. When "Auction" is on the Grid uses an auctioning protocol (described in detail in the links below) for finding the best file among the replicas distributed around the Grid. The "Scale Factor" is used to make the simulation run faster by scaling down the size of the files and storage space. Currently it is set so that twelve seconds of simulation time = one second of real time.
Simulation Time
Job Submission
Site Information
Grid Status
As the simulation begins, most of the jobs are scheduled to the sites with the best network connections, since none of the sites in the UK have any files. As the queues build up at these sites, the other sites become more favourable for running jobs. After some time the heavy traffic to the outside world becomes lighter as replicas of files become distributed throughout the Grid, and eventually the whole network is used less as jobs are scheduled to sites which now have the required files stored locally.
More information on OptorSim can be found in:
- The OptorSim website. http://cern.ch/edg-wp2/optimization/optorsim.html
- The most recent OptorSim results. William H. Bell, David G. Cameron, Ruben Carvajal-Schiaffino, A. Paul Millar, Kurt Stockinger, and Floriano Zini. Evaluation of an Economy-Based File Replication Strategy for a Data Grid. In International Workshop on Agent based Cluster and Grid Computing at CCGrid 2003, Tokyo, Japan, May 2003. IEEE Computer Society Press.
- Some previous OptorSim results. William H. Bell, David G. Cameron, Luigi Capozza, A. Paul Millar, Kurt Stockinger, and Floriano Zini. Simulation of Dynamic Grid Replication Strategies in OptorSim. In Proc. of the 3rd Int'l. IEEE Workshop on Grid Computing (Grid'2002), Baltimore, USA, November 2002. Springer Verlag, Lecture Notes in Computer Science.
- More information on the Economic Model. Mark Carman, Floriano Zini, Luciano Serafini, and Kurt Stockinger. Towards an Economy-Based Optimisation of File Access and Replication on a Data Grid. In International Workshop on Agent based Cluster and Grid Computing at International Symposium on Cluster Computing and the Grid (CCGrid'2002), Berlin, Germany, May 2002. IEEE Computer Society Press.
- A technical report outlining a Replica Optimisation Service. W. H. Bell, D. G. Cameron, L. Capozza, P. Millar, K. Stockinger, and F. Zini. Design of a Replica Optimisation Framework. Technical report, DataGrid-02-TED-021215, Geneva, Switzerland, December 2002.
Any questions? e-mail the OptorSim mailing list: hep-proj-grid-optorsim@listbox.cern.ch
(c) 2003 CERN, ITC-irst, PPARC, on behalf of the EU DataGrid.
Last modified Wed 26 November 2003.