Yogesh Simmhan
All Publications
[DBLP] [Scholar] [ACM] [MS Academic] [CiteseerX] [CSB] [MSR]

Global QuickSearch:   Number of matching entries: 0

Search Settings

    Key Author / Editor / Organization Title Year Journal / Conference / Book Pub Type Keywords
    varshney:spe:2020 Varshney, P. & Simmhan, Y.
    Characterizing Application Scheduling on Edge, Fog and Cloud Computing Resources
    2020 Software: Practice and Experience
    Vol. 50 (5) , pp. 558-595  
    article iisc, cloud, edge, fog, survey
    BibTeX:
    @article{varshney:spe:2020,
      author = {Prateeksha Varshney and Yogesh Simmhan},
      title = {Characterizing Application Scheduling on Edge, Fog and Cloud Computing Resources},
      journal = {Software: Practice and Experience},
      year = {2020},
      volume = {50},
      number = {5},
      pages = {558--595},
      doi = {https://doi.org/10.1002/spe.2699}
    }
    					
    simmhan:jiisc:2020 Simmhan, Y.; Rambha, T.; Khochare, A.; Ramesh, S.; Baranawal, A.; George, J.V.; Bhope, R.A.; Namtirtha, A.; Sundararajan, A.; Bhargav, S.S.; Thakkar, N. & Kiran, R.
    GoCoronaGo: Privacy Respecting Contact Tracing for COVID-19 Management
    2020 Journal of the Indian Institute of Science   article
    BibTeX:
    @article{simmhan:jiisc:2020,
      author = {Yogesh Simmhan and Tarun Rambha and Aakash Khochare and Shriram Ramesh and Animesh Baranawal and John Varghese George and Rahul Atul Bhope and Amrita Namtirtha and Amritha Sundararajan and Sharath Suresh Bhargav and Nihar Thakkar and Raj Kiran},
      title = {GoCoronaGo: Privacy Respecting Contact Tracing for COVID-19 Management},
      journal = {Journal of the Indian Institute of Science},
      year = {2020},
      note = {To Appear},
      url = {https://arxiv.org/abs/2009.04916}
    }
    					
    ramesh:ccgrid:2020 Ramesh, S.; Baranawal, A. & Simmhan, Y.
    A Distributed Path Query Engine for Temporal Property Graphs
    2020 IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) , pp. 499-508   inproceedings
    BibTeX:
    @inproceedings{ramesh:ccgrid:2020,
      author = {Shriram Ramesh and Animesh Baranawal and Yogesh Simmhan},
      title = {A Distributed Path Query Engine for Temporal Property Graphs},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID)},
      year = {2020},
      pages = {499--508},
      doi = {https://doi.org/10.1109/CCGrid49817.2020.00-43}
    }
    					
    gandhi:icde:2020 Gandhi, S. & Simmhan, Y.
    An Interval-centric Model for Distributed Computing over Temporal Graphs
    2020 IEEE International Conference on Data Engineering (ICDE) , pp. 1129-1140   inproceedings
    BibTeX:
    @inproceedings{gandhi:icde:2020,
      author = {Swapnil Gandhi and Yogesh Simmhan},
      title = {An Interval-centric Model for Distributed Computing over Temporal Graphs},
      booktitle = {IEEE International Conference on Data Engineering (ICDE)},
      year = {2020},
      pages = {1129--1140},
      doi = {https://doi.org/10.1109/ICDE48307.2020.00102}
    }
    					
    acharya:comsnet:2020 Acharya, S.; Bharadwaj, A.; Simmhan, Y.; Gopalan, A.; Parag, P. & Tyagi, H.
    CORNET: A Co-Simulation Middleware for Robot Networks
    2020 IEEE International Conference on COMmunication Systems & NETworkS (COMSNETS) , pp. 245-251   inproceedings
    BibTeX:
    @inproceedings{acharya:comsnet:2020,
      author = {Srikrishna Acharya and Amrutur Bharadwaj and Yogesh Simmhan and Aditya Gopalan and Parimal Parag and Himanshu Tyagi},
      title = {CORNET: A Co-Simulation Middleware for Robot Networks},
      booktitle = {IEEE International Conference on COMmunication Systems & NETworkS (COMSNETS)},
      year = {2020},
      pages = {245--251},
      doi = {https://doi.org/10.1109/COMSNETS48256.2020.9027459}
    }
    					
    garg:europar:2020 Garg, D.; Shirolkar, P.; Shukla, A. & Simmhan, Y.
    TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage
    2020
    Vol. 12247 International Conference on Parallel and Distributed Computing (Euro-Par) , pp. 281-295  
    inproceedings
    BibTeX:
    @inproceedings{garg:europar:2020,
      author = {Dhruv Garg and Prathik Shirolkar and Anshu Shukla and Yogesh Simmhan},
      title = {TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage},
      booktitle = {International Conference on Parallel and Distributed Computing (Euro-Par)},
      publisher = {Springer},
      year = {2020},
      volume = {12247},
      pages = {281--295},
      doi = {https://doi.org/10.1007/978-3-030-57675-2%5C_18}
    }
    					
    simmhan:icfec:2020 Simmhan, Y. & Varghese, B. (Hrsg.)
    Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)
    2020   proceedings
    BibTeX:
    @proceedings{simmhan:icfec:2020,,
      title = {Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)},
      year = {2020},
      doi = {https://doi.org/10.1109/ICFEC50348.2020}
    }
    					
    mueller:isorc:2020 Mueller, F.; Cucinotta, T. & Simmhan, Y. (Hrsg.)
    Proceedings of the IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC)
    2020   proceedings
    BibTeX:
    @proceedings{mueller:isorc:2020,,
      title = {Proceedings of the IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC)},
      year = {2020},
      doi = {https://doi.org/10.1109/ISORC49007.2020}
    }
    					
    buyya:csur:2019 Buyya, R.; Srirama, S.N.; Casale, G.; Calheiros, R.N.; Simmhan, Y.; Varghese, B.; Gelenbe, E.; Javadi, B.; Vaquero, L.M.; Netto, M.A.S.; Toosi, A.N.; Rodriguez, M.A.; Llorente, I.M.; di Vimercati, S.D.C.; Samarati, P.; Milojicic, D.S.; Varela, C.A.; Bahsoon, R.; de Assunção, M.D.; Rana, O.; Zhou, W.; Jin, H.; Gentzsch, W.; Zomaya, A.Y. & Shen, H.
    A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade
    2019 ACM Computing Surveys (CSUR)
    Vol. 51 (5) , pp. 105:1-105:38  
    article iisc, cloud
    BibTeX:
    @article{buyya:csur:2019,
      author = {Rajkumar Buyya and Satish Narayana Srirama and Giuliano Casale and Rodrigo N. Calheiros and Yogesh Simmhan and Blesson Varghese and Erol Gelenbe and Bahman Javadi and Luis Miguel Vaquero and Marco A. S. Netto and Adel Nadjaran Toosi and Maria Alejandra Rodriguez and Ignacio Mart\in Llorente and Sabrina De Capitani di Vimercati and Pierangela Samarati and Dejan S. Milojicic and Carlos A. Varela and Rami Bahsoon and Marcos Dias de Assunção and Omer Rana and Wanlei Zhou and Hai Jin and Wolfgang Gentzsch and Albert Y. Zomaya and Haiying Shen},
      title = {A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade},
      journal = {ACM Computing Surveys (CSUR)},
      year = {2019},
      volume = {51},
      number = {5},
      pages = {105:1--105:38},
      url = {https://arxiv.org/abs/1711.09123},
      doi = {https://doi.org/10.1145/3241737}
    }
    					
    varshney:tpds:2019 Varshney, P. & Simmhan, Y.
    AutoBoT: Resilient and Cost-effective Scheduling of a Bag of Tasks on Spot VMs
    2019 IEEE Transactions on Parallel and Distributed Systems (TPDS)
    Vol. 30 (7) , pp. 1512-1527  
    article iisc, cloud, scheduling, spot vm
    BibTeX:
    @article{varshney:tpds:2019,
      author = {Prateeksha Varshney and Yogesh Simmhan},
      title = {AutoBoT: Resilient and Cost-effective Scheduling of a Bag of Tasks on Spot VMs},
      journal = {IEEE Transactions on Parallel and Distributed Systems (TPDS)},
      year = {2019},
      volume = {30},
      number = {7},
      pages = {1512--1527},
      doi = {https://doi.org/10.1109/TPDS.2018.2889851}
    }
    					
    simhan:encycl:2019 Simmhan, Y. Sakr, S. & Zomaya, A.Y. (Hrsg.)
    Big Data and Fog Computing ( Encyclopedia of Big Data Technologies )
    2019 Encyclopedia of Big Data Technologies   inbook iisc, big data, fog computing, iot, peer reviewed
    BibTeX:
    @inbook{simhan:encycl:2019,
      author = {Yogesh Simmhan},
      title = {Encyclopedia of Big Data Technologies},
      publisher = {Springer},
      year = {2019},
      url = {http://arxiv.org/abs/1712.09552},
      doi = {https://doi.org/10.1007/978-3-319-63962-8_41-1}
    }
    					
    jaiswal:ipdpsw:2019 Jaiswal, S.D. & Simmhan, Y.
    A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs
    2019 IEEE International Workshop on High-Performance Big Data, Deep Learning, and Cloud Computing (HPBDC), Co-located with IEEE International Parallel and Distributed Processing Symposium (IPDPS) , pp. 452-459   inproceedings iisc, graph, subgraph centric, algorithm
    BibTeX:
    @inproceedings{jaiswal:ipdpsw:2019,
      author = {Siddharth D. Jaiswal and Yogesh Simmhan},
      title = {A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs},
      booktitle = {IEEE International Workshop on High-Performance Big Data, Deep Learning, and Cloud Computing (HPBDC), Co-located with IEEE International Parallel and Distributed Processing Symposium (IPDPS)},
      year = {2019},
      pages = {452--459},
      url = {https://arxiv.org/abs/1903.06950},
      doi = {https://doi.org/10.1109/IPDPSW.2019.00085}
    }
    					
    khochare:icdcn:2019 Khochare, A. & Simmhan, Y.
    A scalable and composable analytics platform for distributed wide-area tracking
    2019 ACM International Conference on Distributed Computing and Networking (ICDCN) , pp. 506   inproceedings
    BibTeX:
    @inproceedings{khochare:icdcn:2019,
      author = {Aakash Khochare and Yogesh Simmhan},
      title = {A scalable and composable analytics platform for distributed wide-area tracking},
      booktitle = {ACM International Conference on Distributed Computing and Networking (ICDCN)},
      year = {2019},
      pages = {506},
      note = {Extended Abstract},
      doi = {https://doi.org/10.1145/3288599.3299753}
    }
    					
    dindokar:cloud:2019 Dindokar, R. & Simmhan, Y.
    Adaptive Partition Migration for Irregular Graph Algorithms on Elastic Resources
    2019 IEEE International Conference on Cloud Computing (CLOUD) , pp. 281-290   inproceedings iisc, graph, cloud, goffish
    BibTeX:
    @inproceedings{dindokar:cloud:2019,
      author = {Ravikant Dindokar and Yogesh Simmhan},
      title = {Adaptive Partition Migration for Irregular Graph Algorithms on Elastic Resources},
      booktitle = {IEEE International Conference on Cloud Computing (CLOUD)},
      year = {2019},
      pages = {281--290},
      note = {[CORE B]},
      doi = {https://doi.org/10.1109/CLOUD.2019.00-28}
    }
    					
    chaudhary:hipcw:2019 Chaudhary, D.; Kahali, B. & Simmhan, Y.
    An Empirical Study on Efficient Storage of Human Genome Data
    2019 Women in Data Science and Computing Workshop, Co-located with IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) , pp. 87-92   inproceedings
    BibTeX:
    @inproceedings{chaudhary:hipcw:2019,
      author = {Diksha Chaudhary and Bratati Kahali and Yogesh Simmhan},
      title = {An Empirical Study on Efficient Storage of Human Genome Data},
      booktitle = {Women in Data Science and Computing Workshop, Co-located with IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)},
      year = {2019},
      pages = {87--92},
      doi = {https://doi.org/10.1109/HiPCW.2019.00030}
    }
    					
    khochare:ccgrid:2019 Khochare, A.; Ramachandra, S.; Ramesh, S. & Simmhan, Y.
    Dynamic Scaling of Video Analytics for Wide-area Tracking in Urban Spaces
    2019 IEEE International Scalable Computing Challenge (SCALE), Co-located with IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) , pp. 76-81   inproceedings iisc, edge, video analytics
    BibTeX:
    @inproceedings{khochare:ccgrid:2019,
      author = {Aakash Khochare and Sheshadri Ramachandra and Shriram Ramesh and Yogesh Simmhan},
      title = {Dynamic Scaling of Video Analytics for Wide-area Tracking in Urban Spaces},
      booktitle = {IEEE International Scalable Computing Challenge (SCALE), Co-located with IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)},
      year = {2019},
      pages = {76--81},
      note = {SCALE Challenge Winner},
      doi = {https://doi.org/10.1109/CCGRID.2019.00018}
    }
    					
    monga:icws:2019 Monga, S.K.; Sheshadri K, R. & Simmhan, Y.
    ElfStore: A Resilient Data Storage Service for Federated Edge and Fog Resources
    2019 IEEE International Conference on Web Services (ICWS) , pp. 336-345   inproceedings iisc, edge, fog, storage, reliability
    BibTeX:
    @inproceedings{monga:icws:2019,
      author = {Sumit Kumar Monga and Sheshadri K R and Yogesh Simmhan},
      title = {ElfStore: A Resilient Data Storage Service for Federated Edge and Fog Resources},
      booktitle = {IEEE International Conference on Web Services (ICWS)},
      year = {2019},
      pages = {336--345},
      note = {[CORE A]},
      doi = {https://doi.org/10.1109/ICWS.2019.00062}
    }
    					
    alva:ccwi:2019 Alva, P.; Sheetal Kumar, K.R.; Simmhan, Y. & Mohan Kumar, M.S.
    Enabling Equitable Water Supply in a Mega-city using a Big Data Analytics Platform
    2019 International Conference on Computing and Control for Water Industry (CCWI) , pp. 1-2   inproceedings
    BibTeX:
    @inproceedings{alva:ccwi:2019,
      author = {Prithvi Alva and Sheetal Kumar K.R. and Yogesh Simmhan and Mohan Kumar M.S.},
      title = {Enabling Equitable Water Supply in a Mega-city using a Big Data Analytics Platform},
      booktitle = {International Conference on Computing and Control for Water Industry (CCWI)},
      year = {2019},
      pages = {1--2},
      note = {Extended Abstract}
    }
    					
    simmhan:escience:2019 Simmhan, Y.; Hegde, M.; Zele, R.; Tripathi, S.N.; Nair, S.; Monga, S.K.; Sahu, R.; Dixit, K.; Sutaria, R.; Mishra, B.; Sharma, A. & Anand, S.V.R.
    SATVAM: Toward an IoT Cyber-Infrastructure for Low-Cost Urban Air Quality Monitoring
    2019 IEEE International Conference on eScience (eScience) , pp. 57-66   inproceedings
    BibTeX:
    @inproceedings{simmhan:escience:2019,
      author = {Yogesh Simmhan and Malati Hegde and Rajesh Zele and Sachchida N. Tripathi and Srijith Nair and Sumit K. Monga and Ravi Sahu and Kuldeep Dixit and Ronak Sutaria and Brijesh Mishra and Anamika Sharma and Anand SVR},
      title = {SATVAM: Toward an IoT Cyber-Infrastructure for Low-Cost Urban Air Quality Monitoring},
      booktitle = {IEEE International Conference on eScience (eScience)},
      year = {2019},
      pages = {57--66},
      doi = {https://doi.org/10.1109/eScience.2019.00014}
    }
    					
    chaturvedi:isorc:2019 Chaturvedi, S. & Simmhan, Y.
    Toward Resilient Stream Processing on Clouds using Moving Target Defense
    2019 IEEE International Symposium on Real-Time Distributed Computing (ISORC) , pp. 134-142   inproceedings
    BibTeX:
    @inproceedings{chaturvedi:isorc:2019,
      author = {Shilpa Chaturvedi and Yogesh Simmhan},
      title = {Toward Resilient Stream Processing on Clouds using Moving Target Defense},
      booktitle = {IEEE International Symposium on Real-Time Distributed Computing (ISORC)},
      year = {2019},
      pages = {134--142},
      doi = {https://doi.org/10.1109/ISORC.2019.00035}
    }
    					
    shen:icfec:2019 Shen, H. & Simmhan, Y. (Hrsg.)
    Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)
    2019   proceedings
    BibTeX:
    @proceedings{shen:icfec:2019,,
      title = {Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)},
      year = {2019},
      url = {https://ieeexplore.ieee.org/xpl/conhome/8730889/proceeding}
    }
    					
    ghosh:tcps:2018 Ghosh, R. & Simmhan, Y.
    Distributed Scheduling of Event Analytics across Edge and Cloud
    2018 ACM Transactions on Cyber-Physical Systems (TCPS)
    Vol. 2 (4) , pp. 24:1-24:28  
    article iisc, peer reviewed, stream processing, edge computing, iot
    BibTeX:
    @article{ghosh:tcps:2018,
      author = {Rajrup Ghosh and Yogesh Simmhan},
      title = {Distributed Scheduling of Event Analytics across Edge and Cloud},
      journal = {ACM Transactions on Cyber-Physical Systems (TCPS)},
      year = {2018},
      volume = {2},
      number = {4},
      pages = {24:1--24:28},
      url = {https://arxiv.org/abs/1608.01537},
      doi = {https://doi.org/10.1145/3140256}
    }
    					
    shukla:jpdc:2018 Shukla, A. & Simmhan, Y.
    Model-driven Scheduling for Distributed Stream Processing Systems
    2018 Journal of Parallel and Distributed Computing (JPDC)
    Vol. 117 , pp. 98-114  
    article peer reviewed, iisc, stream processing
    BibTeX:
    @article{shukla:jpdc:2018,
      author = {Anshu Shukla and Yogesh Simmhan},
      title = {Model-driven Scheduling for Distributed Stream Processing Systems},
      journal = {Journal of Parallel and Distributed Computing (JPDC)},
      year = {2018},
      volume = {117},
      pages = {98--114},
      url = {https://arxiv.org/abs/1702.01785},
      doi = {https://doi.org/10.1016/j.jpdc.2018.02.003}
    }
    					
    heidari:csur:2018 Heidari, S.; Simmhan, Y.; Calheiros, R.N. & Buyya, R.
    Scalable Graph Processing Frameworks: A Taxonomy and Open Challenges
    2018 ACM Computing Surveys (CSUR)
    Vol. 51 (3) , pp. 1-53  
    article peer reviewed, iisc, graph processing
    BibTeX:
    @article{heidari:csur:2018,
      author = {Safiollah Heidari and Yogesh Simmhan and Rodrigo N. Calheiros and Rajkumar Buyya},
      title = {Scalable Graph Processing Frameworks: A Taxonomy and Open Challenges},
      journal = {ACM Computing Surveys (CSUR)},
      year = {2018},
      volume = {51},
      number = {3},
      pages = {1--53},
      url = {https://dl.acm.org/citation.cfm?id=3199523},
      doi = {https://doi.org/10.1145/3199523}
    }
    					
    simmhan:spe:2018 Simmhan, Y.; Ravindra, P.; Chaturvedi, S.; Hegde, M. & Ballamajalu, R.
    Towards a Data-driven IoT Software Architecture for Smart City Utilities
    2018 Software: Practice and Experience
    Vol. 48 (7) , pp. 1390-1416  
    article peer reviewed, iisc, smart city, iot
    BibTeX:
    @article{simmhan:spe:2018,
      author = {Yogesh Simmhan and Pushkara Ravindra and Shilpa Chaturvedi and Malati Hegde and Rashmi Ballamajalu},
      title = {Towards a Data-driven IoT Software Architecture for Smart City Utilities},
      journal = {Software: Practice and Experience},
      year = {2018},
      volume = {48},
      number = {7},
      pages = {1390--1416},
      url = {http://arxiv.org/abs/1803.02500},
      doi = {https://doi.org/10.1002/spe.2580}
    }
    					
    ghosh:ccgrid:2018 Ghosh, R.; Reddy, S.P. & Simmhan, Y.
    Adaptive Energy-aware Scheduling of Dynamic Event Analytics across Edge and Cloud Resources
    2018 IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) , pp. 1-11   inproceedings
    BibTeX:
    @inproceedings{ghosh:ccgrid:2018,
      author = {Rajrup Ghosh and Siva Prakash Reddy and Yogesh Simmhan},
      title = {Adaptive Energy-aware Scheduling of Dynamic Event Analytics across Edge and Cloud Resources},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)},
      year = {2018},
      pages = {1--11},
      note = {[CORE A]},
      url = {https://arxiv.org/abs/1801.01087}
    }
    					
    shukla:icdcs:2018 Shukla, A. & Simmhan, Y.
    Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds
    2018 IEEE International Conference on Distributed Computing Systems (ICDCS) , pp. 1-11   inproceedings peer reviewed, iisc, stream processing
    BibTeX:
    @inproceedings{shukla:icdcs:2018,
      author = {Anshu Shukla and Yogesh Simmhan},
      title = {Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds},
      booktitle = {IEEE International Conference on Distributed Computing Systems (ICDCS)},
      year = {2018},
      pages = {1--11},
      note = {[CORE A]},
      url = {https://arxiv.org/abs/1712.00605}
    }
    					
    badiger:europar:2018 Badiger, S.; Baheti, S. & Simmhan, Y.
    VIoLET: A Large-scale Virtual Environment for Internet of Things
    2018 International European Conference on Parallel and Distributed Computing (EuroPar) , pp. 1-16   inproceedings iisc, peer reviewed, iot
    BibTeX:
    @inproceedings{badiger:europar:2018,
      author = {Shreyas Badiger and Shrey Baheti and Yogesh Simmhan},
      title = {VIoLET: A Large-scale Virtual Environment for Internet of Things},
      booktitle = {International European Conference on Parallel and Distributed Computing (EuroPar)},
      year = {2018},
      pages = {1--16},
      note = {[CORE A]},
      url = {https://github.com/dream-lab/VIoLET}
    }
    					
    jha:ccpe:2017 Jha, S.; Luckow, D.S.K.A.; Rana, O. & amd Neil Chue Hong, Y.S.
    Introducing Distributed Dynamic Data-intensive (D3) Science: Understanding Applications and Infrastructure
    2017 Concurrency and Computation: Practice and Experience
    Vol. 29 (8)  
    article peer reviewed, iisc, escience, big data
    BibTeX:
    @article{jha:ccpe:2017,
      author = {Shantenu Jha and Daniel S. Katz Andre Luckow and Omer Rana and Yogesh Simmhan amd Neil Chue Hong},
      title = {Introducing Distributed Dynamic Data-intensive (D3) Science: Understanding Applications and Infrastructure},
      journal = {Concurrency and Computation: Practice and Experience},
      year = {2017},
      volume = {29},
      number = {8},
      url = {https://github.com/radical-project/3DPAS},
      doi = {https://doi.org/10.1002/cpe.4032}
    }
    					
    simmhan:iotn:2017 Simmhan, Y.
    IoT Analytics Across Edge and Cloud Platforms
    2017 IEEE Internet of Things Newsletter   article iisc, edge computing, iot
    BibTeX:
    @article{simmhan:iotn:2017,
      author = {Yogesh Simmhan},
      title = {IoT Analytics Across Edge and Cloud Platforms},
      journal = {IEEE Internet of Things Newsletter},
      year = {2017},
      url = {http://iot.ieee.org/newsletter/may-2017/iot-analytics-across-edge-and-cloud-platforms}
    }
    					
    zhou:fgcs:2017 Zhou, Q.; Simmhan, Y. & Prasanna, V.
    Knowledge-infused and Consistent Complex Event Processing over Real-time and Persistent Streams
    2017 Future Generation Computer Systems
    Vol. 76 , pp. 391-406  
    article peer reviewed, cep, stream processing, semantics, iisc
    BibTeX:
    @article{zhou:fgcs:2017,
      author = {Qunzhi Zhou and Yogesh Simmhan and Viktor Prasanna},
      title = {Knowledge-infused and Consistent Complex Event Processing over Real-time and Persistent Streams},
      journal = {Future Generation Computer Systems},
      year = {2017},
      volume = {76},
      pages = {391--406},
      doi = {https://doi.org/10.1016/j.future.2016.10.030}
    }
    					
    shukla:ccpe:2017 Shukla, A.; Chaturvedi, S. & Simmhan, Y.
    RIoTBench: An IoT Benchmark for Distributed Stream Processing Systems
    2017 Concurrency and Computation: Practice and Experience
    Vol. 29 (21) , pp. 1-22  
    article iisc, iot, stream processing, benchmark, peer reviewed
    BibTeX:
    @article{shukla:ccpe:2017,
      author = {Anshu Shukla and Shilpa Chaturvedi and Yogesh Simmhan},
      title = {RIoTBench: An IoT Benchmark for Distributed Stream Processing Systems},
      journal = {Concurrency and Computation: Practice and Experience},
      year = {2017},
      volume = {29},
      number = {21},
      pages = {1--22},
      url = {https://arxiv.org/abs/1701.08530},
      doi = {https://doi.org/10.1002/cpe.4257}
    }
    					
    kalyanasundaram:hipc:2017 Kalyanasundaram, J. & Simmhan, Y.
    ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data Workloads
    2017 IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) , pp. 1-10   inproceedings iisc, peer reviewed, big data, low power
    BibTeX:
    @inproceedings{kalyanasundaram:hipc:2017,
      author = {Jayanth Kalyanasundaram and Yogesh Simmhan},
      title = {ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data Workloads},
      booktitle = {IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)},
      year = {2017},
      pages = {1--10},
      note = {Best paper finalist, [CORE B]},
      url = {https://arxiv.org/abs/1701.05996},
      doi = {https://doi.org/10.1109/HiPC.2017.00032}
    }
    					
    dindokar:hipcw:2017 Dindokar, R. & Simmhan, Y.
    Characterization of Vertex-centric Breadth First Search for Lattice Graphs
    2017 IEEE International Workshop on Foundations in Big Data Computing (BigDF), Co-located with HiPC , pp. 1-8   inproceedings iisc, peer reviewed, graph processing
    BibTeX:
    @inproceedings{dindokar:hipcw:2017,
      author = {Ravikant Dindokar and Yogesh Simmhan},
      title = {Characterization of Vertex-centric Breadth First Search for Lattice Graphs},
      booktitle = {IEEE International Workshop on Foundations in Big Data Computing (BigDF), Co-located with HiPC},
      year = {2017},
      pages = {1--8},
      doi = {https://doi.org/10.1109/HiPCW.2017.00014}
    }
    					
    chaturvedi:escience:2017 Chaturvedi, S.; Tyagi, S. & Simmhan, Y.
    Collaborative Reuse of Streaming Dataflows in IoT Applications
    2017 IEEE International Conference on eScience (eScience) , pp. 1-10   inproceedings iisc, peer reviewed, iot, stream processing
    BibTeX:
    @inproceedings{chaturvedi:escience:2017,
      author = {Shilpa Chaturvedi and Sahil Tyagi and Yogesh Simmhan},
      title = {Collaborative Reuse of Streaming Dataflows in IoT Applications},
      booktitle = {IEEE International Conference on eScience (eScience)},
      year = {2017},
      pages = {1--10},
      note = {[CORE A]},
      url = {https://arxiv.org/abs/1709.03332},
      doi = {https://doi.org/10.1109/eScience.2017.54}
    }
    					
    varshney:icfec:2017 Varshney, P. & Simmhan, Y.
    Demystifying Fog Computing: Characterizing Architectures, Applications and Abstractions
    2017 IEEE International Conference on Fog and Edge Computing (ICFEC) , pp. 1-10   inproceedings peer reviewed, iisc, cloud, iot, fog, edge
    BibTeX:
    @inproceedings{varshney:icfec:2017,
      author = {Prateeksha Varshney and Yogesh Simmhan},
      title = {Demystifying Fog Computing: Characterizing Architectures, Applications and Abstractions},
      booktitle = {IEEE International Conference on Fog and Edge Computing (ICFEC)},
      year = {2017},
      pages = {1--10},
      url = {https://arxiv.org/abs/1702.06331},
      doi = {https://doi.org/10.1109/ICFEC.2017.20}
    }
    					
    khochare:iscocw:2017 Khochare, A.; Ravindra, P.; Reddy, S.P. & Simmhan, Y.
    Distributed Video Analytics across Edge and Cloud using ECHO
    2017 International Conference on Service-Oriented Computing (ICSOC) Demo , pp. 1-6   inproceedings iisc, peer reviewed, iot, edge computing
    BibTeX:
    @inproceedings{khochare:iscocw:2017,
      author = {Aakash Khochare and Pushkara Ravindra and Siva Prakash Reddy and Yogesh Simmhan},
      title = {Distributed Video Analytics across Edge and Cloud using ECHO},
      booktitle = {International Conference on Service-Oriented Computing (ICSOC) Demo},
      year = {2017},
      pages = {1--6},
      url = {http://www.icsoc.spilab.es/wp-content/uploads/2017/10/Distributed-Video-Analytics-across-Edge-and-Cloud-using-ECHO.pdf}
    }
    					
    ravindra:iscoc:2017 Ravindra, P.; Khochare, A.; Reddy, S.P.; Sharma, S.; Varshney, P. & Simmhan, Y.
    ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge
    2017 International Conference on Service-Oriented Computing (ICSOC) , pp. 1-16   inproceedings iisc, peer reviewed, iot, edge computing
    BibTeX:
    @inproceedings{ravindra:iscoc:2017,
      author = {Pushkara Ravindra and Aakash Khochare and Siva Prakash Reddy and Sarthak Sharma and Prateeksha Varshney and Yogesh Simmhan},
      title = {ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge},
      booktitle = {International Conference on Service-Oriented Computing (ICSOC)},
      year = {2017},
      pages = {1--16},
      note = {[CORE A]},
      url = {https://arxiv.org/abs/1707.00889},
      doi = {https://doi.org/10.1007/978-3-319-69035-3_28}
    }
    					
    simmhan:ccpe:2016 Simmhan, Y.; Ramakrishnan, L.; Antoniu, G. & Goble, C.
    Editorial: Cloud computing for data-driven science and engineering
    2016 Concurrency and Computation: Practice and Experience   article iisc, editorial
    BibTeX:
    @article{simmhan:ccpe:2016,
      author = {Yogesh Simmhan and Lavanya Ramakrishnan and Gabriel Antoniu and Carole Goble},
      title = {Editorial: Cloud computing for data-driven science and engineering},
      journal = {Concurrency and Computation: Practice and Experience},
      year = {2016},
      url = {http://onlinelibrary.wiley.com/doi/10.1002/cpe.3668/full},
      doi = {https://doi.org/10.1002/cpe.3668}
    }
    					
    simmhan:bidatabook:2016 Simmhan, Y. & Perera, S. Pyne, S.; Rao, B.L.S.P. & Rao, S.B. (Hrsg.)
    Big Data Analytics Platforms for Real-Time Applications in IoT ( Big Data Analytics: Methods and Applications )
    2016 Big Data Analytics: Methods and Applications , pp. 115-135   inbook iisc, big data, peer reviewed
    BibTeX:
    @inbook{simmhan:bidatabook:2016,
      author = {Yogesh Simmhan and Srinath Perera},
      title = {Big Data Analytics: Methods and Applications},
      publisher = {Springer India},
      year = {2016},
      pages = {115--135},
      doi = {https://doi.org/10.1007/978-81-322-3628-3_7}
    }
    					
    dindokar:bigdata:2016 Dindokar, R.; Choudhury, N. & Simmhan, Y.
    A Meta-graph Approach to Analyze Subgraph-centric Distributed Programming Models
    2016 IEEE International Conference on Big Data (Big Data) , pp. 37-47   inproceedings graph, goffish, meta-graph, analysis, iisc, peer reviewed
    BibTeX:
    @inproceedings{dindokar:bigdata:2016,
      author = {Ravikant Dindokar and Neel Choudhury and Yogesh Simmhan},
      title = {A Meta-graph Approach to Analyze Subgraph-centric Distributed Programming Models},
      booktitle = {IEEE International Conference on Big Data (Big Data)},
      year = {2016},
      pages = {37--47},
      url = {http://ieeexplore.ieee.org/document/7840587/},
      doi = {https://doi.org/10.1109/BigData.2016.7840587}
    }
    					
    shukla:tpctc:2016 Shukla, A. & Simmhan, Y.
    Benchmarking Distributed Stream Processing Platforms for IoT Applications
    2016
    Vol. 10080 TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC) , pp. 90-106  
    inproceedings iot, peer reviewed, iisc, stream, benchmark
    BibTeX:
    @inproceedings{shukla:tpctc:2016,
      author = {Anshu Shukla and Yogesh Simmhan},
      title = {Benchmarking Distributed Stream Processing Platforms for IoT Applications},
      booktitle = {TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC)},
      year = {2016},
      volume = {10080},
      pages = {90--106},
      url = {https://arxiv.org/abs/1606.07621},
      doi = {https://doi.org/10.1007/978-3-319-54334-5_7}
    }
    					
    dindokar:ccgrid:2016 Dindokar, R. & Simmhan, Y.
    Elastic Partition Placement for Non-stationary Graph Algorithms
    2016 IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid) , pp. 90-93   inproceedings goffish, peer reviewed, iisc, graph, cloud
    BibTeX:
    @inproceedings{dindokar:ccgrid:2016,
      author = {Ravikant Dindokar and Yogesh Simmhan},
      title = {Elastic Partition Placement for Non-stationary Graph Algorithms},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid)},
      year = {2016},
      pages = {90--93},
      note = {Short Paper, [CORE A]},
      url = {http://ieeexplore.ieee.org/document/7515673/},
      doi = {https://doi.org/10.1109/CCGrid.2016.97}
    }
    					
    jamadagni:ccgrid:2016 Jamadagni, N. & Simmhan, Y.
    GoDB: From Batch Processing to Distributed Querying over Property Graphs
    2016 IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid) , pp. 281-290   inproceedings godb, goffish, peer reviewed, iisc, graph
    BibTeX:
    @inproceedings{jamadagni:ccgrid:2016,
      author = {Nitin Jamadagni and Yogesh Simmhan},
      title = {GoDB: From Batch Processing to Distributed Querying over Property Graphs},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid)},
      year = {2016},
      pages = {281--290},
      note = {[CORE A]},
      url = {http://ieeexplore.ieee.org/document/7515700/},
      doi = {https://doi.org/10.1109/CCGrid.2016.105}
    }
    					
    aluru:jpdc:2015 Aluru, S. & Simmhan, Y.
    Editorial: Scalable Systems for Big Data Management and Analytics
    2015 Journal of Parallel and Distributed Systems (JPDC)   article editorial, iisc, big data
    BibTeX:
    @article{aluru:jpdc:2015,
      author = {Srinivas Aluru and Yogesh Simmhan},
      title = {Editorial: Scalable Systems for Big Data Management and Analytics},
      journal = {Journal of Parallel and Distributed Systems (JPDC)},
      year = {2015},
      note = {To Appear}
    }
    					
    Aman:tkde:2015 Aman, S.; Simmhan, Y. & Prasanna, V.
    Holistic Measures for Evaluating Prediction Models in Smart Grids
    2015 IEEE Transactions on Knowledge and Data Engineering (TKDE)
    Vol. 27 (2) , pp. 475-488  
    article usc, machine learning, smart grid, peer reviewed, iisc
    BibTeX:
    @article{Aman:tkde:2015,
      author = {Saima Aman and Yogesh Simmhan and Viktor Prasanna},
      title = {Holistic Measures for Evaluating Prediction Models in Smart Grids},
      journal = {IEEE Transactions on Knowledge and Data Engineering (TKDE)},
      year = {2015},
      volume = {27},
      number = {2},
      pages = {475--488},
      note = {[IF 2.476, CORE A]},
      doi = {https://doi.org/10.1109/TKDE.2014.2327022}
    }
    					
    kumbhare:tcc:2015 Kumbhare, A.G.; Simmhan, Y.; Frincu, M. & Prasanna, V.K.
    Reactive Resource Provisioning Heuristics for Dynamic Dataflows on Cloud Infrastructure
    2015 IEEE Transactions on Cloud Computing (TCC)
    Vol. 3 (2) , pp. 105-118  
    article peer reviewed, iisc, stream processing, cloud
    BibTeX:
    @article{kumbhare:tcc:2015,
      author = {Alok Gautam Kumbhare and Yogesh Simmhan and Marc Frincu and Viktor K. Prasanna},
      title = {Reactive Resource Provisioning Heuristics for Dynamic Dataflows on Cloud Infrastructure},
      journal = {IEEE Transactions on Cloud Computing (TCC)},
      year = {2015},
      volume = {3},
      number = {2},
      pages = {105--118},
      doi = {https://doi.org/10.1109/TCC.2015.2394316}
    }
    					
    mishra:iotn:2015 Misra, P.; Simmhan, Y. & Warrior, J.
    Towards a Practical Architecture for Internet of Things: An India-centric View
    2015 IEEE Internet of Things Newsletter , pp. 1-2   article iot, iisc
    BibTeX:
    @article{mishra:iotn:2015,
      author = {Prasant Misra and Yogesh Simmhan and Jay Warrior},
      title = {Towards a Practical Architecture for Internet of Things: An India-centric View},
      journal = {IEEE Internet of Things Newsletter},
      year = {2015},
      pages = {1-2},
      url = {http://iot.ieee.org/newsletter/january-2015/towards-a-practical-architecture-for-internet-of-things-an-india-centric-view.html}
    }
    					
    dindokar:parlearning:2015 Dindokar, R.; Choudhury, N. & Simmhan, Y.
    Analysis of Subgraph-centric Distributed Shortest Path Algorithm
    2015 IEEE International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning), Co-located with IPDPS , pp. 1185-1190   inproceedings peer reviewed, iisc, graph processing
    BibTeX:
    @inproceedings{dindokar:parlearning:2015,
      author = {Ravikant Dindokar and Neel Choudhury and Yogesh Simmhan},
      title = {Analysis of Subgraph-centric Distributed Shortest Path Algorithm},
      booktitle = {IEEE International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning), Co-located with IPDPS},
      year = {2015},
      pages = {1185--1190},
      note = {Short paper},
      url = {http://ieeexplore.ieee.org/document/7284445/},
      doi = {https://doi.org/10.1109/IPDPSW.2015.87}
    }
    					
    simmhan:wbdb:2015 Simmhan, Y.; Shukla, A. & Verma, A.
    Benchmarking Fast Data Platforms for the Aadhaar Biometric Database
    2015
    Vol. 10044 Workshop on Big Data Benchmarking (WBDB) , pp. 21-39  
    inproceedings iisc, stream processing, uidai, benchmark, peer reviewed
    BibTeX:
    @inproceedings{simmhan:wbdb:2015,
      author = {Yogesh Simmhan and Anshu Shukla and Arun Verma},
      title = {Benchmarking Fast Data Platforms for the Aadhaar Biometric Database},
      booktitle = {Workshop on Big Data Benchmarking (WBDB)},
      year = {2015},
      volume = {10044},
      pages = {21--39},
      url = {http://arxiv.org/abs/1510.04160},
      doi = {https://doi.org/10.1007/978-3-319-49748-8_2}
    }
    					
    shukla:hipcw:2015 Shukla, A.; Sharma, T. & Simmhan, Y.
    Characterizing Distributed Stream Processing Systems for IoT Applications
    2015 Workshop on Architectural Support and Middleware for InfoSymbiotics/ Dynamic Data Driven Applications Systems (DDDAS), co-located with High Performance Computing Conference (HiPC) , pp. 61   inproceedings iisc, iot, stream processing, peer reviewed
    BibTeX:
    @inproceedings{shukla:hipcw:2015,
      author = {Anshu Shukla and Tarun Sharma and Yogesh Simmhan},
      title = {Characterizing Distributed Stream Processing Systems for IoT Applications},
      booktitle = {Workshop on Architectural Support and Middleware for InfoSymbiotics/ Dynamic Data Driven Applications Systems (DDDAS), co-located with High Performance Computing Conference (HiPC)},
      year = {2015},
      pages = {61},
      note = {Extended abstract},
      doi = {https://doi.org/10.1109/HiPCW.2015.22}
    }
    					
    simmhan:ipdps:2015 Simmhan, Y.; Choudhury, N.; Wickramaarachchi, C.; Kumbhare, A.; Frincu, M.; Raghavendra, C. & Prasanna, V.
    Distributed Programming over Time-series Graphs
    2015 IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 809-818   inproceedings graph processing, timeseries, goffish, iisc, usc, peer reviewed
    BibTeX:
    @inproceedings{simmhan:ipdps:2015,
      author = {Yogesh Simmhan and Neel Choudhury and Charith Wickramaarachchi and Alok Kumbhare and Marc Frincu and Cauligi Raghavendra and Viktor Prasanna},
      title = {Distributed Programming over Time-series Graphs},
      booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)},
      year = {2015},
      pages = {809--818},
      note = {[CORE A]},
      url = {http://ieeexplore.ieee.org/document/7161567/},
      doi = {https://doi.org/10.1109/IPDPS.2015.66}
    }
    					
    kumbhare:icdcs:2015 Kumbhare, A.; Frincu, M.; Simmhan, Y. & Prasanna, V.K.
    Fault-Tolerant and Elastic Streaming MapReduce with Decentralized Coordination
    2015 IEEE International Conference on Distributed Computing Systems (ICDCS) , pp. 328-338   inproceedings iisc, peer reviewed, mapreduce, stream processing
    BibTeX:
    @inproceedings{kumbhare:icdcs:2015,
      author = {Alok Kumbhare and Marc Frincu and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Fault-Tolerant and Elastic Streaming MapReduce with Decentralized Coordination},
      booktitle = {IEEE International Conference on Distributed Computing Systems (ICDCS)},
      year = {2015},
      pages = {328--338},
      note = {[Core A]},
      url = {http://ieeexplore.ieee.org/document/7164919/},
      doi = {https://doi.org/10.1109/ICDCS.2015.41}
    }
    					
    aman:sgcomm:2015 Aman, S.; Frincu, M.; Chelmis, C.; Noor, M.; Simmhan, Y. & Prasanna, V.K.
    Prediction Models for Dynamic Demand Response: Requirements, Challenges, and Insights
    2015 IEEE International Conference on Smart Grid Communications (SmartGridComm) , pp. 1-6   inproceedings iisc, peer reviewed, smart grid, iot
    BibTeX:
    @inproceedings{aman:sgcomm:2015,
      author = {Saima Aman and Marc Frincu and Charalampos Chelmis and Muhammad Noor and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Prediction Models for Dynamic Demand Response: Requirements, Challenges, and Insights},
      booktitle = {IEEE International Conference on Smart Grid Communications (SmartGridComm)},
      year = {2015},
      pages = {1--6},
      url = {http://ieeexplore.ieee.org/document/7436323/},
      doi = {https://doi.org/10.1109/SmartGridComm.2015.7436323}
    }
    					
    kushwaha:ccem:2014 Kushwaha, V. & Simmhan, Y.
    An Analysis of Spot-Priced Clouds for Practical Job Scheduling
    2014 IEEE Cloud Computing for Emerging Markets (CCEM) , pp. 1-8   inproceedings iisc, cloud, spot, peer reviewed
    BibTeX:
    @inproceedings{kushwaha:ccem:2014,
      author = {Vedsar Kushwaha and Yogesh Simmhan},
      title = {An Analysis of Spot-Priced Clouds for Practical Job Scheduling},
      booktitle = {IEEE Cloud Computing for Emerging Markets (CCEM)},
      year = {2014},
      pages = {1--8},
      doi = {https://doi.org/10.1109/CCEM.2014.7015488}
    }
    					
    chu:ipdps:2014 Chu, H.-Y. & Simmhan, Y.
    Cost-efficient and Resilient Job Life-cycle Management on Hybrid Clouds
    2014 IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 327-336   inproceedings usc, cloud, peer reviewed, iisc
    BibTeX:
    @inproceedings{chu:ipdps:2014,
      author = {Hsuan-Yi Chu and Yogesh Simmhan},
      title = {Cost-efficient and Resilient Job Life-cycle Management on Hybrid Clouds},
      booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)},
      year = {2014},
      pages = {327--336},
      note = {[CORE A]},
      url = {http://ieeexplore.ieee.org/document/6877267/},
      doi = {https://doi.org/10.1109/IPDPS.2014.43}
    }
    					
    govindarajan:comad:2014 Govindarajan, N.; Simmhan, Y.; Jamadagni, N. & Misra, P.
    Event Processing across Edge and the Cloud for Internet of Things Applications
    2014 International Conference on Management of Data (COMAD) , pp. 101-104   inproceedings iisc, event processing, cep, iot, peer reviewed, poster
    BibTeX:
    @inproceedings{govindarajan:comad:2014,
      author = {Nithyashri Govindarajan and Yogesh Simmhan and Nitin Jamadagni and Prasant Misra},
      title = {Event Processing across Edge and the Cloud for Internet of Things Applications},
      booktitle = {International Conference on Management of Data (COMAD)},
      year = {2014},
      pages = {101--104},
      note = {Short paper, [CORE B]},
      url = {http://dl.acm.org/citation.cfm?id=2726970.2726985}
    }
    					
    simmhan:europar:2014 Simmhan, Y.; Kumbhare, A.; Wickramaarachchi, C.; Nagarkar, S.; Ravi, S.; Raghavendra, C. & Prasanna, V.
    GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics
    2014
    Vol. 8632 International European Conference on Parallel Processing (Euro-Par) , pp. 451-462  
    inproceedings graphs, goffish, cluster, usc, peer reviewed, iisc
    BibTeX:
    @inproceedings{simmhan:europar:2014,
      author = {Yogesh Simmhan and Alok Kumbhare and Charith Wickramaarachchi and Soonil Nagarkar and Santosh Ravi and Cauligi Raghavendra and Viktor Prasanna},
      title = {GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics},
      booktitle = {International European Conference on Parallel Processing (Euro-Par)},
      year = {2014},
      volume = {8632},
      pages = {451--462},
      note = {[CORE A]},
      doi = {https://doi.org/10.1007/978-3-319-09873-9_38}
    }
    					
    kumbhare:ccgrid:2014 Kumbhare, A.; Simmhan, Y. & Prasanna, V.K.
    PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds
    2014 IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) , pp. 344-353   inproceedings continuous dataflow, workflow, floe, cloud, iisc, usc, peer reviewed, iisc
    BibTeX:
    @inproceedings{kumbhare:ccgrid:2014,
      author = {Alok Kumbhare and Yogesh Simmhan and Viktor K. Prasanna},
      title = {PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)},
      year = {2014},
      pages = {344--353},
      note = {[CORE A]},
      url = {http://ieeexplore.ieee.org/document/6846470/},
      doi = {https://doi.org/10.1109/CCGrid.2014.60}
    }
    					
    badam:comad:2014 Badam, N.C. & Simmhan, Y.
    Subgraph Rank: PageRank for SubgraphCentric Distributed Graph Processing
    2014 International Conference on Management of Data (COMAD) , pp. 38-49   inproceedings iisc, graph, goffish, algorithm, peer reviewed
    BibTeX:
    @inproceedings{badam:comad:2014,
      author = {Nitin Chandra Badam and Yogesh Simmhan},
      title = {Subgraph Rank: PageRank for SubgraphCentric Distributed Graph Processing},
      booktitle = {International Conference on Management of Data (COMAD)},
      year = {2014},
      pages = {38--49},
      note = {[CORE B]},
      url = {http://dl.acm.org/citation.cfm?id=2726970.2726979}
    }
    					
    simmhan:cise:2013 Simmhan, Y.; Aman, S.; Kumbhare, A.; Liu, R.; Stevens, S.; Zhou, Q. & Prasanna, V.
    Cloud-Based Software Platform for Big Data Analytics in Smart Grids
    2013 Computing in Science and Engineering
    Vol. 15 (4) , pp. 38 - 47  
    article usc, smart grid, cloud, peer reviewed
    BibTeX:
    @article{simmhan:cise:2013,
      author = {Yogesh Simmhan and Saima Aman and Alok Kumbhare and Rongyang Liu and Sam Stevens and Qunzhi Zhou and Viktor Prasanna},
      title = {Cloud-Based Software Platform for Big Data Analytics in Smart Grids},
      journal = {Computing in Science and Engineering},
      publisher = {IEEE and AIP},
      year = {2013},
      volume = {15},
      number = {4},
      pages = {38 - 47},
      note = {[IF 1.422, CORE C]},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cise-2013.pdf},
      doi = {https://doi.org/10.1109/MCSE.2013.39}
    }
    					
    Aman:comm:2013 Aman, S.; Simmhan, Y. & Prasanna, V.K.
    Energy Management Systems: State of the Art and Emerging Trends
    2013 IEEE Communications Magazine
    Vol. 51 (1) , pp. 114 -119  
    article smart grid, peer reviewed, usc
    BibTeX:
    @article{Aman:comm:2013,
      author = {Saima Aman and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Energy Management Systems: State of the Art and Emerging Trends},
      journal = {IEEE Communications Magazine},
      publisher = {IEEE},
      year = {2013},
      volume = {51},
      number = {1},
      pages = {114 -119},
      note = {[IF 3.785]},
      doi = {https://doi.org/10.1109/MCOM.2013.6400447}
    }
    					
    Wickramaarachchi:escience:2013 Wickramaarachchi, C. & Simmhan, Y.
    Continuous Dataflow Update Strategies for Mission-Critical Applications
    2013 IEEE Internatrional Conference on eScience (eScience) , pp. 155-163   inproceedings usc, cloud, workflow, continuous dataflow, peer reviewed
    BibTeX:
    @inproceedings{Wickramaarachchi:escience:2013,
      author = {Charith Wickramaarachchi and Yogesh Simmhan},
      title = {Continuous Dataflow Update Strategies for Mission-Critical Applications},
      booktitle = {IEEE Internatrional Conference on eScience (eScience)},
      year = {2013},
      pages = {155--163},
      note = {[CORE A]},
      url = {http://ceng.usc.edu/ simmhan/pubs/wickramaarachchi-escience-2013.pdf},
      doi = {https://doi.org/10.1109/eScience.2013.35}
    }
    					
    kumbhare:sc:2013 Kumbhare, A.; Simmhan, Y. & Prasanna, V.
    Exploiting Application Dynamism and Cloud Elasticity for Continuous Dataflows
    2013 IEEE/ACM International Conference for High Performance Computing Networking, Storage, and Analysis (SC) , pp. 1-12   inproceedings usc, cloud, workflow, continuous dataflow, peer reviewed
    BibTeX:
    @inproceedings{kumbhare:sc:2013,
      author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna},
      title = {Exploiting Application Dynamism and Cloud Elasticity for Continuous Dataflows},
      booktitle = {IEEE/ACM International Conference for High Performance Computing Networking, Storage, and Analysis (SC)},
      year = {2013},
      pages = {1--12},
      note = {[CORE A]},
      doi = {https://doi.org/10.1145/2503210.2503240}
    }
    					
    redekopp:ipdps:2013 Redekopp, M.; Simmhan, Y. & Prasanna, V.K.
    Optimizations and Analysis of BSP Graph Processing Models on Public Clouds
    2013 IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 203-214   inproceedings usc, cloud, graphs, azure, peer reviewed
    BibTeX:
    @inproceedings{redekopp:ipdps:2013,
      author = {Mark Redekopp and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Optimizations and Analysis of BSP Graph Processing Models on Public Clouds},
      booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)},
      year = {2013},
      pages = {203--214},
      note = {[CORE A]},
      url = {https://ieeexplore.ieee.org/document/6569812/},
      doi = {https://doi.org/10.1109/IPDPS.2013.76}
    }
    					
    simmhan:smartcities:2013 Simmhan, Y. & Noor, M.U.
    Scalable Prediction of Energy Consumption using Incremental Time Series Clustering
    2013 Workshop on Big Data and Smarter Cities, Co-located with IEEE International Conference on Big Data , pp. 29-36   inproceedings smart grid, analytics, usc, peer reviewed
    BibTeX:
    @inproceedings{simmhan:smartcities:2013,
      author = {Yogesh Simmhan and Muhammad Usman Noor},
      title = {Scalable Prediction of Energy Consumption using Incremental Time Series Clustering},
      booktitle = {Workshop on Big Data and Smarter Cities, Co-located with IEEE International Conference on Big Data},
      year = {2013},
      pages = {29--36},
      doi = {https://doi.org/10.1109/BigData.2013.6691774}
    }
    					
    zhou:bigdata:2013 Zhou, Q.; Simmhan, Y. & Prasanna, V.
    Towards Hybrid Online On-Demand Querying of Realtime Data with Stateful Complex Event Processing
    2013 IEEE International Conference on Big Data (BigData) , pp. 199-205   inproceedings smart grid, cep, usc, peer reviewed, short
    BibTeX:
    @inproceedings{zhou:bigdata:2013,
      author = {Qunzhi Zhou and Yogesh Simmhan and Viktor Prasanna},
      title = {Towards Hybrid Online On-Demand Querying of Realtime Data with Stateful Complex Event Processing},
      booktitle = {IEEE International Conference on Big Data (BigData)},
      year = {2013},
      pages = {199--205},
      doi = {https://doi.org/10.1109/BigData.2013.6691575}
    }
    					
    Simmhan:scale:2012 Simmhan, Y.; Agarwal, V.; Aman, S.; Kumbhare, A.; Natarajan, S.; Rajguru, N.; Robinson, I.; Stevens, S.; Yin, W.; Zhou, Q. & Prasanna, V.
    Adaptive Energy Forecasting and Information Diffusion for Smart Power Grids
    2012 IEEE International Scalable Computing Challenge (SCALE) , pp. 1-4   inproceedings hadoop, openplanet, floe, workflow, information integration, smart grid, peer reviewed, usc, short
    BibTeX:
    @inproceedings{Simmhan:scale:2012,
      author = {Yogesh Simmhan and Vaibhav Agarwal and Saima Aman and Alok Kumbhare and Sreedhar Natarajan and Nikhil Rajguru and Ian Robinson and Samuel Stevens and Wei Yin and Qunzhi Zhou and Viktor Prasanna},
      title = {Adaptive Energy Forecasting and Information Diffusion for Smart Power Grids},
      booktitle = {IEEE International Scalable Computing Challenge (SCALE)},
      year = {2012},
      pages = {1--4},
      note = {SCALE Challenge Winner},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-scale-2012.pdf}
    }
    					
    Kumbhare:cloud:2012 Kumbhare, A.; Simmhan, Y. & Prasanna, V.
    Cryptonite: A Secure and Performant Data Repository on Public Clouds
    2012 IEEE International Cloud Computing Conference (CLOUD) , pp. 510-517   inproceedings usc, smart grid, security, data privacy, cloud, azure, peer reviewed
    BibTeX:
    @inproceedings{Kumbhare:cloud:2012,
      author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna},
      title = {Cryptonite: A Secure and Performant Data Repository on Public Clouds},
      booktitle = {IEEE International Cloud Computing Conference (CLOUD)},
      year = {2012},
      pages = {510--517},
      note = {[CORE B]},
      url = {https://ieeexplore.ieee.org/document/6253545/},
      doi = {https://doi.org/10.1109/CLOUD.2012.109}
    }
    					
    Zhou:iswc:2012 Zhao, Q.; Simmhan, Y. & Prasanna, V.K.
    Incorporating Semantic Knowledge into Stream Processing for Smart Grid Applications
    2012
    Vol. 7650 International Semantic Web Conference (ISWC) , pp. 257-273  
    inproceedings peer reviewed, smart grid, cep, usc
    BibTeX:
    @inproceedings{Zhou:iswc:2012,
      author = {Qunzhi Zhao and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Incorporating Semantic Knowledge into Stream Processing for Smart Grid Applications},
      booktitle = {International Semantic Web Conference (ISWC)},
      year = {2012},
      volume = {7650},
      pages = {257--273},
      note = {[CORE A]},
      url = {http://iswc2012.semanticweb.org/sites/default/files/76500254.pdf},
      doi = {https://doi.org/10.1007/978-3-642-35173-0_17}
    }
    					
    Zhao:ipaw:2012 Zhao, J.; Simmhan, Y. & Prasanna, V.
    Presenting Apropos Provenance for Situation Awareness and Forensics
    2012
    Vol. 7525 International Proveanance and Annotation Workshop , pp. 250-253  
    inproceedings provenance, smart grid, usc, peer reviewed, short
    BibTeX:
    @inproceedings{Zhao:ipaw:2012,
      author = {Jing Zhao and Yogesh Simmhan and Viktor Prasanna},
      title = {Presenting Apropos Provenance for Situation Awareness and Forensics},
      booktitle = {International Proveanance and Annotation Workshop},
      publisher = {Springer},
      year = {2012},
      volume = {7525},
      pages = {250--253},
      note = {Poster},
      url = {http://dx.doi.org/10.1007/978-3-642-34222-6_30},
      doi = {https://doi.org/10.1007/978-3-642-34222-6_30}
    }
    					
    Yin:mapreduce:2012 Yin, W.; Simmhan, Y. & Prasanna, V.
    Scalable Regression Tree Learning on Hadoop using OpenPlanet
    2012 ACM International Workshop on MapReduce and its Applications (MAPREDUCE) , pp. 57-64   inproceedings cloud, machine learning, map reduce, hadoop, smart grid, peer reviewed, usc
    Abstract: As scientific and engineering domains attempt to effectively analyze the deluge of data arriving from sensors and instruments, machine learning is becoming a key data mining tool to build prediction models. Regression tree is a popular learning model that combines decision trees and linear regression to forecast numerical target variables based on a set of input features. Map Reduce is well suited for addressing such data intensive learning applications, and a proprietary regression tree algorithm, PLANET, using MapReduce has been proposed earlier. In this paper, we describe an open source implement of this algorithm, OpenPlanet, on the Hadoop framework using a hybrid approach. Further, we evaluate the performance of OpenPlanet using realworld datasets from the Smart Power Grid domain to perform energy use forecasting, and propose tuning strategies of Hadoop parameters to improve the performance of the default configuration by 75% for a training dataset of 17 million tuples on a 64-core Hadoop cluster on FutureGrid.
    BibTeX:
    @inproceedings{Yin:mapreduce:2012,
      author = {Wei Yin and Yogesh Simmhan and Viktor Prasanna},
      title = {Scalable Regression Tree Learning on Hadoop using OpenPlanet},
      booktitle = {ACM International Workshop on MapReduce and its Applications (MAPREDUCE)},
      year = {2012},
      pages = {57--64},
      url = {http://ceng.usc.edu/ simmhan/pubs/yin-mapreduce-2012.pdf},
      doi = {https://doi.org/10.1145/2287016.2287027}
    }
    					
    Zhou:itng:2012 Zhou, Q.; Natarajan, S.; Simmhan, Y. & Prasanna, V.
    Semantic Information Modeling for Emerging Applications in Smart Grid
    2012 IEEE International Conference on Information Technology : New Generations (ITNG) , pp. 775-782   inproceedings usc, smart grid, semantic, information integration, peer reviewed
    Abstract: Abstract—Smart Grid modernizes power grid by integrating digital and information technologies. Millions of smart meters, intelligent appliances and communication infrastructures are under deployment allowing advanced IT applications to be developed to protect and optimize power grid operations. Demand response (DR) is one such emerging application to optimize electricity demand by curtailing/shifting power load when peak load occurs. Existing DR approaches are mostly based on static plans such as pricing policies and load shedding schedules. However, improvements to power management applications rely on data emanated from existing and new information sources with the grow of Smart Grid information space. In particular, dynamic DR algorithms may depend on information from smart meters that report interval-based power consumption measurement, HVAC systems that monitor buildings heat and humidity, and even weather forecast services. In order for emerging Smart Grid applications to take advantage of the diverse data influx, extensible information integration is required. In this paper, we develop an integrated Smart Grid information model using Semantic Web techniques and present case studies of using semantic information for dynamic DR. We show the semantic model facilitates information integration and knowledge representation for developing the next generation Smart Grid applications.
    BibTeX:
    @inproceedings{Zhou:itng:2012,
      author = {Qunzhi Zhou and Sreedhar Natarajan and Yogesh Simmhan and Viktor Prasanna},
      title = {Semantic Information Modeling for Emerging Applications in Smart Grid},
      booktitle = {IEEE International Conference on Information Technology : New Generations (ITNG)},
      year = {2012},
      pages = {775--782},
      url = {http://dx.doi.org/10.1109/ITNG.2012.150},
      doi = {https://doi.org/10.1109/ITNG.2012.150}
    }
    					
    Simmhan:sciencecloud:2012 Simmhan, Y.; Antoniu, G.; Goble, C. & Ramakrishnan, L. Simmhan, Y.; Antoniu, G.; Goble, C. & Ramakrishnan, L. (Hrsg.)
    Proceedings of the 3rd International Workshop on Scientific Cloud Computing (ScienceCloud)
    2012   proceedings editorial, usc
    BibTeX:
    @proceedings{Simmhan:sciencecloud:2012,
      author = {Yogesh Simmhan and Gabriel Antoniu and Carole Goble and Lavanya Ramakrishnan},
      title = {Proceedings of the 3rd International Workshop on Scientific Cloud Computing (ScienceCloud)},
      publisher = {ACM},
      year = {2012}
    }
    					
    Simmhan:fgcs:2011 Simmhan, Y. & Barga, R. Simmhan, Y.; Groth, P. & Moreau, L. (Hrsg.)
    Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench
    2011 Future Generation Computer Systems (FGCS)
    Vol. 27 , pp. 790-796  
    article msr, provenance, opm, trident, workflow, inter-operability, provenance challenge, peer reviewed
    Abstract: The Trident workbench is a platform for composing, executing and managing scientific workflows. While Trident collects provenance in its native provenance model, the third provenance challenge was an opportunity to build support for the Open Provenance Model into Trident. There are several possible approaches to harmonize our native model with OPM, and such choices are also available to other existing provenance and workflow systems working towards OPM compatibility. We identify and analyze the relative merits of these approaches in an effort to inform practitioners planning to support OPM in their existing provenance/workflow systems. Further, we describe our experience with using the integration approach we choose to interoperate with other teams as part of the challenge.
    BibTeX:
    @article{Simmhan:fgcs:2011,
      author = {Yogesh Simmhan and Roger Barga},
      title = {Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench},
      journal = {Future Generation Computer Systems (FGCS)},
      publisher = {Elsevier},
      year = {2011},
      volume = {27},
      pages = {790--796},
      note = {[IF 2.43, CORE A]},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-fgcs-2011.pdf},
      doi = {https://doi.org/10.1016/j.future.2010.10.005}
    }
    					
    Zhao:ijca:2011 Zhao, J.; Simmhan, Y.; Gomadam, K. & Prasanna, V.K.
    Querying Provenance Information in Distributed Environments
    2011 International Journal of Computers and Their Applications (IJCA)
    Vol. 18 (3) , pp. 196-215  
    article usc, smart oilfield, provenance, peer reviewed, special issue
    Abstract: The growing recognition of the importance of provenance for data intensive and multidisciplinary domains is leading to careful collection of provenance. One consequence of this is the proliferation of provenance repositories hosted for individual organization or communities, with limited ability to reconstruct and query for and on provenance across them. Community standards like the Open Provenance Model (OPM) allow uniform interpretation and exchange of provenance metadata but do not prescribe query or service specifications to access provenance. If data reuse and sharing across institutions is not accompanied by passing provenance at the time of data exchange, we need to track the provenance and query for them or over them across distributed provenance repositories. In this article, we present approaches for querying over distributed provenance information, and address two common provenance query models that we formalize: provenance retrieval query and provenance filter query. Our problem is motivated by Smart Oilfield applications in the energy informatics domain, and we evaluate the performance of our algorithms using synthetic workflows based on the domain.
    BibTeX:
    @article{Zhao:ijca:2011,
      author = {Jing Zhao and Yogesh Simmhan and Karthik Gomadam and Viktor K. Prasanna},
      title = {Querying Provenance Information in Distributed Environments},
      journal = {International Journal of Computers and Their Applications (IJCA)},
      publisher = {ISCA},
      year = {2011},
      volume = {18},
      number = {3},
      pages = {196--215},
      url = {http://ceng.usc.edu/ simmhan/pubs/zhao-ijca-2011.pdf}
    }
    					
    Moreau:fgcs:2011 Moreau, L.; Clifford, B.; Freire, J.; Futrelle, J.; Gil, Y.; Groth, P.; Kwasnikowska, N.; Miles, S.; Missier, P.; Myers, J.; Plale, B.; Simmhan, Y.; Stephan, E. & den Bussche, J.V. Simmhan, Y.; Groth, P. & Moreau, L. (Hrsg.)
    The Open Provenance Model core specification (v1.1)
    2011 Future Generation Computer Systems (FGCS)
    Vol. 27 , pp. 743-756  
    article msr, provenance, opm, representation, inter-operability, peer reviewed
    Abstract: The Open Provenance Model is a model of provenance that is designed to meet the following requirements: (1) Allow provenance information to be exchanged between systems, by means of a compatibility layer based on a shared provenance model. (2) Allow developers to build and share tools that operate on such a provenance model. (3) Define provenance in a precise, technology-agnostic manner. (4) Support a digital representation of provenance for any “thing”, whether produced by computer systems or not. (5) Allow multiple levels of description to coexist. (6) Define a core set of rules that identify the valid inferences that can be made on provenance representation. This document contains the specification of the Open Provenance Model (v1.1) resulting from a community effort to achieve inter-operability in the Provenance Challenge series.
    BibTeX:
    @article{Moreau:fgcs:2011,
      author = {Luc Moreau and Ben Clifford and Juliana Freire and Joe Futrelle and Yolanda Gil and Paul Groth and Natalia Kwasnikowska and Simon Miles and Paolo Missier and Jim Myers and Beth Plale and Yogesh Simmhan and Eric Stephan and Jan Van den Bussche},
      title = {The Open Provenance Model core specification (v1.1)},
      journal = {Future Generation Computer Systems (FGCS)},
      publisher = {Elsevier},
      year = {2011},
      volume = {27},
      pages = {743--756},
      note = {[IF 2.43, CORE A]},
      url = {http://ceng.usc.edu/ simmhan/pubs/moreau-fgcs-2011.pdf},
      doi = {https://doi.org/10.1016/j.future.2010.07.005}
    }
    					
    Simmhan:ijca:2011 Simmhan, Y. & Plale, B.
    Using Provenance for Personalized Quality Ranking of Scientific Datasets
    2011 International Journal of Computers and Their Applications (IJCA)
    Vol. 18 (3) , pp. 180-195  
    article usc, provenance, iu, peer reviewed, karma, special issue
    Abstract: The rapid growth of eScience has led to an explosion in the creation and availability of scientific datasets that includes raw instrument data and derived datasets from model simulations. A large number of these datasets are surfacing online in public and private catalogs, often annotated with XML metadata, as part of community efforts to foster open research. With this rapid expansion comes the challenge of filtering and selecting datasets that best match the needs of scientists. We address a key aspect of the scientific data discovery process by ranking search results according to a personalized data quality score based on a declarative quality profile to help scientists select the most suitable data for their applications. Our quality model is resilient to missing metadata using a novel strategy that uses provenance in its absence. Intuitively, our premise is that the quality score for a dataset depends on its provenance – the scientific task and its inputs that created the dataset – and it is possible to define a quality function based on provenance metadata that predicts the same quality score as one evaluated using the user’s quality profile over the complete metadata. Here, we present a model and architecture for data quality scoring, apply machine learning techniques to construct a quality function that uses provenance as proxy for missing metadata, and empirically test the prediction power of our quality function. Our results show that for some scientific tasks, quality scores based on provenance closely track the quality scores based on complete metadata properties, with error margins between 1 – 29%.
    BibTeX:
    @article{Simmhan:ijca:2011,
      author = {Yogesh Simmhan and Beth Plale},
      title = {Using Provenance for Personalized Quality Ranking of Scientific Datasets},
      journal = {International Journal of Computers and Their Applications (IJCA)},
      publisher = {ISCA},
      year = {2011},
      volume = {18},
      number = {3},
      pages = {180--195},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-ijca-2011.pdf}
    }
    					
    Simmhan:greenit:2011 Simmhan, Y.; Zhou, Q. & Prasanna, V.K. Kim, J.H. & Lee, M.J. (Hrsg.)
    Semantic Information Integration for Smart Grid Applications ( Green IT: Technologies and Applications )
    2011 Green IT: Technologies and Applications , pp. 361-380   inbook usc, smart grid, semantic, information integration, peer reviewed
    Abstract: The Los Angeles Smart Grid Project aims to use informatics techniques to bring about a quantum leap in the way demand response load optimization is performed in utilities. Semantic information integration, from sources as diverse as Internet-connected smart meters and social networks, is a linchpin to support the advanced analytics and mining algorithms required for this. In association with it, semantic complex event processing system will allow consumer and utility managers to easily specify and enact energy policies continuously. We present the information systems architecture for the project that is under development, and discuss research issues that emerge from having to design a system that supports 1.4 million customers and a rich ecosystem of Smart Grid applications from users, third party vendors, the utility and regulators.
    BibTeX:
    @inbook{Simmhan:greenit:2011,
      author = {Yogesh Simmhan and Qunzhi Zhou and Viktor K. Prasanna},
      title = {Green IT: Technologies and Applications},
      publisher = {Springer Berlin Heidelberg},
      year = {2011},
      pages = {361--380},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-greenit-2011.pdf},
      doi = {https://doi.org/10.1007/978-3-642-22179-8_19}
    }
    					
    Simmhan:sciencecloud:2011 Simmhan, Y.; Cao, B.; Giakkoupis, M. & Prasanna, V.K.
    Adaptive rate stream processing for smart grid applications on clouds
    2011 ACM International Workshop on Scientific Cloud Computing (ScienceCloud) , pp. 33-38   inproceedings usc, smart grid, cloud, streaming, peer reviewed, short paper
    Abstract: Pervasive smart meters that continuously measure power usage by consumers within a smart (power) grid are providing utilities and power systems researchers with unprecedented volumes of information through streams that need to be processed and analyzed in near realtime. We introduce the use of Cloud platforms to perform scalable, latency sensitive stream processing for eEngineering applications in the smart grid domain. One unique aspect of our work is the use of adaptive rate control to throttle the rate of generation of power events by smart meters, which meets accuracy requirements of smart grid applications while consuming 50% lesser bandwidth resources in the Cloud.
    BibTeX:
    @inproceedings{Simmhan:sciencecloud:2011,
      author = {Yogesh Simmhan and Baohua Cao and Michail Giakkoupis and Viktor K. Prasanna},
      title = {Adaptive rate stream processing for smart grid applications on clouds},
      booktitle = {ACM International Workshop on Scientific Cloud Computing (ScienceCloud)},
      year = {2011},
      pages = {33--38},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-sciencecloud-2011.pdf},
      doi = {https://doi.org/10.1145/1996109.1996116}
    }
    					
    Simmhan:cloud:2011 Simmhan, Y.; Kumbhare, A.; Cao, B. & Prasanna, V.K.
    An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds
    2011 IEEE International Cloud Computing Conference (CLOUD) , pp. 582-589   inproceedings usc, cloud, security, privacy, smart grid, peer reviewed
    Abstract: Power utilities globally are increasingly upgrading to Smart Grids that use bi-directional communication with the consumer to enable an information-driven approach to distributed energy management. Clouds offer features well suited for Smart Grid software platforms and applications, such as elastic resources and shared services. However, the security and privacy concerns inherent in an informationrich Smart Grid environment are further exacerbated by their deployment on Clouds. Here, we present an analysis of security and privacy issues in a Smart Grids software architecture operating on different Cloud environments, in the form of a taxonomy. We use the Los Angeles Smart Grid Project that is underway in the largest U.S. municipal utility to drive this analysis that will benefit both Cloud practitioners targeting Smart Grid applications, and Cloud researchers investigating security and privacy.
    BibTeX:
    @inproceedings{Simmhan:cloud:2011,
      author = {Yogesh Simmhan and Alok Kumbhare and Baohua Cao and Viktor K. Prasanna},
      title = {An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds},
      booktitle = {IEEE International Cloud Computing Conference (CLOUD)},
      publisher = {IEEE},
      year = {2011},
      pages = {582--589},
      note = {[CORE B]},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cloud-2011.pdf},
      doi = {https://doi.org/10.1109/CLOUD.2011.107}
    }
    					
    Kumbhare:datacloud:2011 Kumbhare, A.; Simmhan, Y. & Prasanna, V.
    Designing a Secure Storage Repository for Sharing Scientific Datasets using Public Clouds
    2011 ACM International Workshop on Data Intensive Computing in the Clouds (DataCloud-SC11) , pp. 31-40   inproceedings peer reviewed, cloud, azure, security, smart grid, usc
    BibTeX:
    @inproceedings{Kumbhare:datacloud:2011,
      author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna},
      title = {Designing a Secure Storage Repository for Sharing Scientific Datasets using Public Clouds},
      booktitle = {ACM International Workshop on Data Intensive Computing in the Clouds (DataCloud-SC11)},
      year = {2011},
      pages = {31--40},
      url = {http://ceng.usc.edu/ simmhan/pubs/kumbhare-datacloud-2011.pdf},
      doi = {https://doi.org/10.1145/2087522.2087530}
    }
    					
    Aman:dddm:2011 Aman, S.; Simmhan, Y. & Prasanna, V.K.
    Improving Energy Use Forecast for Campus Micro-grids using Indirect Indicators
    2011 International Workshop on Domain Driven Data Mining (DDDM) , pp. 389-397   inproceedings usc, smart grid, machine learning, peer reviewed
    BibTeX:
    @inproceedings{Aman:dddm:2011,
      author = {Saima Aman and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Improving Energy Use Forecast for Campus Micro-grids using Indirect Indicators},
      booktitle = {International Workshop on Domain Driven Data Mining (DDDM)},
      year = {2011},
      pages = {389--397},
      url = {http://ceng.usc.edu/ simmhan/pubs/aman-dddm-2011.pdf},
      doi = {https://doi.org/10.1109/ICDMW.2011.95}
    }
    					
    Redekopp:pargraph:2011 Redekopp, M.; Simmhan, Y. & Prasanna, V.K.
    Performance Analysis of Vertex-centric Graph Algorithms on the Azure Cloud Platform
    2011 IEEE Workshop on Parallel Algorithms and Software for Analysis of Massive Graphs (ParGraph) , pp. 1-8   inproceedings graphs, azure, cloud, peer reviewed, usc
    Abstract: Finding key vertices in large graphs is an important problem in many applications such as social networks, bioinformatics, and distribution networks. Betweenness centrality is a popular algorithm for finding such vertices and has been studied extensively, yielding several parallel formulations suitable to supercomputers and clusters. In this paper we implement and study betweenness centrality in the context of cloud-based platforms using Microsoft Windows Azure as our case study. We demonstrate scalable parallel performance and investigate key issues related to a cloud-based implementation including mitigating penalties associated with VM failures as well as the impact of communication overheads in the cloud. We use a combination of empirical and analytical evaluation using both synthetic small-world and real-world social interaction graphs.
    BibTeX:
    @inproceedings{Redekopp:pargraph:2011,
      author = {Mark Redekopp and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Performance Analysis of Vertex-centric Graph Algorithms on the Azure Cloud Platform},
      booktitle = {IEEE Workshop on Parallel Algorithms and Software for Analysis of Massive Graphs (ParGraph)},
      year = {2011},
      pages = {1--8},
      url = {http://halcyon.usc.edu/ pk/prasannawebsite/papers/2011/redekopp-pargraph-2011.pdf}
    }
    					
    Simmhan:hpcdb:2011 Simmhan, Y.; van Ingen, C.; Heasley, J. & Szalay, A.
    Stargazing through a Digital Veil: Managing a Large Scale Sky Survey using Distributed Databases on HPC Clusters
    2011 Workshop on High-Performance Computing meets Databases (HPCDB) , pp. 33-36   inproceedings usc, msr, escience, data management, hpc, graywulf, panstarrs, databases, peer reviewed
    BibTeX:
    @inproceedings{Simmhan:hpcdb:2011,
      author = {Yogesh Simmhan and Catharine van Ingen and Jim Heasley and Alex Szalay},
      title = {Stargazing through a Digital Veil: Managing a Large Scale Sky Survey using Distributed Databases on HPC Clusters},
      booktitle = {Workshop on High-Performance Computing meets Databases (HPCDB)},
      year = {2011},
      pages = {33--36},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-hpcdb-2011.pdf},
      doi = {https://doi.org/10.1145/2125636.2125648}
    }
    					
    Zhou:debs:2011 Zhou, Q.; Simmhan, Y. & Prasanna, V.K.
    Towards an inexact semantic complex event processing framework
    2011 International Conference on Distributed Event-Based System (DEBS) , pp. 401-402   inproceedings usc, smart grid. cep, semantic, peer reviewed, poster
    Abstract: Complex event processing (CEP) deals with detecting real-time situations, represented as event patterns, from among an event cloud. The state-of-the-art CEP systems process events as plain data tuples and are limited to detect precisely defined patterns. Emerging application areas like optimization in smart power grids require CEP to incorporate semantic knowledge of the domain for easier pattern specification, and detect inexact patterns in the presence of uncertainties. In this paper, we present motivating use cases, discuss limitations of existing CEP systems and describe our work towards an Inexact Semantic Complex Event Processing (InSCEP) framework.
    BibTeX:
    @inproceedings{Zhou:debs:2011,
      author = {Qunzhi Zhou and Yogesh Simmhan and Viktor K. Prasanna},
      title = {Towards an inexact semantic complex event processing framework},
      booktitle = {International Conference on Distributed Event-Based System (DEBS)},
      publisher = {ACM},
      year = {2011},
      pages = {401--402},
      note = {Poster},
      url = {http://ceng.usc.edu/ simmhan/pubs/zhou-debs-2011.pdf},
      doi = {https://doi.org/10.1145/2002259.2002331}
    }
    					
    Simmhan:buildsys:2011 Simmhan, Y.; Prasanna, V.; Aman, S.; Natarajan, S.; Yin, W. & Zhou, Q.
    Towards Data-driven Demand-Response Optimization in a Campus Microgrid
    2011 Workshop On Embedded Sensing Systems For Energy-Efficiency In Buildings (BuildSys) , pp. 41-42   inproceedings usc, smart grid. information integration, cep, machine learning, peer reviewed, demo
    Abstract: We describe and demonstrate a prototype software architecture to support data-driven demand response optimization (DR) in the USC campus microgrid, as part of the Los Angeles Smart Grid Demonstration Project. The architecture includes a semantic information repository that integrates diverse data sources to support DR, demand forecasting using scalable machine-learned models, and detection of load curtailment opportunities by matching complex event patterns.
    BibTeX:
    @inproceedings{Simmhan:buildsys:2011,
      author = {Yogesh Simmhan and Viktor Prasanna and Saima Aman and Sreedhar Natarajan and Wei Yin and Qunzhi Zhou},
      title = {Towards Data-driven Demand-Response Optimization in a Campus Microgrid},
      booktitle = {Workshop On Embedded Sensing Systems For Energy-Efficiency In Buildings (BuildSys)},
      publisher = {ACM},
      year = {2011},
      pages = {41--42},
      note = {Demo},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-buildsys-2011.pdf},
      doi = {https://doi.org/10.1145/2434020.2434032}
    }
    					
    Zinn:ccgrid:2011 Zinn, D.; Hart, Q.; McPhillips, T.M.; Ludäscher, B.; Simmhan, Y.; Giakkoupis, M. & Prasanna, V.K.
    Towards Reliable, Performant Workflows for Streaming-Applications on Cloud Platforms
    2011 IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) , pp. 235-244   inproceedings usc, smart grid, cloud, streaming, peer reviewed, escience
    Abstract: Scientific workflows are commonplace in eScience applications. Yet, the lack of integrated support for data models, including streaming data, structured collections and files, is limiting the ability of workflows to support emerging applications in energy informatics that are stream oriented. This is compounded by the absence of Cloud data services that support reliable and performant streams. In this paper, we propose and present a scientific workflow framework that supports streams as first-class data, and is optimized for performant and reliable execution across desktop and Cloud platforms. The workflow framework features and its empirical evaluation on a private Eucalyptus Cloud are presented.
    BibTeX:
    @inproceedings{Zinn:ccgrid:2011,
      author = {Daniel Zinn and Quinn Hart and Timothy M. McPhillips and Bertram Ludäscher and Yogesh Simmhan and Michail Giakkoupis and Viktor K. Prasanna},
      title = {Towards Reliable, Performant Workflows for Streaming-Applications on Cloud Platforms},
      booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)},
      publisher = {IEEE},
      year = {2011},
      pages = {235--244},
      note = {[CORE A]},
      url = {http://ceng.usc.edu/ simmhan/pubs/zinn-ccgrid-2011.pdf},
      doi = {https://doi.org/10.1109/CCGrid.2011.74}
    }
    					
    Simmhan:HiPC:2011 Simmhan, Y. & Srinivasan, A. Simmhan, Y. & Srinivasan, A. (Hrsg.)
    HiPC 2011 Student Research Symposium: Message from the co-chairs
    2011 High Performance Computing Conference (HiPC)   proceedings editorial, usc
    BibTeX:
    @proceedings{Simmhan:HiPC:2011,
      author = {Yogesh Simmhan and Ashok Srinivasan},
      title = {HiPC 2011 Student Research Symposium: Message from the co-chairs},
      booktitle = {High Performance Computing Conference (HiPC)},
      year = {2011}
    }
    					
    Raicu:ScienceCloud2011 Raicu, I.; Beckman, P.; Foster, I.T. & Simmhan, Y. Raicu, I.; Beckman, P.; Foster, I.T. & Simmhan, Y. (Hrsg.)
    Proceedings of the 2nd International Workshop on Scientific Cloud Computing (ScienceCloud)
    2011   proceedings editorial, usc
    BibTeX:
    @proceedings{Raicu:ScienceCloud2011,
      author = {Ioan Raicu and Pete Beckman and Ian T. Foster and Yogesh Simmhan},
      title = {Proceedings of the 2nd International Workshop on Scientific Cloud Computing (ScienceCloud)},
      publisher = {ACM},
      year = {2011},
      url = {http://dx.doi.org/10.1145/1996109}
    }
    					
    Barga:deb:2010 Barga, R.; Simmhan, Y.; Withana, E.C.; Sahoo, S.; Jackson, J. & Araujo, N. Tan, W.-C. (Hrsg.)
    Provenance for Scientific Workflows: Towards Reproducible Research
    2010 Data Engineering Bulletin (DEB)
    Vol. 33 (3) , pp. 50-59  
    article msr, provenance, trident, workflow, peer reviewed
    BibTeX:
    @article{Barga:deb:2010,
      author = {Roger Barga and Yogesh Simmhan and Eran Chinthaka Withana and Satya Sahoo and Jared Jackson and Nelson Araujo},
      title = {Provenance for Scientific Workflows: Towards Reproducible Research},
      journal = {Data Engineering Bulletin (DEB)},
      publisher = {IEEE},
      year = {2010},
      volume = {33},
      number = {3},
      pages = {50--59},
      url = {http://sites.computer.org/debull/A10sept/barga.pdf}
    }
    					
    Simmhan:works:2010 Simmhan, Y.; Soroush, E.; van Ingen, C.; Agarwal, D. & Ramakrishnan, L.
    BReW: Blackbox resource selection for e-Science workflows
    2010 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS) , pp. 1-10   inproceedings msr, escience, workflow, cloud, scheduling, peer reviewed
    Abstract: Workflows are commonly used to model data intensive scientific analysis. As computational resource needs increase for eScience, emerging platforms like clouds present additional resource choices for scientists and policy makers. We introduce BReW, a tool enables users to make rapid, highlevel platform selection for their workflows using limited workflow knowledge. This helps make informed decisions on whether to port a workflow to a new platform. Our analysis of synthetic and real eScience workflows shows that using just total runtime length, maximum task fanout, and total data used and produced by the workflow, BReW can provide platform predictions comparable to whitebox models with detailed workflow knowledge.
    BibTeX:
    @inproceedings{Simmhan:works:2010,
      author = {Yogesh Simmhan and Emad Soroush and Catharine van Ingen and Deb Agarwal and Lavanya Ramakrishnan},
      title = {BReW: Blackbox resource selection for e-Science workflows},
      booktitle = {IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS)},
      year = {2010},
      pages = {1--10},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-works-2010.pdf},
      doi = {https://doi.org/10.1109/WORKS.2010.5671857}
    }
    					
    Simmhan:cloud:2010 Simmhan, Y.; van Ingen, C.; Subramanian, G. & Li, J.
    Bridging the Gap between Desktop and the Cloud for eScience Applications
    2010 IEEE International Cloud Computing Conference (CLOUD) , pp. 474-481   inproceedings msr, cloud, workflow, escience, generic worker, genomics, peer reviewed
    Abstract: The widely discussed scientific data deluge creates a need to computationally scale out eScience applications beyond the local desktop and cope with variable loads over time. Cloud computing offers a scalable, economic, on-demand model well matched to these needs. Yet cloud computing creates gaps that must be crossed to move existing science applications to the cloud. In this article, we propose a Generic Worker framework to deploy and invoke science applications in the cloud with minimal user effort and predictable cost-effective performance. Our framework addresses three distinct challenges posed by the cloud: the complexity of application deployment, invocation of cloud applications from desktop clients, and efficient transparent data transfers across desktop and the cloud. We present an implementation of the Generic Worker for the Microsoft Azure Cloud and evaluate its use for a genomics application. Our evaluation shows that the user complexity to port and scale the application is substantially reduced while introducing a negligible performance overhead of of <; 5% for the genomics application when scaling to 20 VM instances.
    BibTeX:
    @inproceedings{Simmhan:cloud:2010,
      author = {Yogesh Simmhan and Catharine van Ingen and Girish Subramanian and Jie Li},
      title = {Bridging the Gap between Desktop and the Cloud for eScience Applications},
      booktitle = {IEEE International Cloud Computing Conference (CLOUD)},
      publisher = {IEEE},
      year = {2010},
      pages = {474--481},
      note = {[CORE B]},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cloud-2010.pdf},
      doi = {https://doi.org/10.1109/CLOUD.2010.72}
    }
    					
    Simmhan:sciencecloud:2010 Simmhan, Y. & Ramakrishnan, L.
    Comparison of resource platform selection approaches for scientific workflows
    2010 International Workshop on Scientific Cloud Computing (ScienceCloud) , pp. 445-450   inproceedings msr, cloud, escience, hpc, resource management, workflows, azure, scheduling, peer reviewed, short paper
    Abstract: Cloud computing is increasingly considered as an additional computational resource platform for scientific workflows. The cloud offers opportunity to scale-out applications from desktops and local cluster resources. Each platform has different properties (e.g., queue wait times in high performance systems, virtual machine startup overhead in clouds) and characteristics (e.g., custom environments in cloud) that makes choosing from these diverse resource platforms for a workflow execution a challenge for scientists. Scientists are often faced with deciding resource platform selection trade-offs with limited information on the actual workflows. While many workflow planning methods have explored resource selection or task scheduling, these methods often require fine-scale characterization of the workflow that is onerous for a scientist. In this paper, we describe our early exploratory work in using blackbox characteristics for a cost-benefit analysis of using different resource platforms. In our blackbox method, we use only limited high-level information on the workflow length, width, and data sizes. The length and width are indicative of the workflow duration and parallelism. We compare the effectiveness of this approach to other resource selection models using two exemplar scientific workflows on desktop, local cluster, HPC center, and cloud platforms. Early results suggest that the blackbox model often makes the same resource selections as a more fine-grained whitebox model. We believe the simplicity of the blackbox model can help inform a scientist on the applicability of a new resource platform, such as cloud resources, even before porting an existing workflow.
    BibTeX:
    @inproceedings{Simmhan:sciencecloud:2010,
      author = {Yogesh Simmhan and Lavanya Ramakrishnan},
      title = {Comparison of resource platform selection approaches for scientific workflows},
      booktitle = {International Workshop on Scientific Cloud Computing (ScienceCloud)},
      publisher = {ACM},
      year = {2010},
      pages = {445--450},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-sciencecloud-2010.pdf},
      doi = {https://doi.org/10.1145/1851476.1851541}
    }
    					
    Simmhan:cloudcom:2010 Simmhan, Y.; Giakkoupis, M.; Cao, B. & Prasanna, V.K.
    On Using Cloud Platforms in a Software Architecture for Smart Energy Grids
    2010 International Conference on Cloud Computing Technology and Science (CloudCom) , pp. 1-3   inproceedings usc, energy informatics, smart grid, cloud, poster, peer reviewed
    Abstract: Increasing concern about energy consumption is leading to infrastructure that continuously monitors consumer energy usage and allow power utilities to provide dynamic feedback to curtail peak power load. Smart Grid infrastructure being deployed globally needs scalable software platforms to rapidly integrate and analyze information streaming from millions of smart meters, forecast power usage and respond to operational events. Cloud platforms are well suited to support such data and compute intensive, always-on applications. We examine opportunities and challenges of using cloud platforms for such applications in the emerging domain of energy informatics.
    BibTeX:
    @inproceedings{Simmhan:cloudcom:2010,
      author = {Yogesh Simmhan and Michail Giakkoupis and Baohua Cao and Viktor K. Prasanna},
      title = {On Using Cloud Platforms in a Software Architecture for Smart Energy Grids},
      booktitle = {International Conference on Cloud Computing Technology and Science (CloudCom)},
      publisher = {IEEE},
      year = {2010},
      pages = {1--3},
      note = {Poster [CORE C]},
      url = {http://salsahpc.indiana.edu/CloudCom2010/EPoster/cloudcom2010_submission_269.pdf}
    }
    					
    Simmhan:ipaw:2010 Simmhan, Y. & Gomadam, K. McGuinness, D.; Michaelis, J. & Moreau, L. (Hrsg.)
    Social Web-Scale Provenance in the Cloud
    2010
    Vol. 6378 International Provenance and Annotation Workshop (IPAW) , pp. 298-300  
    inproceedings msr, provenance, social network, cloud, poster, peer reviewed, short paper
    Abstract: The lower barrier to entry for users to create and share resources through applications like Facebook and Twitter, and the commoditization of social Web data has heightened issues of privacy, attribution, and copyright. These make it important to track the provenance of social Web data. We outline and discuss key engineering, privacy, and monetization challenges in collecting and analyzing provenance of social Web resources.
    BibTeX:
    @inproceedings{Simmhan:ipaw:2010,
      author = {Yogesh Simmhan and Karthik Gomadam},
      title = {Social Web-Scale Provenance in the Cloud},
      booktitle = {International Provenance and Annotation Workshop (IPAW)},
      publisher = {Springer Berlin / Heidelberg},
      year = {2010},
      volume = {6378},
      pages = {298--300},
      url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-ipaw-2010.pdf},
      doi = {https://doi.org/10.1007/978-3-642-17819-1_39}
    }
    					
    Zinn:works:2010 Zinn, D.; Hart, Q.; Ludascher, B. & Simmhan, Y.
    Streaming satellite data to cloud workflows for on-demand computing of environmental data products
    2010 Workshop on Workflows in Support of Large-Scale Science (WORKS) , pp. 1-8   inproceedings usc, streaming, workflow, cloud, escience, peer reviewed
    Abstract: Environmental data arriving constantly from satellites and weather stations are used to compute weather coefficients that are essential for agriculture and viticulture. For example, the reference evapotranspiration (ET0) coefficient, overlaid on regional maps, is provided each day by the California Department of Water Resources to local farmers and turf managers to plan daily water use. Scaling out single-processor compute/data intensive applications operating on realtime data to support more users and higher-resolution data poses data engineering challenges. Cloud computing helps data providers expand resource capacity to meet growing needs besides supporting scientific needs like reprocessing historic data using new models. In this article, we examine migration of a legacy script used for daily ET0 computation by CIMIS to a workflow model that eases deployment to and scaling on the Windows Azure Cloud. Our architecture incorporates a direct streaming model into Cloud virtual machines (VMs) that improves the performance by 130% to 160% for our workflow over using Cloud storage for data staging, used commonly. The streaming workflows achieve runtimes comparable to desktop execution for single VMs and a linear speed-up when using multiple VMs, thus allowing computation of environmental coefficients at a much larger resolution than done presently.
    BibTeX:
    @inproceedings{Zinn:works:2010,
      author = {Daniel Zinn and Quinn Hart and Bertram Ludascher and Yogesh Simmhan},
      title = {Streaming satellite data to cloud workflows for on-demand computing of environmental data products},
      booktitle = {Workshop on Workflows in Support of Large-Scale Science (WORKS)},
      publisher = {IEEE},
      year = {2010},
      pages = {1--8},
      url = {http://ceng.usc.edu/ simmhan/pubs/zinn-works-2010.pdf},
      doi = {https://doi.org/10.1109/WORKS.2010.5671841}
    }
    					
    Simmhan:escience:2009 Simmhan, Y.; van Ingen, C.; Szalay, A.; Barga, R. & Heasley, J.
    Building Reliable Data Pipelines for Managing Community Data Using Scientific Workflows
    2009 IEEE International Conference on eScience (eScience) , pp. 321-328   inproceedings msr, workflows, data management, cloud, panstarrs, escience, peer reviewed
    Abstract: The growing amount of scientific data from sensors and field observations is posing a challenge to ᅢツᅡ﾿data valetsᅢツᅡ﾿ responsible for managing them in data repositories. These repositories built on commodity clusters need to reliably ingest data continuously and ensure its availability to a wide user community. Workflows provide several benefits to modeling data-intensive science applications and many of these benefits can help manage the data ingest pipelines too. But using workflows is not panacea in itself and data valets need to consider several issues when designing workflows that behave reliably on fault prone hardware while retaining the consistency of the scientific data. In this paper, we propose workflow designs for reliable data ingest in a distributed environment and identify workflow framework features to support resilience. We illustrate these using the data pipeline for the Pan-STARRS repository, one of the largest digital surveys that accumulates 100TB of data annually to support 300 astronomers.
    BibTeX:
    @inproceedings{Simmhan:escience:2009,
      author = {Yogesh Simmhan and Catharine van Ingen and Alex Szalay and Roger Barga and Jim Heasley},
      title = {Building Reliable Data Pipelines for Managing Community Data Using Scientific Workflows},
      booktitle = {IEEE International Conference on eScience (eScience)},
      publisher = {IEEE},
      year = {2009},
      pages = {321--328},
      note = {[CORE A]},
      doi = {https://doi.org/10.1109/e-Science.2009.52}
    }
    					
    Simmhan:advcomp:2009 Simmhan, Y.; Barga, R.; van Ingen, C.; Lazowska, E. & Szalay, A.
    Building the Trident Scientific Workflow Workbench for Data Management in the Cloud
    2009 Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP) , pp. 41-50   inproceedings msr, workflows, escience, data management, cloud, hpc, trident, panstarrs, peer reviewed
    Abstract: Scientific workflows have gained popularity for modeling and executing in silico experiments by scientists for problem-solving. These workflows primarily engage in computation and data transformation tasks to perform scientific analysis in the Science Cloud. Increasingly workflows are gaining use in managing the scientific data when they arrive from external sensors and are prepared for becoming science ready and available for use in the Cloud. While not directly part of the scientific analysis, these workflows operating behind the Cloud on behalf of the -data valetsᅢツᅡ﾿ play an important role in end-to-end management of scientific data products. They share several features with traditional scientific workflows: both are data intensive and use Cloud resources. However, they also differ in significant respects, for example, in the reliability required, scheduling constraints and the use of provenance collected. In this article, we investigate these two classes of workflows - Science Application workflows and Data Preparation workflows - and use these to drive common and distinct requirements from workflow systems for eScience in the Cloud. We use workflow examples from two collaborations, the NEPTUNE oceanography project and the Pan-STARRS astronomy project, to draw out our comparison. Our analysis of these workflows classes can guide the evolution of workflow systems to support emerging applications in the Cloud and the Trident Scientific Workbench is one such workflow system that has directly benefitted from this to meet the needs of these two eScience projects.
    BibTeX:
    @inproceedings{Simmhan:advcomp:2009,
      author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Ed Lazowska and Alex Szalay},
      title = {Building the Trident Scientific Workflow Workbench for Data Management in the Cloud},
      booktitle = {Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP)},
      publisher = {IEEE},
      year = {2009},
      pages = {41--50},
      doi = {https://doi.org/10.1109/ADVCOMP.2009.14}
    }
    					
    Simmhan:hicss:2009 Simmhan, Y.; Barga, R.; van Ingen, C.; Nieto-Santisteban, M.; Dobos, L.; Li, N.; Shipway, M.; Szalay, A.S.; Werner, S. & Heasley, J.
    GrayWulf: Scalable Software Architecture for Data Intensive Computing
    2009 Hawaii International Conference on System Sciences (HICSS) , pp. 1-10   inproceedings msr, workflows, escience, data management, cloud, hpc, trident, graywulf, panstarrs, peer reviewed
    Abstract: Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf. These services are intended for deployment on a cluster of commodity servers similar to the well-known Beowulf clusters. We use the Pan-STARRS system currently under development as an example of the architecture and principles in action.
    BibTeX:
    @inproceedings{Simmhan:hicss:2009,
      author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Maria Nieto-Santisteban and Lazslo Dobos and Nolan Li and Michael Shipway and Alexander S. Szalay and Sue Werner and Jim Heasley},
      title = {GrayWulf: Scalable Software Architecture for Data Intensive Computing},
      booktitle = {Hawaii International Conference on System Sciences (HICSS)},
      publisher = {IEEE},
      year = {2009},
      pages = {1--10},
      note = {[CORE A]},
      doi = {https://doi.org/10.1109/HICSS.2009.235}
    }
    					
    Cao:swf:2009 Cao, B.; Plale, B.; Subramanian, G.; Robertson, E. & Simmhan, Y.
    Provenance Information Model of Karma Version 3
    2009 International Workshop on Scientific Workflows (SWF) , pp. 348-351   inproceedings msr, karma, provenance, workflow, peer reviewed
    Abstract: Provenance that captures e-Science activity has long term value only if the right amount and kind of information is collected. In this paper, we propose a two-layer model for representing provenance information capable of representing both execution information and higher level process details. The information model forms the basis for efficient relational database storage and query, and sets the stage for investigation of the necessary and sufficient information for long-term preservation.
    BibTeX:
    @inproceedings{Cao:swf:2009,
      author = {Bin Cao and Beth Plale and Girish Subramanian and Ed Robertson and Yogesh Simmhan},
      title = {Provenance Information Model of Karma Version 3},
      booktitle = {International Workshop on Scientific Workflows (SWF)},
      publisher = {IEEE},
      year = {2009},
      pages = {348--351},
      doi = {https://doi.org/10.1109/SERVICES-I.2009.54}
    }
    					
    Cao:swpm:2009 Cao, B.; Plale, B.; Subramanian, G.; Missier, P.; Goble, C. & Simmhan, Y. Freire, J.; Missier, P. & Sahoo, S.S. (Hrsg.)
    Semantically Annotated Provenance in the Life Science Grid
    2009
    Vol. 526 International Workshop on the role of Semantic Web in Provenance Management (SWPM) , pp. 1-6  
    inproceedings msr, provenance, karma, lsg, semantic web, life sciences, escience, peer reviewed
    Abstract: Selected semantic annotation on raw provenance data can help bridge the gap between low level provenance events (e.g., service invocations, data creation, message passing) and the high-level view that the user has of his/her investigation (e.g., data retrieval and analysis). In this initial investigation we added semantically annotated provenance to the Life Science Grid, a cyber-infrastructure framework supporting interactive data exploration and automated data analysis tools, through (i) automated data provenance collection and (ii) automated semantic enrichment of the collected provenance metadata. We use a paradigmatic life sciences use case of interactive data exploration to show that semantically annotated provenance can help users recognize the occurrence of specific patterns of investigation from an otherwise low-level sequence of elementary interaction events.
    BibTeX:
    @inproceedings{Cao:swpm:2009,
      author = {Bin Cao and Beth Plale and Girish Subramanian and Paolo Missier and Carole Goble and Yogesh Simmhan},
      title = {Semantically Annotated Provenance in the Life Science Grid},
      booktitle = {International Workshop on the role of Semantic Web in Provenance Management (SWPM)},
      publisher = {CEUR-WS.org},
      year = {2009},
      volume = {526},
      pages = {1--6},
      url = {http://ceur-ws.org/Vol-526/paper_5.pdf}
    }
    					
    Simmhan:ijwsr:2008 Simmhan, Y.L.; Plale, B. & Gannon, D.
    Karma2: Provenance Management for Data-Driven Workflows
    2008 International Journal of Web Services Research (IJWSR)
    Vol. 5 (2) , pp. 1-22  
    article msr, provenance, karma, workflow, escience, peer reviewed
    Abstract: The increasing ability for the sciences to sense the world around us is resulting in a growing need for datadriven e-Science applications that are under the control of workflows composed of services on the Grid. The focus of our work is on provenance collection for these workflows that are necessary to validate the work-flow and to determine quality of generated data products. The challenge we address is to record uniform and usable provenance metadata that meets the domain needs while minimizing the modification burden on the service authors and the performance overhead on the workflow engine and the services. The framework is based on generating discrete provenance activities during the lifecycle of a workflow execution that can be aggregated to form complex data and process provenance graphs that can span across workflows. The implementation uses a loosely coupled publish-subscribe architecture for propagating these activities, and the capabilities of the system satisfy the needs of detailed provenance collection. A performance evaluation of a prototype finds a minimal performance overhead (in the range of 1% for an eight-service workflow using 271 data products).
    BibTeX:
    @article{Simmhan:ijwsr:2008,
      author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon},
      title = {Karma2: Provenance Management for Data-Driven Workflows},
      journal = {International Journal of Web Services Research (IJWSR)},
      publisher = {IGI Publishing},
      year = {2008},
      volume = {5},
      number = {2},
      pages = {1--22},
      note = {[IF 0.371, CORE C]},
      doi = {https://doi.org/10.4018/jwsr.2008040101}
    }
    					
    Simmhan:cpe:2008 Simmhan, Y.L.; Plale, B. & Gannon, D.
    Query capabilities of the Karma provenance framework
    2008 Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge
    Vol. 20 , pp. 441-451  
    article iu, provenance, data provenance, process provenance, provenance queries, workflows, karma, escience, provenance challenge, peer reviewed
    Abstract: Provenance metadata in e-Science captures the derivation history of data products generated from scientific workflows. Provenance forms a glue linking workflow execution with associated data products, and finds use in determining the quality of derived data, tracking resource usage, and for verifying and validating scientific experiments. In this article, we discuss the scope of provenance collected in the Karma provenance framework used in the LEAD Cyberinfrastructure project, distinguishing provenance metadata from generic annotations. We further describe our approaches to querying for different forms of provenance in Karma in the context of queries in the first provenance challenge. We use an incremental, building-block method to construct provenance queries based on the fundamental querying capabilities provided by the Karma service centered on the provenance data model. This has the advantage of keeping the Karma service generic and simple, and yet supports a wide range of queries. Karma successfully answers all but one challenge query. Copyright © 2007 John Wiley & Sons, Ltd.
    BibTeX:
    @article{Simmhan:cpe:2008,
      author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon},
      title = {Query capabilities of the Karma provenance framework},
      journal = {Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge},
      publisher = {John Wiley and Sons Ltd.},
      year = {2008},
      volume = {20},
      pages = {441--451},
      note = {[IF 0.636, CORE A]},
      doi = {https://doi.org/10.1002/cpe.v20:5}
    }
    					
    Moreau:cpe:2008 Moreau, L.; Ludäscher, B.; Altintas, I.; Barga, R.S.; Bowers, S.; Callahan, S.; George Chin, J.; Clifford, B.; Cohen, S.; Cohen-Boulakia, S.; Davidson, S.; Deelman, E.; Digiampietri, L.; Foster, I.; Freire, J.; Frew, J.; Futrelle, J.; Gibson, T.; Gil, Y.; Goble, C.; Golbeck, J.; Groth, P.; Holland, D.A.; Jiang, S.; Kim, J.; Koop, D.; Krenek, A.; McPhillips, T.; Mehta, G.; Miles, S.; Metzger, D.; Munroe, S.; Myers, J.; Plale, B.; Podhorszki, N.; Ratnakar, V.; Santos, E.; Scheidegger, C.; Schuchardt, K.; Seltzer, M.; Simmhan, Y.L.; Silva, C.; Slaughter, P.; Stephan, E.; Stevens, R.; Turi, D.; Vo, H.; Wilde, M.; Zhao, J. & Zhao, Y.
    Special Issue: The First Provenance Challenge
    2008 Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge
    Vol. 20 , pp. 409-418  
    article iu, provenance, provenance challenge
    Abstract: The first Provenance Challenge was set up in order to provide a forum for the community to understand the capabilities of different provenance systems and the expressiveness of their provenance representations. To this end, a functional magnetic resonance imaging workflow was defined, which participants had to either simulate or run in order to produce some provenance representation, from which a set of identified queries had to be implemented and executed. Sixteen teams responded to the challenge, and submitted their inputs. In this paper, we present the challenge workflow and queries, and summarize the participants' contributions. Copyright © 2007 John Wiley & Sons, Ltd.
    BibTeX:
    @article{Moreau:cpe:2008,
      author = {Luc Moreau and Bertram Ludäscher and Ilkay Altintas and Roger S. Barga and Shawn Bowers and Steven Callahan and George Chin, Jr. and Ben Clifford and Shirley Cohen and Sarah Cohen-Boulakia and Susan Davidson and Ewa Deelman and Luciano Digiampietri and Ian Foster and Juliana Freire and James Frew and Joe Futrelle and Tara Gibson and Yolanda Gil and Carole Goble and Jennifer Golbeck and Paul Groth and David A. Holland and Sheng Jiang and Jihie Kim and David Koop and Ales Krenek and Timothy McPhillips and Gaurang Mehta and Simon Miles and Dominic Metzger and Steve Munroe and Jim Myers and Beth Plale and Norbert Podhorszki and Varun Ratnakar and Emanuele Santos and Carlos Scheidegger and Karen Schuchardt and Margo Seltzer and Yogesh L. Simmhan and Claudio Silva and Peter Slaughter and Eric Stephan and Robert Stevens and Daniele Turi and Huy Vo and Mike Wilde and Jun Zhao and Yong Zhao},
      title = {Special Issue: The First Provenance Challenge},
      journal = {Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge},
      publisher = {John Wiley and Sons Ltd.},
      year = {2008},
      volume = {20},
      pages = {409-418},
      note = {[CORE A]},
      doi = {https://doi.org/10.1002/cpe.v20:5}
    }
    					
    Gannon:hpcbook:2008 Gannon, D.; Plale, B.; Christie, M.; Huang, Y.; Jensen, S.; Liu, N.; Marru, S.; Pallickara, S.; Perera, S.; Shirasuna, S.; Simmhan, Y.; Slominski, A.; Sun, Y. & Vijayakumar, N. Grandinetti, L. (Hrsg.)
    Building Grid Portals for e-Science: A Service Oriented Architecture ( High Performance Computing and Grids in Action )
    2008 High Performance Computing and Grids in Action
    Vol. 16 , pp. 149-166  
    inbook iu,escience, portal, web service, lead, peer reviewed
    Abstract: Grids are built by communities who need a shared cyberinfrastructure to make progress on the critical problems they are currently confronting. An e-science portal is a conventional Web portal that sits on top of a rich collection of web services that allow a community of users access to shared data and application resources without exposing them to the details of Grid computing. In this chapter we describe a service-oriented architecture to support this type of portal.
    BibTeX:
    @inbook{Gannon:hpcbook:2008,
      author = {Dennis Gannon and Beth Plale and Marcus Christie and Yi Huang and Scott Jensen and Ning Liu and Suresh Marru and Sangmi Pallickara and Srinath Perera and Satoshi Shirasuna and Yogesh Simmhan and Aleksander Slominski and Yiming Sun and Nithya Vijayakumar},
      title = {High Performance Computing and Grids in Action},
      publisher = {IOS Press},
      year = {2008},
      volume = {16},
      pages = {149--166},
      url = {http://www.booksonline.iospress.nl/Content/View.aspx?piid=8567}
    }
    					
    Barga:clade:2008 Barga, R.S.; Fay, D.; Guo, D.; Newhouse, S.; Simmhan, Y. & Szalay, A.
    Efficient scheduling of scientific workflows in a high performance computing cluster
    2008 International Workshop on Challenges of Large Applications in Distributed Environments (CLADE) , pp. 63-68   inproceedings msr, data intensive, escience, scheduling, workflow, hpc, peer reviewed
    Abstract: The scientific computing community, especially academia is clearly in need of technology to handle and organize the 1-100+ Terabyte datasets coming from computer simulations and scientific instrumentation. In this paper we briefly describe GrayWulf, an exemplar cluster for data intensive applications using SQL Server and HPC Clusters. One of the key software components of GrayWulf is Trident, a scientific workflow workbench that performs automatic scheduling of workflows across the cluster. We examine the challenges of scheduling workflows on GrayWulf, algorithms to improve performance, and present early results from applying Trident to schedule data loading workflows on GrayWulf for an actual e-Science project
    BibTeX:
    @inproceedings{Barga:clade:2008,
      author = {Roger S. Barga and Dan Fay and Dean Guo and Steven Newhouse and Yogesh Simmhan and Alex Szalay},
      title = {Efficient scheduling of scientific workflows in a high performance computing cluster},
      booktitle = {International Workshop on Challenges of Large Applications in Distributed Environments (CLADE)},
      publisher = {ACM},
      year = {2008},
      pages = {63--68},
      note = {[CORE C]},
      doi = {https://doi.org/10.1145/1383529.1383545}
    }
    					
    Simmhan:escience:2008 Simmhan, Y.; Barga, R.; van Ingen, C.; Lazowska, E. & Szalay, A.
    On Building Scientific Workflow Systems for Data Management in the Cloud
    2008 IEEE International Conference on eScience (eScience) , pp. 434-435   inproceedings msr, workflows, escience, data management, cloud, hpc, trident, panstarrs, poster, peer reviewed
    Abstract: Scientific workflows have become an archetype to model in silico experiments in the Cloud by scientists. There is a class of workflows that are used to by "data valets" to prepare raw data from scientific instruments into a science-ready form for use by scientists. These share data-intensive traits with traditional scientific workflows, yet differ significantly, for example, in the required degree of reliability and the type of provenance collected. We compare and contrast science application and data valet workflows through exemplar eScience projects to drive shared and unique requirements for scientific workflows across diverse users in a Science Cloud.
    BibTeX:
    @inproceedings{Simmhan:escience:2008,
      author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Ed Lazowska and Alex Szalay},
      title = {On Building Scientific Workflow Systems for Data Management in the Cloud},
      booktitle = {IEEE International Conference on eScience (eScience)},
      publisher = {IEEE},
      year = {2008},
      pages = {434--435},
      note = {Poster [CORE A]},
      doi = {https://doi.org/10.1109/eScience.2008.150}
    }
    					
    Barga:escience:2008 Barga, R.; Jackson, J.; Araujo, N.; Guo, D.; Gautam, N. & Simmhan, Y.
    The Trident Scientific Workflow Workbench
    2008 IEEE International Conference on eScience (eScience) , pp. 317-318   inproceedings msr, workflows, escience, trident, panstarrs, neptune, demo, peer reviewed
    Abstract: In our demonstration we present Trident, a scientific workflow workbench built on top of a commercial workflow system to leverage existing functionality to the extent possible. Trident is being developed in collaboration with the scientific computing community for use in a number of ongoing eScience projects that make use of scientific workflows, in particular the Pan-STARRS sky survey project and the Ocean Observatory Initiative. In our demonstration of Trident we will illustrate the ability to utilize both local and cloud resources for storage and execution, as well as services such as provenance, monitoring, logging and scheduling workflows over clusters. Our goal is to release Trident in early 2009 as an open source accelerator for others to use for eScience projects and to continue extending with support for new workflow features and services.
    BibTeX:
    @inproceedings{Barga:escience:2008,
      author = {Roger Barga and Jared Jackson and Nelson Araujo and Dean Guo and Nitin Gautam and Yogesh Simmhan},
      title = {The Trident Scientific Workflow Workbench},
      booktitle = {IEEE International Conference on eScience (eScience)},
      publisher = {IEEE},
      year = {2008},
      pages = {317--318},
      note = {Demo [CORE A]},
      doi = {https://doi.org/10.1109/eScience.2008.126}
    }
    					
    Gannon:wfbook:2007 Gannon, D.; Plale, B.; Marru, S.; Kandaswamy, G.; Simmhan, Y. & Shirasuna, S. Gannon, D.; Deelman, E.; Shields, M. & Taylor, I. (Hrsg.)
    Dynamic, Adaptive Workflows for Mesoscale Meteorology ( Workflows for eScience: Scientific Workflows for Grids )
    2007 Workflows for eScience: Scientific Workflows for Grids , pp. 126-142   inbook iu, workflows, grid, escience, peer reviewed
    Abstract: The Linked Environments for Atmospheric Discovery (LEAD) [122] is a National Science Foundation funded1 project to change the paradigm for mesoscale weather prediction from one of static, fixed-schedule computational forecasts to one that is adaptive and driven by weather events. It is a collaboration of eight institutions,2 led by Kelvin Droegemeier of the University of Oklahoma, with the goal of enabling far more accurate and timely predictions of tornadoes and hurricanes than previously considered possible. The traditional approach to weather prediction is a four-phase activity. In the first phase, data from sensors are collected. The sensors include ground instruments such as humidity and temperature detectors, and lightning strike detectors and atmospheric measurements taken from balloons, commercial aircraft, radars, and satellites. The second phase is data assimilation, in which the gathered data are merged together into a set of consistent initial and boundary conditions for a large simulation. The third phase is the weather prediction, which applies numerical equations to measured conditions in order to project future weather conditions. The final phase is the generation of visual images of the processed data products that are analyzed to make predictions. Each phase of activity is performed by one or more application components.
    BibTeX:
    @inbook{Gannon:wfbook:2007,
      author = {Dennis Gannon and Beth Plale and Suresh Marru and Gopi Kandaswamy and Yogesh Simmhan and Satoshi Shirasuna},
      title = {Workflows for eScience: Scientific Workflows for Grids},
      publisher = {Springer London},
      year = {2007},
      pages = {126--142},
      doi = {https://doi.org/10.1007/978-1-84628-757-2_9}
    }
    					
    Simmhan:gbpse:2006 Simmhan, Y.; Pallickara, S.; Vijayakumar, N. & Plale, B. Gaffney, P. & Pool, J. (Hrsg.)
    Data Management in Dynamic Environment-driven Computational Science
    2007
    Vol. 239 Grid-Based Problem Solving Environments , pp. 317-333  
    inproceedings iu, data management, lead, provenance, portal, mylead, karma, calder, escience, peer reviewed
    Abstract: Advances in numerical modeling, computational hardware and problem solving environments have driven the growth of computational science over the past decades. Science gateways, based on service oriented architectures and scientific workflows, provide yet another step in democratizing access to advanced numerical and scientific tools, computational resource and massive data storage, and fostering collaborations. Dynamic, data-driven applications, such as those found in weather forecasting, present interesting challenges to Science Gateways, which are being addressed as part of the LEAD Cyberinfrastructure project. In this article, we discuss three important data related problems faced by such adaptive data-driven environments: managing a user’s personal workspace and metadata on the Grid, tracking the provenance of scientific workflows and data products, and continuous data mining over observational weather data.
    BibTeX:
    @inproceedings{Simmhan:gbpse:2006,
      author = {Yogesh Simmhan and Sangmi Pallickara and Nithya Vijayakumar and Beth Plale},
      title = {Data Management in Dynamic Environment-driven Computational Science},
      booktitle = {Grid-Based Problem Solving Environments},
      publisher = {Springer Boston},
      year = {2007},
      volume = {239},
      pages = {317--333},
      doi = {https://doi.org/10.1007/978-0-387-73659-4_17}
    }
    					
    Ramakrishnan:iccs:2007 Ramakrishnan, L.; Simmhan, Y. & Plale, B. Shi, Y.; van Albada, G.; Dongarra, J. & Sloot, P. (Hrsg.)
    Realization of Dynamically Adaptive Weather Analysis and Forecasting in LEAD: Four Years Down the Road
    2007
    Vol. 4487 International Conference on Computational Science (ICCS) , pp. 1122-1129  
    inproceedings iu, lead, escience, workflow, peer reviewed
    Abstract: Linked Environments for Atmospheric Discovery (LEAD) is a large-scale cyberinfrastructure effort in support of mesoscale meteorology. One of the primary goals of the infrastructure is support for real-time dynamic, adaptive response to severe weather. In this paper we revisit the conception of dynamic adaptivity as appeared in our 2005 DDDAS workshop paper, and discuss changes since the original conceptualization, and lessons learned in working with a complex service oriented architecture in support of data driven science.
    BibTeX:
    @inproceedings{Ramakrishnan:iccs:2007,
      author = {Ramakrishnan, Lavanya and Simmhan, Yogesh and Plale, Beth},
      title = {Realization of Dynamically Adaptive Weather Analysis and Forecasting in LEAD: Four Years Down the Road},
      booktitle = {International Conference on Computational Science (ICCS)},
      publisher = {Springer Berlin / Heidelberg},
      year = {2007},
      volume = {4487},
      pages = {1122--1129},
      note = {[CORE A]},
      doi = {https://doi.org/10.1007/978-3-540-72584-8_147}
    }
    					
    Simmhan:icws:2006 Simmhan, Y.L.; Plale, B. & Gannon, D.
    A Framework for Collecting Provenance in Data-Centric Scientific Workflows
    2006 International Conference on Web Services (ICWS) , pp. 427-436   inproceedings iu, provenance, escience, karma, workflows, peer reviewed
    Abstract: The increasing ability for the earth sciences to sense the world around us is resulting in a growing need for data-driven applications that are under the control of data-centric workflows composed of grid- and web- services. The focus of our work is on provenance collection for these workflows, necessary to validate the workflow and to determine quality of generated data products. The challenge we address is to record uniform and usable provenance metadata that meets the domain needs while minimizing the modification burden on the service authors and the performance overhead on the workflow engine and the services. The framework, based on a loosely-coupled publish-subscribe architecture for propagating provenance activities, satisfies the needs of detailed provenance collection while a performance evaluation of a prototype finds a minimal performance overhead (in the range of 1% for an eight service workflow using 271 data products).
    BibTeX:
    @inproceedings{Simmhan:icws:2006,
      author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon},
      title = {A Framework for Collecting Provenance in Data-Centric Scientific Workflows},
      booktitle = {International Conference on Web Services (ICWS)},
      publisher = {IEEE},
      year = {2006},
      pages = {427--436},
      note = {[CORE A]},
      doi = {https://doi.org/10.1109/ICWS.2006.5}
    }
    					
    Simmhan:ipaw:2006 Simmhan, Y.L.; Plale, B. & Gannon, D. Moreau, L. & Foster, I. (Hrsg.)
    Performance Evaluation of the Karma Provenance Framework for Scientific Workflows
    2006
    Vol. 4145 International Provenance and Annotation Workshop (IPAW) , pp. 222-236  
    inproceedings iu, provenance, escience, karma, workflows, peer reviewed
    Abstract: Provenance about workflow executions and data derivations in scientific applications help estimate data quality, track resources, and validate in silico experiments. The Karma provenance framework provides a means to collect workflow, process, and data provenance from data-driven scientific workflows and is used in the Linked Environments for Atmospheric Discovery (LEAD) project. This paper presents a performance analysis of the Karma service as compared against the contemporary PReServ provenance service. Our study finds that Karma scales exceedingly well for collecting and querying provenance records, showing linear or sub-linear scaling with increasing number of provenance records and clients when tested against workloads in the order of 10,000 application-service invocations and over 36 concurrent clients.
    BibTeX:
    @inproceedings{Simmhan:ipaw:2006,
      author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon},
      title = {Performance Evaluation of the Karma Provenance Framework for Scientific Workflows},
      booktitle = {International Provenance and Annotation Workshop (IPAW)},
      publisher = {Springer Berlin / Heidelberg},
      year = {2006},
      volume = {4145},
      pages = {222--236},
      doi = {https://doi.org/10.1007/11890850_23}
    }
    					
    Simmhan:sciflow:2006 Simmhan, Y.L.; Plale, B. & Gannon, D.
    Towards a Quality Model for Effective Data Selection in Collaboratories
    2006 Workshop on Workflow and Data Flow for Scientific Applications (SciFlow) , pp. 1-4   inproceedings iu, provenance, escience, karma, workflows, short paper, peer reviewed
    Abstract: Data-driven scientific applications utilize workflow frameworks to execute complex dataflows, resulting in derived data products of unknown quality. We discuss our on-going research on a quality model that provides users with an integrated estimate of the data quality that is tuned to their application needs, and is available as a numerical quality score that enables uniform comparison of datasets, and increases community’s trust in derived data.
    BibTeX:
    @inproceedings{Simmhan:sciflow:2006,
      author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon},
      title = {Towards a Quality Model for Effective Data Selection in Collaboratories},
      booktitle = {Workshop on Workflow and Data Flow for Scientific Applications (SciFlow)},
      publisher = {IEEE},
      year = {2006},
      pages = {1--4},
      doi = {https://doi.org/10.1109/ICDEW.2006.150}
    }
    					
    Simmhan:record:2005 Simmhan, Y.; Plale, B. & Gannon, D.
    A Survey of Data Provenance in e-Science
    2005 SIGMOD Record
    Vol. 34 (3) , pp. 31-36  
    article iu, provenance, escience, peer reviewed
    Abstract: Data management is growing in complexity as large-scale applications take advantage of the loosely coupled resources brought together by grid middleware and by abundant storage capacity. Metadata describing the data products used in and generated by these applications is essential to disambiguate the data and enable reuse. Data provenance, one kind of metadata, pertains to the derivation history of a data product starting from its original sources. In this paper we create a taxonomy of data provenance characteristics and apply it to current research efforts in e-science, focusing primarily on scientific workflow approaches. The main aspect of our taxonomy categorizes provenance systems based on why they record provenance, what they describe, how they represent and store provenance, and ways to disseminate it. The survey culminates with an identification of open research problems in the field.
    BibTeX:
    @article{Simmhan:record:2005,
      author = {Yogesh Simmhan and Beth Plale and Dennis Gannon},
      title = {A Survey of Data Provenance in e-Science},
      journal = {SIGMOD Record},
      publisher = {ACM},
      year = {2005},
      volume = {34},
      number = {3},
      pages = {31--36},
      note = {[IF 0.667]},
      doi = {https://doi.org/10.1145/1084805.1084812}
    }
    					
    Gannon:ieee:2005 Gannon, D.; Alameda, J.; Chipara, O.; Christie, M.; Dukle, V.; Fang, L.; Farellee, M.; Fox, G.; Hampton, S.; Kandaswamy, G.; Kodeboyina, D.; Moad, C.; Pierce, M.; Plale, B.; Rossi, A.; Simmhan, Y.; Sarangi, A.; Slominski, A.; Shirasauna, S. & Thomas, T.
    Building Grid Portal Applications from a Web-Service Component Architecture
    2005 Proceedings of the IEEE, Special issue on Grid Computing
    Vol. 93 (3) , pp. 551-563  
    article iu,grid, portal,web service, peer reviewed
    Abstract: This paper describes an approach to building Grid applications based on the premise that users who wish to access and run these applications prefer to do so without becoming experts on Grid technology. We describe an application architecture based on wrapping user applications and application workflows as web services and web service resources.These services are visible to the users and to resource providers through a family of Grid portal components that can be used to configure, launch and monitor complex applications in the scientific language of the end user. The applications in this model are instantiated by an application factory service. The layered design of the architecture makes it possible for an expert to configure an application factory service with a custom user interface client that may be dynamical loaded into the portal.
    BibTeX:
    @article{Gannon:ieee:2005,
      author = {Dennis Gannon and Jay Alameda and Octav Chipara and Marcus Christie and Vinayak Dukle and Liang Fang and Matthew Farellee and Geoffrey Fox and Shawn Hampton and Gopi Kandaswamy and Deepti Kodeboyina and Charlie Moad and Marlon Pierce and Beth Plale and Albert Rossi and Yogesh Simmhan and Anuraag Sarangi and Aleksander Slominski and Satoshi Shirasauna and Thomas Thomas},
      title = {Building Grid Portal Applications from a Web-Service Component Architecture},
      journal = {Proceedings of the IEEE, Special issue on Grid Computing},
      publisher = {IEEE},
      year = {2005},
      volume = {93},
      number = {3},
      pages = {551--563},
      note = {[IF 6.81]},
      doi = {https://doi.org/10.1109/JPROC.2004.842756}
    }
    					
    Gannon:icsoc:2005 Gannon, D.; Plale, B.; Christie, M.; Fang, L.; Huang, Y.; Jensen, S.; Kandaswamy, G.; Marru, S.; Pallickara, S.L.; Shirasuna, S.; Simmhan, Y.; Slominski, A. & Sun, Y. Benatallah, B.; Casati, F. & Traverso, P. (Hrsg.)
    Service Oriented Architectures for Science Gateways on Grid Systems
    2005
    Vol. 3826 International Conference on Service-Oriented Computing (ICSOC) , pp. 21-32  
    inproceedings iu, portal, web service, grid, peer reviewed
    Abstract: Grid computing is about allocating distributed collections of resources including computers, storage systems, networks and instruments to form a coherent system devoted to a “virtual organization” of users who share a common interest in solving a complex problem or building an efficient agile enterprise. Service oriented architectures have emerged as the standard way to build Grids. This paper provides a brief look at the Open Grid Service Architecture, a standard being proposed by the Global Grid Forum, which provides the foundational concepts of most Grid systems. Above this Grid foundation is a layer of application-oriented services that are managed by workflow tools and “science gateway” portals that provide users transparent access to the applications that use the resources of a Grid. In this paper we will also describe these Gateway framework services and discuss how they relate to and use Grid services.
    BibTeX:
    @inproceedings{Gannon:icsoc:2005,
      author = {Dennis Gannon and Beth Plale and Marcus Christie and Liang Fang and Yi Huang and Scott Jensen and Gopi Kandaswamy and Suresh Marru and Sangmi Lee Pallickara and Satoshi Shirasuna and Yogesh Simmhan and Aleksander Slominski and Yiming Sun},
      title = {Service Oriented Architectures for Science Gateways on Grid Systems},
      booktitle = {International Conference on Service-Oriented Computing (ICSOC)},
      publisher = {Springer Berlin / Heidelberg},
      year = {2005},
      volume = {3826},
      pages = {21--32},
      note = {[CORE A]},
      doi = {https://doi.org/10.1007/11596141_3}
    }
    					
    Gannon:clade:2004 Gannon, D.; Krishnan, S.; Fang, L.; Kandaswamy, G.; Simmhan, Y. & Slominski, A. IEEE
    On Building Parallel and Grid Applications: Component Technology and Distributed Services
    2004 International Workshop on Challenges of Large Applications in Distributed Environments (CLADE) , pp. 44-51   inproceedings iu, grid, web service, escience, component, peer reviewed
    Abstract: Software Component Frameworks are well known in the commercial business application world and now this technology is being explored with great interest as a way to build large-scale scientific application on parallel computers. In the case of Grid systems, the current architectural model is based on the emerging web services framework. In this paper we describe progress that has been made on the Common Component Architecture model (CCA) and discuss its success and limitations when applied to problems in Grid computing. Our primary conclusion is that a component model fits very well with a services-oriented Grid, but the model of composition must allow for a very dynamic (both in space and it time) control of composition. We note that this adds a new dimension to conventional service workflow and it extends the “Inversion of Control” aspects of must component systems.
    BibTeX:
    @inproceedings{Gannon:clade:2004,
      author = {Dennis Gannon and Sriram Krishnan and Liang Fang and Gopi Kandaswamy and Yogesh Simmhan and Aleksander Slominski},
      title = {On Building Parallel and Grid Applications: Component Technology and Distributed Services},
      booktitle = {International Workshop on Challenges of Large Applications in Distributed Environments (CLADE)},
      year = {2004},
      pages = {44--51},
      note = {[CORE C]},
      doi = {https://doi.org/10.1109/CLADE.2004.1309091}
    }
    					
    Gannon:dbgs:2003 Gannon, D.; Christie, M.; Chipara, O.; Fang, L.; Farrellee, M.; Kandaswamy, G.; Lu, W.; Plale, B.; Slominski, A.; Sarangi, A. & Simmhan, Y.L.
    Building Grid Services for User Portals
    2003 Workshop on Designing and Building Grid Services (DBGS)   inproceedings iu, portal, grid, web service, escience, peer reviewed
    BibTeX:
    @inproceedings{Gannon:dbgs:2003,
      author = {Dennis Gannon and Marcus Christie and Octav Chipara and Liang Fang and Matthew Farrellee and Gopi Kandaswamy and Wei Lu and Beth Plale and Aleksander Slominski and Anuraag Sarangi and Yogesh L. Simmhan},
      title = {Building Grid Services for User Portals},
      booktitle = {Workshop on Designing and Building Grid Services (DBGS)},
      publisher = {GGF},
      year = {2003},
      url = {http://www.mcs.anl.gov/ keahey/DBGS/DBGS_files/dbgs_papers/gannon.pdf}
    }
    					
    Gannon:cluster:2002 Gannon, D.; Bramley, R.; Fox, G.; Smallen, S.; Rossi, A.; Ananthakrishnan, R.; Bertrand, F.; Chiu, K.; Farrellee, M.; Govindaraju, M.; Krishnan, S.; Ramakrishnan, L.; Simmhan, Y.; Slominski, A.; Ma, Y.; Olariu, C. & Rey-Cenvaz, N.
    Programming the Grid: Distributed Software Components, P2P and Grid Web Services for Scientific Applications
    2002 Cluster Computing
    Vol. 5 (3) , pp. 325-336  
    article iu, component, grid, web service, escience, peer reviewed
    Abstract: Computational Grids have become an important asset in large-scale scientific and engineering research. By providing a set of services that allow a widely distributed collection of resources to be tied together into a relatively seamless computing framework, teams of researchers can collaborate to solve problems that they could not have attempted before. Unfortunately the task of building Grid applications remains extremely difficult because there are few tools available to support developers. To build reliable and re-usable Grid applications, programmers must be equipped with a programming framework that hides the details of most Grid services and allows the developer a consistent, non-complex model in which applications can be composed from well tested, reliable sub-units. This paper describes experiences with using a software component framework for building Grid applications. The framework, which is based on the DOE Common Component Architecture (CCA), allows individual components to export function/service interfaces that can be remotely invoked by other components. The framework also provides a simple messaging/event system for asynchronous notification between application components. The paper also describes how the emerging Web-Services model fits with a component-oriented application design philosophy. To illustrate the connection between web services and Grid application programming we describe a simple design pattern for application factory services which can be used to simplify the task of building reliable Grid programs. Finally we address several issues of Grid programming that better understood from the perspective of Peer-to-Peer (P2P) systems. In particular we describe how models for collaboration and resource sharing fit well with many grid application scenarios.
    BibTeX:
    @article{Gannon:cluster:2002,
      author = {Dennis Gannon and Randall Bramley and Geoffrey Fox and Shava Smallen and Al Rossi and Rachana Ananthakrishnan and Felipe Bertrand and Kenneth Chiu and Matt Farrellee and Madhusudhan Govindaraju and Sriram Krishnan and Lavanya Ramakrishnan and Yogesh Simmhan and Aleksander Slominski and Yu Ma and Caroline Olariu and Nicolas Rey-Cenvaz},
      title = {Programming the Grid: Distributed Software Components, P2P and Grid Web Services for Scientific Applications},
      journal = {Cluster Computing},
      publisher = {Springer Netherlands},
      year = {2002},
      volume = {5},
      number = {3},
      pages = {325--336},
      note = {[IF 0.519]},
      doi = {https://doi.org/10.1023/A:1015633507128}
    }
    					
    Krishnan:sciprog:2002 Krishnan, S.; Bramley, R.; Gannon, D.; Ananthakrishnan, R.; Govindaraju, M.; Slominski, A.; Simmhan, Y.; Alameda, J.; Alkire, R.; Drews, T. & Webb, E.
    The XCAT Science Portal
    2002 Scientific Programming
    Vol. 10 (4) , pp. 303--317  
    article iu, component, portal, escience, peer reviewed
    Abstract: This paper describes the design and prototype implementation of the XCAT Grid Science Portal. The portal lets grid application programmers script complex distributed computations and package these applications with simple interfaces for others to use. Each application is packaged as a notebook which consists of webpages and editable parameterized scripts. The portal is a workstation-based specialized personal web server, capable of executing the application scripts and launching remote grid applications for the user. The portal server can receive event streams published by the application and grid resource information published by Network Weather Service(NWS) or Autopilot sensors. Notebooks can be published and stored in web based archives for others to retrieve and modify. The XCAT Grid Science Portal has been tested with various applications, including the distributed simulation of chemical processes in semiconductor manufacturing and collaboratory support for X-ray crystallographers.
    BibTeX:
    @article{Krishnan:sciprog:2002,
      author = {Sriram Krishnan and Randall Bramley and Dennis Gannon and Rachana Ananthakrishnan and Madhusudhan Govindaraju and Aleksander Slominski and Yogesh Simmhan and Jay Alameda and Richard Alkire and Timothy Drews and Eric Webb},
      title = {The XCAT Science Portal},
      journal = {Scientific Programming},
      publisher = {IOS Press},
      year = {2002},
      volume = {10},
      number = {4},
      pages = {303---317},
      note = {[IF 0.967]},
      url = {https://content.iospress.com/articles/scientific-programming/spr00107}
    }
    					

    Created by JabRef on 09/10/2020.