Key | Author / Editor / Organization | Title | Year | Journal / Conference / Book | Pub Type | Keywords |
---|---|---|---|---|---|---|
varshney:spe:2020 | Varshney, P. & Simmhan, Y. |
Characterizing Application Scheduling on Edge, Fog and Cloud Computing Resources
|
2020 |
Software: Practice and Experience Vol. 50 (5) , pp. 558-595 |
article | iisc, cloud, edge, fog, survey |
BibTeX:
@article{varshney:spe:2020, author = {Prateeksha Varshney and Yogesh Simmhan}, title = {Characterizing Application Scheduling on Edge, Fog and Cloud Computing Resources}, journal = {Software: Practice and Experience}, year = {2020}, volume = {50}, number = {5}, pages = {558--595}, doi = {https://doi.org/10.1002/spe.2699} } |
||||||
simmhan:jiisc:2020 | Simmhan, Y.; Rambha, T.; Khochare, A.; Ramesh, S.; Baranawal, A.; George, J.V.; Bhope, R.A.; Namtirtha, A.; Sundararajan, A.; Bhargav, S.S.; Thakkar, N. & Kiran, R. |
GoCoronaGo: Privacy Respecting Contact Tracing for COVID-19 Management
|
2020 | Journal of the Indian Institute of Science | article | |
BibTeX:
@article{simmhan:jiisc:2020, author = {Yogesh Simmhan and Tarun Rambha and Aakash Khochare and Shriram Ramesh and Animesh Baranawal and John Varghese George and Rahul Atul Bhope and Amrita Namtirtha and Amritha Sundararajan and Sharath Suresh Bhargav and Nihar Thakkar and Raj Kiran}, title = {GoCoronaGo: Privacy Respecting Contact Tracing for COVID-19 Management}, journal = {Journal of the Indian Institute of Science}, year = {2020}, note = {To Appear}, url = {https://arxiv.org/abs/2009.04916} } |
||||||
ramesh:ccgrid:2020 | Ramesh, S.; Baranawal, A. & Simmhan, Y. |
A Distributed Path Query Engine for Temporal Property Graphs
|
2020 | IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID) , pp. 499-508 | inproceedings | |
BibTeX:
@inproceedings{ramesh:ccgrid:2020, author = {Shriram Ramesh and Animesh Baranawal and Yogesh Simmhan}, title = {A Distributed Path Query Engine for Temporal Property Graphs}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID)}, year = {2020}, pages = {499--508}, doi = {https://doi.org/10.1109/CCGrid49817.2020.00-43} } |
||||||
gandhi:icde:2020 | Gandhi, S. & Simmhan, Y. |
An Interval-centric Model for Distributed Computing over Temporal Graphs
|
2020 | IEEE International Conference on Data Engineering (ICDE) , pp. 1129-1140 | inproceedings | |
BibTeX:
@inproceedings{gandhi:icde:2020, author = {Swapnil Gandhi and Yogesh Simmhan}, title = {An Interval-centric Model for Distributed Computing over Temporal Graphs}, booktitle = {IEEE International Conference on Data Engineering (ICDE)}, year = {2020}, pages = {1129--1140}, doi = {https://doi.org/10.1109/ICDE48307.2020.00102} } |
||||||
acharya:comsnet:2020 | Acharya, S.; Bharadwaj, A.; Simmhan, Y.; Gopalan, A.; Parag, P. & Tyagi, H. |
CORNET: A Co-Simulation Middleware for Robot Networks
|
2020 | IEEE International Conference on COMmunication Systems & NETworkS (COMSNETS) , pp. 245-251 | inproceedings | |
BibTeX:
@inproceedings{acharya:comsnet:2020, author = {Srikrishna Acharya and Amrutur Bharadwaj and Yogesh Simmhan and Aditya Gopalan and Parimal Parag and Himanshu Tyagi}, title = {CORNET: A Co-Simulation Middleware for Robot Networks}, booktitle = {IEEE International Conference on COMmunication Systems & NETworkS (COMSNETS)}, year = {2020}, pages = {245--251}, doi = {https://doi.org/10.1109/COMSNETS48256.2020.9027459} } |
||||||
garg:europar:2020 | Garg, D.; Shirolkar, P.; Shukla, A. & Simmhan, Y. |
TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage
|
2020 |
Vol. 12247 International Conference on Parallel and Distributed Computing (Euro-Par) , pp. 281-295 |
inproceedings | |
BibTeX:
@inproceedings{garg:europar:2020, author = {Dhruv Garg and Prathik Shirolkar and Anshu Shukla and Yogesh Simmhan}, title = {TorqueDB: Distributed Querying of Time-Series Data from Edge-local Storage}, booktitle = {International Conference on Parallel and Distributed Computing (Euro-Par)}, publisher = {Springer}, year = {2020}, volume = {12247}, pages = {281--295}, doi = {https://doi.org/10.1007/978-3-030-57675-2%5C_18} } |
||||||
simmhan:icfec:2020 | Simmhan, Y. & Varghese, B. (Hrsg.) |
Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)
|
2020 | proceedings | ||
BibTeX:
@proceedings{simmhan:icfec:2020,, title = {Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)}, year = {2020}, doi = {https://doi.org/10.1109/ICFEC50348.2020} } |
||||||
mueller:isorc:2020 | Mueller, F.; Cucinotta, T. & Simmhan, Y. (Hrsg.) |
Proceedings of the IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC)
|
2020 | proceedings | ||
BibTeX:
@proceedings{mueller:isorc:2020,, title = {Proceedings of the IEEE International Symposium on Object-Oriented Real-Time Distributed Computing (ISORC)}, year = {2020}, doi = {https://doi.org/10.1109/ISORC49007.2020} } |
||||||
buyya:csur:2019 | Buyya, R.; Srirama, S.N.; Casale, G.; Calheiros, R.N.; Simmhan, Y.; Varghese, B.; Gelenbe, E.; Javadi, B.; Vaquero, L.M.; Netto, M.A.S.; Toosi, A.N.; Rodriguez, M.A.; Llorente, I.M.; di Vimercati, S.D.C.; Samarati, P.; Milojicic, D.S.; Varela, C.A.; Bahsoon, R.; de Assunção, M.D.; Rana, O.; Zhou, W.; Jin, H.; Gentzsch, W.; Zomaya, A.Y. & Shen, H. |
A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade
|
2019 |
ACM Computing Surveys (CSUR) Vol. 51 (5) , pp. 105:1-105:38 |
article | iisc, cloud |
BibTeX:
@article{buyya:csur:2019, author = {Rajkumar Buyya and Satish Narayana Srirama and Giuliano Casale and Rodrigo N. Calheiros and Yogesh Simmhan and Blesson Varghese and Erol Gelenbe and Bahman Javadi and Luis Miguel Vaquero and Marco A. S. Netto and Adel Nadjaran Toosi and Maria Alejandra Rodriguez and Ignacio Mart\in Llorente and Sabrina De Capitani di Vimercati and Pierangela Samarati and Dejan S. Milojicic and Carlos A. Varela and Rami Bahsoon and Marcos Dias de Assunção and Omer Rana and Wanlei Zhou and Hai Jin and Wolfgang Gentzsch and Albert Y. Zomaya and Haiying Shen}, title = {A Manifesto for Future Generation Cloud Computing: Research Directions for the Next Decade}, journal = {ACM Computing Surveys (CSUR)}, year = {2019}, volume = {51}, number = {5}, pages = {105:1--105:38}, url = {https://arxiv.org/abs/1711.09123}, doi = {https://doi.org/10.1145/3241737} } |
||||||
varshney:tpds:2019 | Varshney, P. & Simmhan, Y. |
AutoBoT: Resilient and Cost-effective Scheduling of a Bag of Tasks on Spot VMs
|
2019 |
IEEE Transactions on Parallel and Distributed Systems (TPDS) Vol. 30 (7) , pp. 1512-1527 |
article | iisc, cloud, scheduling, spot vm |
BibTeX:
@article{varshney:tpds:2019, author = {Prateeksha Varshney and Yogesh Simmhan}, title = {AutoBoT: Resilient and Cost-effective Scheduling of a Bag of Tasks on Spot VMs}, journal = {IEEE Transactions on Parallel and Distributed Systems (TPDS)}, year = {2019}, volume = {30}, number = {7}, pages = {1512--1527}, doi = {https://doi.org/10.1109/TPDS.2018.2889851} } |
||||||
simhan:encycl:2019 | Simmhan, Y. Sakr, S. & Zomaya, A.Y. (Hrsg.) |
Big Data and Fog Computing (
Encyclopedia of Big Data Technologies
)
|
2019 | Encyclopedia of Big Data Technologies | inbook | iisc, big data, fog computing, iot, peer reviewed |
BibTeX:
@inbook{simhan:encycl:2019, author = {Yogesh Simmhan}, title = {Encyclopedia of Big Data Technologies}, publisher = {Springer}, year = {2019}, url = {http://arxiv.org/abs/1712.09552}, doi = {https://doi.org/10.1007/978-3-319-63962-8_41-1} } |
||||||
jaiswal:ipdpsw:2019 | Jaiswal, S.D. & Simmhan, Y. |
A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs
|
2019 | IEEE International Workshop on High-Performance Big Data, Deep Learning, and Cloud Computing (HPBDC), Co-located with IEEE International Parallel and Distributed Processing Symposium (IPDPS) , pp. 452-459 | inproceedings | iisc, graph, subgraph centric, algorithm |
BibTeX:
@inproceedings{jaiswal:ipdpsw:2019, author = {Siddharth D. Jaiswal and Yogesh Simmhan}, title = {A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs}, booktitle = {IEEE International Workshop on High-Performance Big Data, Deep Learning, and Cloud Computing (HPBDC), Co-located with IEEE International Parallel and Distributed Processing Symposium (IPDPS)}, year = {2019}, pages = {452--459}, url = {https://arxiv.org/abs/1903.06950}, doi = {https://doi.org/10.1109/IPDPSW.2019.00085} } |
||||||
khochare:icdcn:2019 | Khochare, A. & Simmhan, Y. |
A scalable and composable analytics platform for distributed wide-area tracking
|
2019 | ACM International Conference on Distributed Computing and Networking (ICDCN) , pp. 506 | inproceedings | |
BibTeX:
@inproceedings{khochare:icdcn:2019, author = {Aakash Khochare and Yogesh Simmhan}, title = {A scalable and composable analytics platform for distributed wide-area tracking}, booktitle = {ACM International Conference on Distributed Computing and Networking (ICDCN)}, year = {2019}, pages = {506}, note = {Extended Abstract}, doi = {https://doi.org/10.1145/3288599.3299753} } |
||||||
dindokar:cloud:2019 | Dindokar, R. & Simmhan, Y. |
Adaptive Partition Migration for Irregular Graph Algorithms on Elastic Resources
|
2019 | IEEE International Conference on Cloud Computing (CLOUD) , pp. 281-290 | inproceedings | iisc, graph, cloud, goffish |
BibTeX:
@inproceedings{dindokar:cloud:2019, author = {Ravikant Dindokar and Yogesh Simmhan}, title = {Adaptive Partition Migration for Irregular Graph Algorithms on Elastic Resources}, booktitle = {IEEE International Conference on Cloud Computing (CLOUD)}, year = {2019}, pages = {281--290}, note = {[CORE B]}, doi = {https://doi.org/10.1109/CLOUD.2019.00-28} } |
||||||
chaudhary:hipcw:2019 | Chaudhary, D.; Kahali, B. & Simmhan, Y. |
An Empirical Study on Efficient Storage of Human Genome Data
|
2019 | Women in Data Science and Computing Workshop, Co-located with IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) , pp. 87-92 | inproceedings | |
BibTeX:
@inproceedings{chaudhary:hipcw:2019, author = {Diksha Chaudhary and Bratati Kahali and Yogesh Simmhan}, title = {An Empirical Study on Efficient Storage of Human Genome Data}, booktitle = {Women in Data Science and Computing Workshop, Co-located with IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)}, year = {2019}, pages = {87--92}, doi = {https://doi.org/10.1109/HiPCW.2019.00030} } |
||||||
khochare:ccgrid:2019 | Khochare, A.; Ramachandra, S.; Ramesh, S. & Simmhan, Y. |
Dynamic Scaling of Video Analytics for Wide-area Tracking in Urban Spaces
|
2019 | IEEE International Scalable Computing Challenge (SCALE), Co-located with IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) , pp. 76-81 | inproceedings | iisc, edge, video analytics |
BibTeX:
@inproceedings{khochare:ccgrid:2019, author = {Aakash Khochare and Sheshadri Ramachandra and Shriram Ramesh and Yogesh Simmhan}, title = {Dynamic Scaling of Video Analytics for Wide-area Tracking in Urban Spaces}, booktitle = {IEEE International Scalable Computing Challenge (SCALE), Co-located with IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)}, year = {2019}, pages = {76--81}, note = {SCALE Challenge Winner}, doi = {https://doi.org/10.1109/CCGRID.2019.00018} } |
||||||
monga:icws:2019 | Monga, S.K.; Sheshadri K, R. & Simmhan, Y. |
ElfStore: A Resilient Data Storage Service for Federated Edge and Fog Resources
|
2019 | IEEE International Conference on Web Services (ICWS) , pp. 336-345 | inproceedings | iisc, edge, fog, storage, reliability |
BibTeX:
@inproceedings{monga:icws:2019, author = {Sumit Kumar Monga and Sheshadri K R and Yogesh Simmhan}, title = {ElfStore: A Resilient Data Storage Service for Federated Edge and Fog Resources}, booktitle = {IEEE International Conference on Web Services (ICWS)}, year = {2019}, pages = {336--345}, note = {[CORE A]}, doi = {https://doi.org/10.1109/ICWS.2019.00062} } |
||||||
alva:ccwi:2019 | Alva, P.; Sheetal Kumar, K.R.; Simmhan, Y. & Mohan Kumar, M.S. |
Enabling Equitable Water Supply in a Mega-city using a Big Data Analytics Platform
|
2019 | International Conference on Computing and Control for Water Industry (CCWI) , pp. 1-2 | inproceedings | |
BibTeX:
@inproceedings{alva:ccwi:2019, author = {Prithvi Alva and Sheetal Kumar K.R. and Yogesh Simmhan and Mohan Kumar M.S.}, title = {Enabling Equitable Water Supply in a Mega-city using a Big Data Analytics Platform}, booktitle = {International Conference on Computing and Control for Water Industry (CCWI)}, year = {2019}, pages = {1--2}, note = {Extended Abstract} } |
||||||
simmhan:escience:2019 | Simmhan, Y.; Hegde, M.; Zele, R.; Tripathi, S.N.; Nair, S.; Monga, S.K.; Sahu, R.; Dixit, K.; Sutaria, R.; Mishra, B.; Sharma, A. & Anand, S.V.R. |
SATVAM: Toward an IoT Cyber-Infrastructure for Low-Cost Urban Air Quality Monitoring
|
2019 | IEEE International Conference on eScience (eScience) , pp. 57-66 | inproceedings | |
BibTeX:
@inproceedings{simmhan:escience:2019, author = {Yogesh Simmhan and Malati Hegde and Rajesh Zele and Sachchida N. Tripathi and Srijith Nair and Sumit K. Monga and Ravi Sahu and Kuldeep Dixit and Ronak Sutaria and Brijesh Mishra and Anamika Sharma and Anand SVR}, title = {SATVAM: Toward an IoT Cyber-Infrastructure for Low-Cost Urban Air Quality Monitoring}, booktitle = {IEEE International Conference on eScience (eScience)}, year = {2019}, pages = {57--66}, doi = {https://doi.org/10.1109/eScience.2019.00014} } |
||||||
chaturvedi:isorc:2019 | Chaturvedi, S. & Simmhan, Y. |
Toward Resilient Stream Processing on Clouds using Moving Target Defense
|
2019 | IEEE International Symposium on Real-Time Distributed Computing (ISORC) , pp. 134-142 | inproceedings | |
BibTeX:
@inproceedings{chaturvedi:isorc:2019, author = {Shilpa Chaturvedi and Yogesh Simmhan}, title = {Toward Resilient Stream Processing on Clouds using Moving Target Defense}, booktitle = {IEEE International Symposium on Real-Time Distributed Computing (ISORC)}, year = {2019}, pages = {134--142}, doi = {https://doi.org/10.1109/ISORC.2019.00035} } |
||||||
shen:icfec:2019 | Shen, H. & Simmhan, Y. (Hrsg.) |
Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)
|
2019 | proceedings | ||
BibTeX:
@proceedings{shen:icfec:2019,, title = {Proceedings of the IEEE International Conference on Fog and Edge Computing (ICFEC)}, year = {2019}, url = {https://ieeexplore.ieee.org/xpl/conhome/8730889/proceeding} } |
||||||
ghosh:tcps:2018 | Ghosh, R. & Simmhan, Y. |
Distributed Scheduling of Event Analytics across Edge and Cloud
|
2018 |
ACM Transactions on Cyber-Physical Systems (TCPS) Vol. 2 (4) , pp. 24:1-24:28 |
article | iisc, peer reviewed, stream processing, edge computing, iot |
BibTeX:
@article{ghosh:tcps:2018, author = {Rajrup Ghosh and Yogesh Simmhan}, title = {Distributed Scheduling of Event Analytics across Edge and Cloud}, journal = {ACM Transactions on Cyber-Physical Systems (TCPS)}, year = {2018}, volume = {2}, number = {4}, pages = {24:1--24:28}, url = {https://arxiv.org/abs/1608.01537}, doi = {https://doi.org/10.1145/3140256} } |
||||||
shukla:jpdc:2018 | Shukla, A. & Simmhan, Y. |
Model-driven Scheduling for Distributed Stream Processing Systems
|
2018 |
Journal of Parallel and Distributed Computing (JPDC) Vol. 117 , pp. 98-114 |
article | peer reviewed, iisc, stream processing |
BibTeX:
@article{shukla:jpdc:2018, author = {Anshu Shukla and Yogesh Simmhan}, title = {Model-driven Scheduling for Distributed Stream Processing Systems}, journal = {Journal of Parallel and Distributed Computing (JPDC)}, year = {2018}, volume = {117}, pages = {98--114}, url = {https://arxiv.org/abs/1702.01785}, doi = {https://doi.org/10.1016/j.jpdc.2018.02.003} } |
||||||
heidari:csur:2018 | Heidari, S.; Simmhan, Y.; Calheiros, R.N. & Buyya, R. |
Scalable Graph Processing Frameworks: A Taxonomy and Open Challenges
|
2018 |
ACM Computing Surveys (CSUR) Vol. 51 (3) , pp. 1-53 |
article | peer reviewed, iisc, graph processing |
BibTeX:
@article{heidari:csur:2018, author = {Safiollah Heidari and Yogesh Simmhan and Rodrigo N. Calheiros and Rajkumar Buyya}, title = {Scalable Graph Processing Frameworks: A Taxonomy and Open Challenges}, journal = {ACM Computing Surveys (CSUR)}, year = {2018}, volume = {51}, number = {3}, pages = {1--53}, url = {https://dl.acm.org/citation.cfm?id=3199523}, doi = {https://doi.org/10.1145/3199523} } |
||||||
simmhan:spe:2018 | Simmhan, Y.; Ravindra, P.; Chaturvedi, S.; Hegde, M. & Ballamajalu, R. |
Towards a Data-driven IoT Software Architecture for Smart City Utilities
|
2018 |
Software: Practice and Experience Vol. 48 (7) , pp. 1390-1416 |
article | peer reviewed, iisc, smart city, iot |
BibTeX:
@article{simmhan:spe:2018, author = {Yogesh Simmhan and Pushkara Ravindra and Shilpa Chaturvedi and Malati Hegde and Rashmi Ballamajalu}, title = {Towards a Data-driven IoT Software Architecture for Smart City Utilities}, journal = {Software: Practice and Experience}, year = {2018}, volume = {48}, number = {7}, pages = {1390--1416}, url = {http://arxiv.org/abs/1803.02500}, doi = {https://doi.org/10.1002/spe.2580} } |
||||||
ghosh:ccgrid:2018 | Ghosh, R.; Reddy, S.P. & Simmhan, Y. |
Adaptive Energy-aware Scheduling of Dynamic Event Analytics across Edge and Cloud Resources
|
2018 | IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) , pp. 1-11 | inproceedings | |
BibTeX:
@inproceedings{ghosh:ccgrid:2018, author = {Rajrup Ghosh and Siva Prakash Reddy and Yogesh Simmhan}, title = {Adaptive Energy-aware Scheduling of Dynamic Event Analytics across Edge and Cloud Resources}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)}, year = {2018}, pages = {1--11}, note = {[CORE A]}, url = {https://arxiv.org/abs/1801.01087} } |
||||||
shukla:icdcs:2018 | Shukla, A. & Simmhan, Y. |
Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds
|
2018 | IEEE International Conference on Distributed Computing Systems (ICDCS) , pp. 1-11 | inproceedings | peer reviewed, iisc, stream processing |
BibTeX:
@inproceedings{shukla:icdcs:2018, author = {Anshu Shukla and Yogesh Simmhan}, title = {Toward Reliable and Rapid Elasticity for Streaming Dataflows on Clouds}, booktitle = {IEEE International Conference on Distributed Computing Systems (ICDCS)}, year = {2018}, pages = {1--11}, note = {[CORE A]}, url = {https://arxiv.org/abs/1712.00605} } |
||||||
badiger:europar:2018 | Badiger, S.; Baheti, S. & Simmhan, Y. |
VIoLET: A Large-scale Virtual Environment for Internet of Things
|
2018 | International European Conference on Parallel and Distributed Computing (EuroPar) , pp. 1-16 | inproceedings | iisc, peer reviewed, iot |
BibTeX:
@inproceedings{badiger:europar:2018, author = {Shreyas Badiger and Shrey Baheti and Yogesh Simmhan}, title = {VIoLET: A Large-scale Virtual Environment for Internet of Things}, booktitle = {International European Conference on Parallel and Distributed Computing (EuroPar)}, year = {2018}, pages = {1--16}, note = {[CORE A]}, url = {https://github.com/dream-lab/VIoLET} } |
||||||
jha:ccpe:2017 | Jha, S.; Luckow, D.S.K.A.; Rana, O. & amd Neil Chue Hong, Y.S. |
Introducing Distributed Dynamic Data-intensive (D3) Science: Understanding Applications and Infrastructure
|
2017 |
Concurrency and Computation: Practice and Experience Vol. 29 (8) |
article | peer reviewed, iisc, escience, big data |
BibTeX:
@article{jha:ccpe:2017, author = {Shantenu Jha and Daniel S. Katz Andre Luckow and Omer Rana and Yogesh Simmhan amd Neil Chue Hong}, title = {Introducing Distributed Dynamic Data-intensive (D3) Science: Understanding Applications and Infrastructure}, journal = {Concurrency and Computation: Practice and Experience}, year = {2017}, volume = {29}, number = {8}, url = {https://github.com/radical-project/3DPAS}, doi = {https://doi.org/10.1002/cpe.4032} } |
||||||
simmhan:iotn:2017 | Simmhan, Y. |
IoT Analytics Across Edge and Cloud Platforms
|
2017 | IEEE Internet of Things Newsletter | article | iisc, edge computing, iot |
BibTeX:
@article{simmhan:iotn:2017, author = {Yogesh Simmhan}, title = {IoT Analytics Across Edge and Cloud Platforms}, journal = {IEEE Internet of Things Newsletter}, year = {2017}, url = {http://iot.ieee.org/newsletter/may-2017/iot-analytics-across-edge-and-cloud-platforms} } |
||||||
zhou:fgcs:2017 | Zhou, Q.; Simmhan, Y. & Prasanna, V. |
Knowledge-infused and Consistent Complex Event Processing over Real-time and Persistent Streams
|
2017 |
Future Generation Computer Systems Vol. 76 , pp. 391-406 |
article | peer reviewed, cep, stream processing, semantics, iisc |
BibTeX:
@article{zhou:fgcs:2017, author = {Qunzhi Zhou and Yogesh Simmhan and Viktor Prasanna}, title = {Knowledge-infused and Consistent Complex Event Processing over Real-time and Persistent Streams}, journal = {Future Generation Computer Systems}, year = {2017}, volume = {76}, pages = {391--406}, doi = {https://doi.org/10.1016/j.future.2016.10.030} } |
||||||
shukla:ccpe:2017 | Shukla, A.; Chaturvedi, S. & Simmhan, Y. |
RIoTBench: An IoT Benchmark for Distributed Stream Processing Systems
|
2017 |
Concurrency and Computation: Practice and Experience Vol. 29 (21) , pp. 1-22 |
article | iisc, iot, stream processing, benchmark, peer reviewed |
BibTeX:
@article{shukla:ccpe:2017, author = {Anshu Shukla and Shilpa Chaturvedi and Yogesh Simmhan}, title = {RIoTBench: An IoT Benchmark for Distributed Stream Processing Systems}, journal = {Concurrency and Computation: Practice and Experience}, year = {2017}, volume = {29}, number = {21}, pages = {1--22}, url = {https://arxiv.org/abs/1701.08530}, doi = {https://doi.org/10.1002/cpe.4257} } |
||||||
kalyanasundaram:hipc:2017 | Kalyanasundaram, J. & Simmhan, Y. |
ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data Workloads
|
2017 | IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC) , pp. 1-10 | inproceedings | iisc, peer reviewed, big data, low power |
BibTeX:
@inproceedings{kalyanasundaram:hipc:2017, author = {Jayanth Kalyanasundaram and Yogesh Simmhan}, title = {ARM Wrestling with Big Data: A Study of Commodity ARM64 Server for Big Data Workloads}, booktitle = {IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC)}, year = {2017}, pages = {1--10}, note = {Best paper finalist, [CORE B]}, url = {https://arxiv.org/abs/1701.05996}, doi = {https://doi.org/10.1109/HiPC.2017.00032} } |
||||||
dindokar:hipcw:2017 | Dindokar, R. & Simmhan, Y. |
Characterization of Vertex-centric Breadth First Search for Lattice Graphs
|
2017 | IEEE International Workshop on Foundations in Big Data Computing (BigDF), Co-located with HiPC , pp. 1-8 | inproceedings | iisc, peer reviewed, graph processing |
BibTeX:
@inproceedings{dindokar:hipcw:2017, author = {Ravikant Dindokar and Yogesh Simmhan}, title = {Characterization of Vertex-centric Breadth First Search for Lattice Graphs}, booktitle = {IEEE International Workshop on Foundations in Big Data Computing (BigDF), Co-located with HiPC}, year = {2017}, pages = {1--8}, doi = {https://doi.org/10.1109/HiPCW.2017.00014} } |
||||||
chaturvedi:escience:2017 | Chaturvedi, S.; Tyagi, S. & Simmhan, Y. |
Collaborative Reuse of Streaming Dataflows in IoT Applications
|
2017 | IEEE International Conference on eScience (eScience) , pp. 1-10 | inproceedings | iisc, peer reviewed, iot, stream processing |
BibTeX:
@inproceedings{chaturvedi:escience:2017, author = {Shilpa Chaturvedi and Sahil Tyagi and Yogesh Simmhan}, title = {Collaborative Reuse of Streaming Dataflows in IoT Applications}, booktitle = {IEEE International Conference on eScience (eScience)}, year = {2017}, pages = {1--10}, note = {[CORE A]}, url = {https://arxiv.org/abs/1709.03332}, doi = {https://doi.org/10.1109/eScience.2017.54} } |
||||||
varshney:icfec:2017 | Varshney, P. & Simmhan, Y. |
Demystifying Fog Computing: Characterizing Architectures, Applications and Abstractions
|
2017 | IEEE International Conference on Fog and Edge Computing (ICFEC) , pp. 1-10 | inproceedings | peer reviewed, iisc, cloud, iot, fog, edge |
BibTeX:
@inproceedings{varshney:icfec:2017, author = {Prateeksha Varshney and Yogesh Simmhan}, title = {Demystifying Fog Computing: Characterizing Architectures, Applications and Abstractions}, booktitle = {IEEE International Conference on Fog and Edge Computing (ICFEC)}, year = {2017}, pages = {1--10}, url = {https://arxiv.org/abs/1702.06331}, doi = {https://doi.org/10.1109/ICFEC.2017.20} } |
||||||
khochare:iscocw:2017 | Khochare, A.; Ravindra, P.; Reddy, S.P. & Simmhan, Y. |
Distributed Video Analytics across Edge and Cloud using ECHO
|
2017 | International Conference on Service-Oriented Computing (ICSOC) Demo , pp. 1-6 | inproceedings | iisc, peer reviewed, iot, edge computing |
BibTeX:
@inproceedings{khochare:iscocw:2017, author = {Aakash Khochare and Pushkara Ravindra and Siva Prakash Reddy and Yogesh Simmhan}, title = {Distributed Video Analytics across Edge and Cloud using ECHO}, booktitle = {International Conference on Service-Oriented Computing (ICSOC) Demo}, year = {2017}, pages = {1--6}, url = {http://www.icsoc.spilab.es/wp-content/uploads/2017/10/Distributed-Video-Analytics-across-Edge-and-Cloud-using-ECHO.pdf} } |
||||||
ravindra:iscoc:2017 | Ravindra, P.; Khochare, A.; Reddy, S.P.; Sharma, S.; Varshney, P. & Simmhan, Y. |
ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge
|
2017 | International Conference on Service-Oriented Computing (ICSOC) , pp. 1-16 | inproceedings | iisc, peer reviewed, iot, edge computing |
BibTeX:
@inproceedings{ravindra:iscoc:2017, author = {Pushkara Ravindra and Aakash Khochare and Siva Prakash Reddy and Sarthak Sharma and Prateeksha Varshney and Yogesh Simmhan}, title = {ECHO: An Adaptive Orchestration Platform for Hybrid Dataflows across Cloud and Edge}, booktitle = {International Conference on Service-Oriented Computing (ICSOC)}, year = {2017}, pages = {1--16}, note = {[CORE A]}, url = {https://arxiv.org/abs/1707.00889}, doi = {https://doi.org/10.1007/978-3-319-69035-3_28} } |
||||||
simmhan:ccpe:2016 | Simmhan, Y.; Ramakrishnan, L.; Antoniu, G. & Goble, C. |
Editorial: Cloud computing for data-driven science and engineering
|
2016 | Concurrency and Computation: Practice and Experience | article | iisc, editorial |
BibTeX:
@article{simmhan:ccpe:2016, author = {Yogesh Simmhan and Lavanya Ramakrishnan and Gabriel Antoniu and Carole Goble}, title = {Editorial: Cloud computing for data-driven science and engineering}, journal = {Concurrency and Computation: Practice and Experience}, year = {2016}, url = {http://onlinelibrary.wiley.com/doi/10.1002/cpe.3668/full}, doi = {https://doi.org/10.1002/cpe.3668} } |
||||||
simmhan:bidatabook:2016 | Simmhan, Y. & Perera, S. Pyne, S.; Rao, B.L.S.P. & Rao, S.B. (Hrsg.) |
Big Data Analytics Platforms for Real-Time Applications in IoT (
Big Data Analytics: Methods and Applications
)
|
2016 | Big Data Analytics: Methods and Applications , pp. 115-135 | inbook | iisc, big data, peer reviewed |
BibTeX:
@inbook{simmhan:bidatabook:2016, author = {Yogesh Simmhan and Srinath Perera}, title = {Big Data Analytics: Methods and Applications}, publisher = {Springer India}, year = {2016}, pages = {115--135}, doi = {https://doi.org/10.1007/978-81-322-3628-3_7} } |
||||||
dindokar:bigdata:2016 | Dindokar, R.; Choudhury, N. & Simmhan, Y. |
A Meta-graph Approach to Analyze Subgraph-centric Distributed Programming Models
|
2016 | IEEE International Conference on Big Data (Big Data) , pp. 37-47 | inproceedings | graph, goffish, meta-graph, analysis, iisc, peer reviewed |
BibTeX:
@inproceedings{dindokar:bigdata:2016, author = {Ravikant Dindokar and Neel Choudhury and Yogesh Simmhan}, title = {A Meta-graph Approach to Analyze Subgraph-centric Distributed Programming Models}, booktitle = {IEEE International Conference on Big Data (Big Data)}, year = {2016}, pages = {37--47}, url = {http://ieeexplore.ieee.org/document/7840587/}, doi = {https://doi.org/10.1109/BigData.2016.7840587} } |
||||||
shukla:tpctc:2016 | Shukla, A. & Simmhan, Y. |
Benchmarking Distributed Stream Processing Platforms for IoT Applications
|
2016 |
Vol. 10080 TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC) , pp. 90-106 |
inproceedings | iot, peer reviewed, iisc, stream, benchmark |
BibTeX:
@inproceedings{shukla:tpctc:2016, author = {Anshu Shukla and Yogesh Simmhan}, title = {Benchmarking Distributed Stream Processing Platforms for IoT Applications}, booktitle = {TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC)}, year = {2016}, volume = {10080}, pages = {90--106}, url = {https://arxiv.org/abs/1606.07621}, doi = {https://doi.org/10.1007/978-3-319-54334-5_7} } |
||||||
dindokar:ccgrid:2016 | Dindokar, R. & Simmhan, Y. |
Elastic Partition Placement for Non-stationary Graph Algorithms
|
2016 | IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid) , pp. 90-93 | inproceedings | goffish, peer reviewed, iisc, graph, cloud |
BibTeX:
@inproceedings{dindokar:ccgrid:2016, author = {Ravikant Dindokar and Yogesh Simmhan}, title = {Elastic Partition Placement for Non-stationary Graph Algorithms}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid)}, year = {2016}, pages = {90--93}, note = {Short Paper, [CORE A]}, url = {http://ieeexplore.ieee.org/document/7515673/}, doi = {https://doi.org/10.1109/CCGrid.2016.97} } |
||||||
jamadagni:ccgrid:2016 | Jamadagni, N. & Simmhan, Y. |
GoDB: From Batch Processing to Distributed Querying over Property Graphs
|
2016 | IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid) , pp. 281-290 | inproceedings | godb, goffish, peer reviewed, iisc, graph |
BibTeX:
@inproceedings{jamadagni:ccgrid:2016, author = {Nitin Jamadagni and Yogesh Simmhan}, title = {GoDB: From Batch Processing to Distributed Querying over Property Graphs}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing (CCGrid)}, year = {2016}, pages = {281--290}, note = {[CORE A]}, url = {http://ieeexplore.ieee.org/document/7515700/}, doi = {https://doi.org/10.1109/CCGrid.2016.105} } |
||||||
aluru:jpdc:2015 | Aluru, S. & Simmhan, Y. |
Editorial: Scalable Systems for Big Data Management and Analytics
|
2015 | Journal of Parallel and Distributed Systems (JPDC) | article | editorial, iisc, big data |
BibTeX:
@article{aluru:jpdc:2015, author = {Srinivas Aluru and Yogesh Simmhan}, title = {Editorial: Scalable Systems for Big Data Management and Analytics}, journal = {Journal of Parallel and Distributed Systems (JPDC)}, year = {2015}, note = {To Appear} } |
||||||
Aman:tkde:2015 | Aman, S.; Simmhan, Y. & Prasanna, V. |
Holistic Measures for Evaluating Prediction Models in Smart Grids
|
2015 |
IEEE Transactions on Knowledge and Data Engineering (TKDE) Vol. 27 (2) , pp. 475-488 |
article | usc, machine learning, smart grid, peer reviewed, iisc |
BibTeX:
@article{Aman:tkde:2015, author = {Saima Aman and Yogesh Simmhan and Viktor Prasanna}, title = {Holistic Measures for Evaluating Prediction Models in Smart Grids}, journal = {IEEE Transactions on Knowledge and Data Engineering (TKDE)}, year = {2015}, volume = {27}, number = {2}, pages = {475--488}, note = {[IF 2.476, CORE A]}, doi = {https://doi.org/10.1109/TKDE.2014.2327022} } |
||||||
kumbhare:tcc:2015 | Kumbhare, A.G.; Simmhan, Y.; Frincu, M. & Prasanna, V.K. |
Reactive Resource Provisioning Heuristics for Dynamic Dataflows on Cloud Infrastructure
|
2015 |
IEEE Transactions on Cloud Computing (TCC) Vol. 3 (2) , pp. 105-118 |
article | peer reviewed, iisc, stream processing, cloud |
BibTeX:
@article{kumbhare:tcc:2015, author = {Alok Gautam Kumbhare and Yogesh Simmhan and Marc Frincu and Viktor K. Prasanna}, title = {Reactive Resource Provisioning Heuristics for Dynamic Dataflows on Cloud Infrastructure}, journal = {IEEE Transactions on Cloud Computing (TCC)}, year = {2015}, volume = {3}, number = {2}, pages = {105--118}, doi = {https://doi.org/10.1109/TCC.2015.2394316} } |
||||||
mishra:iotn:2015 | Misra, P.; Simmhan, Y. & Warrior, J. |
Towards a Practical Architecture for Internet of Things: An India-centric View
|
2015 | IEEE Internet of Things Newsletter , pp. 1-2 | article | iot, iisc |
BibTeX:
@article{mishra:iotn:2015, author = {Prasant Misra and Yogesh Simmhan and Jay Warrior}, title = {Towards a Practical Architecture for Internet of Things: An India-centric View}, journal = {IEEE Internet of Things Newsletter}, year = {2015}, pages = {1-2}, url = {http://iot.ieee.org/newsletter/january-2015/towards-a-practical-architecture-for-internet-of-things-an-india-centric-view.html} } |
||||||
dindokar:parlearning:2015 | Dindokar, R.; Choudhury, N. & Simmhan, Y. |
Analysis of Subgraph-centric Distributed Shortest Path Algorithm
|
2015 | IEEE International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning), Co-located with IPDPS , pp. 1185-1190 | inproceedings | peer reviewed, iisc, graph processing |
BibTeX:
@inproceedings{dindokar:parlearning:2015, author = {Ravikant Dindokar and Neel Choudhury and Yogesh Simmhan}, title = {Analysis of Subgraph-centric Distributed Shortest Path Algorithm}, booktitle = {IEEE International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics (ParLearning), Co-located with IPDPS}, year = {2015}, pages = {1185--1190}, note = {Short paper}, url = {http://ieeexplore.ieee.org/document/7284445/}, doi = {https://doi.org/10.1109/IPDPSW.2015.87} } |
||||||
simmhan:wbdb:2015 | Simmhan, Y.; Shukla, A. & Verma, A. |
Benchmarking Fast Data Platforms for the Aadhaar Biometric Database
|
2015 |
Vol. 10044 Workshop on Big Data Benchmarking (WBDB) , pp. 21-39 |
inproceedings | iisc, stream processing, uidai, benchmark, peer reviewed |
BibTeX:
@inproceedings{simmhan:wbdb:2015, author = {Yogesh Simmhan and Anshu Shukla and Arun Verma}, title = {Benchmarking Fast Data Platforms for the Aadhaar Biometric Database}, booktitle = {Workshop on Big Data Benchmarking (WBDB)}, year = {2015}, volume = {10044}, pages = {21--39}, url = {http://arxiv.org/abs/1510.04160}, doi = {https://doi.org/10.1007/978-3-319-49748-8_2} } |
||||||
shukla:hipcw:2015 | Shukla, A.; Sharma, T. & Simmhan, Y. |
Characterizing Distributed Stream Processing Systems for IoT Applications
|
2015 | Workshop on Architectural Support and Middleware for InfoSymbiotics/ Dynamic Data Driven Applications Systems (DDDAS), co-located with High Performance Computing Conference (HiPC) , pp. 61 | inproceedings | iisc, iot, stream processing, peer reviewed |
BibTeX:
@inproceedings{shukla:hipcw:2015, author = {Anshu Shukla and Tarun Sharma and Yogesh Simmhan}, title = {Characterizing Distributed Stream Processing Systems for IoT Applications}, booktitle = {Workshop on Architectural Support and Middleware for InfoSymbiotics/ Dynamic Data Driven Applications Systems (DDDAS), co-located with High Performance Computing Conference (HiPC)}, year = {2015}, pages = {61}, note = {Extended abstract}, doi = {https://doi.org/10.1109/HiPCW.2015.22} } |
||||||
simmhan:ipdps:2015 | Simmhan, Y.; Choudhury, N.; Wickramaarachchi, C.; Kumbhare, A.; Frincu, M.; Raghavendra, C. & Prasanna, V. |
Distributed Programming over Time-series Graphs
|
2015 | IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 809-818 | inproceedings | graph processing, timeseries, goffish, iisc, usc, peer reviewed |
BibTeX:
@inproceedings{simmhan:ipdps:2015, author = {Yogesh Simmhan and Neel Choudhury and Charith Wickramaarachchi and Alok Kumbhare and Marc Frincu and Cauligi Raghavendra and Viktor Prasanna}, title = {Distributed Programming over Time-series Graphs}, booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)}, year = {2015}, pages = {809--818}, note = {[CORE A]}, url = {http://ieeexplore.ieee.org/document/7161567/}, doi = {https://doi.org/10.1109/IPDPS.2015.66} } |
||||||
kumbhare:icdcs:2015 | Kumbhare, A.; Frincu, M.; Simmhan, Y. & Prasanna, V.K. |
Fault-Tolerant and Elastic Streaming MapReduce with Decentralized Coordination
|
2015 | IEEE International Conference on Distributed Computing Systems (ICDCS) , pp. 328-338 | inproceedings | iisc, peer reviewed, mapreduce, stream processing |
BibTeX:
@inproceedings{kumbhare:icdcs:2015, author = {Alok Kumbhare and Marc Frincu and Yogesh Simmhan and Viktor K. Prasanna}, title = {Fault-Tolerant and Elastic Streaming MapReduce with Decentralized Coordination}, booktitle = {IEEE International Conference on Distributed Computing Systems (ICDCS)}, year = {2015}, pages = {328--338}, note = {[Core A]}, url = {http://ieeexplore.ieee.org/document/7164919/}, doi = {https://doi.org/10.1109/ICDCS.2015.41} } |
||||||
aman:sgcomm:2015 | Aman, S.; Frincu, M.; Chelmis, C.; Noor, M.; Simmhan, Y. & Prasanna, V.K. |
Prediction Models for Dynamic Demand Response: Requirements, Challenges, and Insights
|
2015 | IEEE International Conference on Smart Grid Communications (SmartGridComm) , pp. 1-6 | inproceedings | iisc, peer reviewed, smart grid, iot |
BibTeX:
@inproceedings{aman:sgcomm:2015, author = {Saima Aman and Marc Frincu and Charalampos Chelmis and Muhammad Noor and Yogesh Simmhan and Viktor K. Prasanna}, title = {Prediction Models for Dynamic Demand Response: Requirements, Challenges, and Insights}, booktitle = {IEEE International Conference on Smart Grid Communications (SmartGridComm)}, year = {2015}, pages = {1--6}, url = {http://ieeexplore.ieee.org/document/7436323/}, doi = {https://doi.org/10.1109/SmartGridComm.2015.7436323} } |
||||||
kushwaha:ccem:2014 | Kushwaha, V. & Simmhan, Y. |
An Analysis of Spot-Priced Clouds for Practical Job Scheduling
|
2014 | IEEE Cloud Computing for Emerging Markets (CCEM) , pp. 1-8 | inproceedings | iisc, cloud, spot, peer reviewed |
BibTeX:
@inproceedings{kushwaha:ccem:2014, author = {Vedsar Kushwaha and Yogesh Simmhan}, title = {An Analysis of Spot-Priced Clouds for Practical Job Scheduling}, booktitle = {IEEE Cloud Computing for Emerging Markets (CCEM)}, year = {2014}, pages = {1--8}, doi = {https://doi.org/10.1109/CCEM.2014.7015488} } |
||||||
chu:ipdps:2014 | Chu, H.-Y. & Simmhan, Y. |
Cost-efficient and Resilient Job Life-cycle Management on Hybrid Clouds
|
2014 | IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 327-336 | inproceedings | usc, cloud, peer reviewed, iisc |
BibTeX:
@inproceedings{chu:ipdps:2014, author = {Hsuan-Yi Chu and Yogesh Simmhan}, title = {Cost-efficient and Resilient Job Life-cycle Management on Hybrid Clouds}, booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)}, year = {2014}, pages = {327--336}, note = {[CORE A]}, url = {http://ieeexplore.ieee.org/document/6877267/}, doi = {https://doi.org/10.1109/IPDPS.2014.43} } |
||||||
govindarajan:comad:2014 | Govindarajan, N.; Simmhan, Y.; Jamadagni, N. & Misra, P. |
Event Processing across Edge and the Cloud for Internet of Things Applications
|
2014 | International Conference on Management of Data (COMAD) , pp. 101-104 | inproceedings | iisc, event processing, cep, iot, peer reviewed, poster |
BibTeX:
@inproceedings{govindarajan:comad:2014, author = {Nithyashri Govindarajan and Yogesh Simmhan and Nitin Jamadagni and Prasant Misra}, title = {Event Processing across Edge and the Cloud for Internet of Things Applications}, booktitle = {International Conference on Management of Data (COMAD)}, year = {2014}, pages = {101--104}, note = {Short paper, [CORE B]}, url = {http://dl.acm.org/citation.cfm?id=2726970.2726985} } |
||||||
simmhan:europar:2014 | Simmhan, Y.; Kumbhare, A.; Wickramaarachchi, C.; Nagarkar, S.; Ravi, S.; Raghavendra, C. & Prasanna, V. |
GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics
|
2014 |
Vol. 8632 International European Conference on Parallel Processing (Euro-Par) , pp. 451-462 |
inproceedings | graphs, goffish, cluster, usc, peer reviewed, iisc |
BibTeX:
@inproceedings{simmhan:europar:2014, author = {Yogesh Simmhan and Alok Kumbhare and Charith Wickramaarachchi and Soonil Nagarkar and Santosh Ravi and Cauligi Raghavendra and Viktor Prasanna}, title = {GoFFish: A Sub-Graph Centric Framework for Large-Scale Graph Analytics}, booktitle = {International European Conference on Parallel Processing (Euro-Par)}, year = {2014}, volume = {8632}, pages = {451--462}, note = {[CORE A]}, doi = {https://doi.org/10.1007/978-3-319-09873-9_38} } |
||||||
kumbhare:ccgrid:2014 | Kumbhare, A.; Simmhan, Y. & Prasanna, V.K. |
PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds
|
2014 | IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) , pp. 344-353 | inproceedings | continuous dataflow, workflow, floe, cloud, iisc, usc, peer reviewed, iisc |
BibTeX:
@inproceedings{kumbhare:ccgrid:2014, author = {Alok Kumbhare and Yogesh Simmhan and Viktor K. Prasanna}, title = {PLAStiCC: Predictive Look-Ahead Scheduling for Continuous dataflows on Clouds}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)}, year = {2014}, pages = {344--353}, note = {[CORE A]}, url = {http://ieeexplore.ieee.org/document/6846470/}, doi = {https://doi.org/10.1109/CCGrid.2014.60} } |
||||||
badam:comad:2014 | Badam, N.C. & Simmhan, Y. |
Subgraph Rank: PageRank for SubgraphCentric Distributed Graph Processing
|
2014 | International Conference on Management of Data (COMAD) , pp. 38-49 | inproceedings | iisc, graph, goffish, algorithm, peer reviewed |
BibTeX:
@inproceedings{badam:comad:2014, author = {Nitin Chandra Badam and Yogesh Simmhan}, title = {Subgraph Rank: PageRank for SubgraphCentric Distributed Graph Processing}, booktitle = {International Conference on Management of Data (COMAD)}, year = {2014}, pages = {38--49}, note = {[CORE B]}, url = {http://dl.acm.org/citation.cfm?id=2726970.2726979} } |
||||||
simmhan:cise:2013 | Simmhan, Y.; Aman, S.; Kumbhare, A.; Liu, R.; Stevens, S.; Zhou, Q. & Prasanna, V. |
Cloud-Based Software Platform for Big Data Analytics in Smart Grids
|
2013 |
Computing in Science and Engineering Vol. 15 (4) , pp. 38 - 47 |
article | usc, smart grid, cloud, peer reviewed |
BibTeX:
@article{simmhan:cise:2013, author = {Yogesh Simmhan and Saima Aman and Alok Kumbhare and Rongyang Liu and Sam Stevens and Qunzhi Zhou and Viktor Prasanna}, title = {Cloud-Based Software Platform for Big Data Analytics in Smart Grids}, journal = {Computing in Science and Engineering}, publisher = {IEEE and AIP}, year = {2013}, volume = {15}, number = {4}, pages = {38 - 47}, note = {[IF 1.422, CORE C]}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cise-2013.pdf}, doi = {https://doi.org/10.1109/MCSE.2013.39} } |
||||||
Aman:comm:2013 | Aman, S.; Simmhan, Y. & Prasanna, V.K. |
Energy Management Systems: State of the Art and Emerging Trends
|
2013 |
IEEE Communications Magazine Vol. 51 (1) , pp. 114 -119 |
article | smart grid, peer reviewed, usc |
BibTeX:
@article{Aman:comm:2013, author = {Saima Aman and Yogesh Simmhan and Viktor K. Prasanna}, title = {Energy Management Systems: State of the Art and Emerging Trends}, journal = {IEEE Communications Magazine}, publisher = {IEEE}, year = {2013}, volume = {51}, number = {1}, pages = {114 -119}, note = {[IF 3.785]}, doi = {https://doi.org/10.1109/MCOM.2013.6400447} } |
||||||
Wickramaarachchi:escience:2013 | Wickramaarachchi, C. & Simmhan, Y. |
Continuous Dataflow Update Strategies for Mission-Critical Applications
|
2013 | IEEE Internatrional Conference on eScience (eScience) , pp. 155-163 | inproceedings | usc, cloud, workflow, continuous dataflow, peer reviewed |
BibTeX:
@inproceedings{Wickramaarachchi:escience:2013, author = {Charith Wickramaarachchi and Yogesh Simmhan}, title = {Continuous Dataflow Update Strategies for Mission-Critical Applications}, booktitle = {IEEE Internatrional Conference on eScience (eScience)}, year = {2013}, pages = {155--163}, note = {[CORE A]}, url = {http://ceng.usc.edu/ simmhan/pubs/wickramaarachchi-escience-2013.pdf}, doi = {https://doi.org/10.1109/eScience.2013.35} } |
||||||
kumbhare:sc:2013 | Kumbhare, A.; Simmhan, Y. & Prasanna, V. |
Exploiting Application Dynamism and Cloud Elasticity for Continuous Dataflows
|
2013 | IEEE/ACM International Conference for High Performance Computing Networking, Storage, and Analysis (SC) , pp. 1-12 | inproceedings | usc, cloud, workflow, continuous dataflow, peer reviewed |
BibTeX:
@inproceedings{kumbhare:sc:2013, author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna}, title = {Exploiting Application Dynamism and Cloud Elasticity for Continuous Dataflows}, booktitle = {IEEE/ACM International Conference for High Performance Computing Networking, Storage, and Analysis (SC)}, year = {2013}, pages = {1--12}, note = {[CORE A]}, doi = {https://doi.org/10.1145/2503210.2503240} } |
||||||
redekopp:ipdps:2013 | Redekopp, M.; Simmhan, Y. & Prasanna, V.K. |
Optimizations and Analysis of BSP Graph Processing Models on Public Clouds
|
2013 | IEEE International Parallel & Distributed Processing Symposium (IPDPS) , pp. 203-214 | inproceedings | usc, cloud, graphs, azure, peer reviewed |
BibTeX:
@inproceedings{redekopp:ipdps:2013, author = {Mark Redekopp and Yogesh Simmhan and Viktor K. Prasanna}, title = {Optimizations and Analysis of BSP Graph Processing Models on Public Clouds}, booktitle = {IEEE International Parallel & Distributed Processing Symposium (IPDPS)}, year = {2013}, pages = {203--214}, note = {[CORE A]}, url = {https://ieeexplore.ieee.org/document/6569812/}, doi = {https://doi.org/10.1109/IPDPS.2013.76} } |
||||||
simmhan:smartcities:2013 | Simmhan, Y. & Noor, M.U. |
Scalable Prediction of Energy Consumption using Incremental Time Series Clustering
|
2013 | Workshop on Big Data and Smarter Cities, Co-located with IEEE International Conference on Big Data , pp. 29-36 | inproceedings | smart grid, analytics, usc, peer reviewed |
BibTeX:
@inproceedings{simmhan:smartcities:2013, author = {Yogesh Simmhan and Muhammad Usman Noor}, title = {Scalable Prediction of Energy Consumption using Incremental Time Series Clustering}, booktitle = {Workshop on Big Data and Smarter Cities, Co-located with IEEE International Conference on Big Data}, year = {2013}, pages = {29--36}, doi = {https://doi.org/10.1109/BigData.2013.6691774} } |
||||||
zhou:bigdata:2013 | Zhou, Q.; Simmhan, Y. & Prasanna, V. |
Towards Hybrid Online On-Demand Querying of Realtime Data with Stateful Complex Event Processing
|
2013 | IEEE International Conference on Big Data (BigData) , pp. 199-205 | inproceedings | smart grid, cep, usc, peer reviewed, short |
BibTeX:
@inproceedings{zhou:bigdata:2013, author = {Qunzhi Zhou and Yogesh Simmhan and Viktor Prasanna}, title = {Towards Hybrid Online On-Demand Querying of Realtime Data with Stateful Complex Event Processing}, booktitle = {IEEE International Conference on Big Data (BigData)}, year = {2013}, pages = {199--205}, doi = {https://doi.org/10.1109/BigData.2013.6691575} } |
||||||
Simmhan:scale:2012 | Simmhan, Y.; Agarwal, V.; Aman, S.; Kumbhare, A.; Natarajan, S.; Rajguru, N.; Robinson, I.; Stevens, S.; Yin, W.; Zhou, Q. & Prasanna, V. |
Adaptive Energy Forecasting and Information Diffusion for Smart Power Grids
|
2012 | IEEE International Scalable Computing Challenge (SCALE) , pp. 1-4 | inproceedings | hadoop, openplanet, floe, workflow, information integration, smart grid, peer reviewed, usc, short |
BibTeX:
@inproceedings{Simmhan:scale:2012, author = {Yogesh Simmhan and Vaibhav Agarwal and Saima Aman and Alok Kumbhare and Sreedhar Natarajan and Nikhil Rajguru and Ian Robinson and Samuel Stevens and Wei Yin and Qunzhi Zhou and Viktor Prasanna}, title = {Adaptive Energy Forecasting and Information Diffusion for Smart Power Grids}, booktitle = {IEEE International Scalable Computing Challenge (SCALE)}, year = {2012}, pages = {1--4}, note = {SCALE Challenge Winner}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-scale-2012.pdf} } |
||||||
Kumbhare:cloud:2012 | Kumbhare, A.; Simmhan, Y. & Prasanna, V. |
Cryptonite: A Secure and Performant Data Repository on Public Clouds
|
2012 | IEEE International Cloud Computing Conference (CLOUD) , pp. 510-517 | inproceedings | usc, smart grid, security, data privacy, cloud, azure, peer reviewed |
BibTeX:
@inproceedings{Kumbhare:cloud:2012, author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna}, title = {Cryptonite: A Secure and Performant Data Repository on Public Clouds}, booktitle = {IEEE International Cloud Computing Conference (CLOUD)}, year = {2012}, pages = {510--517}, note = {[CORE B]}, url = {https://ieeexplore.ieee.org/document/6253545/}, doi = {https://doi.org/10.1109/CLOUD.2012.109} } |
||||||
Zhou:iswc:2012 | Zhao, Q.; Simmhan, Y. & Prasanna, V.K. |
Incorporating Semantic Knowledge into Stream Processing for Smart Grid Applications
|
2012 |
Vol. 7650 International Semantic Web Conference (ISWC) , pp. 257-273 |
inproceedings | peer reviewed, smart grid, cep, usc |
BibTeX:
@inproceedings{Zhou:iswc:2012, author = {Qunzhi Zhao and Yogesh Simmhan and Viktor K. Prasanna}, title = {Incorporating Semantic Knowledge into Stream Processing for Smart Grid Applications}, booktitle = {International Semantic Web Conference (ISWC)}, year = {2012}, volume = {7650}, pages = {257--273}, note = {[CORE A]}, url = {http://iswc2012.semanticweb.org/sites/default/files/76500254.pdf}, doi = {https://doi.org/10.1007/978-3-642-35173-0_17} } |
||||||
Zhao:ipaw:2012 | Zhao, J.; Simmhan, Y. & Prasanna, V. |
Presenting Apropos Provenance for Situation Awareness and Forensics
|
2012 |
Vol. 7525 International Proveanance and Annotation Workshop , pp. 250-253 |
inproceedings | provenance, smart grid, usc, peer reviewed, short |
BibTeX:
@inproceedings{Zhao:ipaw:2012, author = {Jing Zhao and Yogesh Simmhan and Viktor Prasanna}, title = {Presenting Apropos Provenance for Situation Awareness and Forensics}, booktitle = {International Proveanance and Annotation Workshop}, publisher = {Springer}, year = {2012}, volume = {7525}, pages = {250--253}, note = {Poster}, url = {http://dx.doi.org/10.1007/978-3-642-34222-6_30}, doi = {https://doi.org/10.1007/978-3-642-34222-6_30} } |
||||||
Yin:mapreduce:2012 | Yin, W.; Simmhan, Y. & Prasanna, V. |
Scalable Regression Tree Learning on Hadoop using OpenPlanet
|
2012 | ACM International Workshop on MapReduce and its Applications (MAPREDUCE) , pp. 57-64 | inproceedings | cloud, machine learning, map reduce, hadoop, smart grid, peer reviewed, usc |
Abstract: As scientific and engineering domains attempt to effectively analyze the deluge of data arriving from sensors and instruments, machine learning is becoming a key data mining tool to build prediction models. Regression tree is a popular learning model that combines decision trees and linear regression to forecast numerical target variables based on a set of input features. Map Reduce is well suited for addressing such data intensive learning applications, and a proprietary regression tree algorithm, PLANET, using MapReduce has been proposed earlier. In this paper, we describe an open source implement of this algorithm, OpenPlanet, on the Hadoop framework using a hybrid approach. Further, we evaluate the performance of OpenPlanet using realworld datasets from the Smart Power Grid domain to perform energy use forecasting, and propose tuning strategies of Hadoop parameters to improve the performance of the default configuration by 75% for a training dataset of 17 million tuples on a 64-core Hadoop cluster on FutureGrid. | ||||||
BibTeX:
@inproceedings{Yin:mapreduce:2012, author = {Wei Yin and Yogesh Simmhan and Viktor Prasanna}, title = {Scalable Regression Tree Learning on Hadoop using OpenPlanet}, booktitle = {ACM International Workshop on MapReduce and its Applications (MAPREDUCE)}, year = {2012}, pages = {57--64}, url = {http://ceng.usc.edu/ simmhan/pubs/yin-mapreduce-2012.pdf}, doi = {https://doi.org/10.1145/2287016.2287027} } |
||||||
Zhou:itng:2012 | Zhou, Q.; Natarajan, S.; Simmhan, Y. & Prasanna, V. |
Semantic Information Modeling for Emerging Applications in Smart Grid
|
2012 | IEEE International Conference on Information Technology : New Generations (ITNG) , pp. 775-782 | inproceedings | usc, smart grid, semantic, information integration, peer reviewed |
Abstract: Abstract—Smart Grid modernizes power grid by integrating digital and information technologies. Millions of smart meters, intelligent appliances and communication infrastructures are under deployment allowing advanced IT applications to be developed to protect and optimize power grid operations. Demand response (DR) is one such emerging application to optimize electricity demand by curtailing/shifting power load when peak load occurs. Existing DR approaches are mostly based on static plans such as pricing policies and load shedding schedules. However, improvements to power management applications rely on data emanated from existing and new information sources with the grow of Smart Grid information space. In particular, dynamic DR algorithms may depend on information from smart meters that report interval-based power consumption measurement, HVAC systems that monitor buildings heat and humidity, and even weather forecast services. In order for emerging Smart Grid applications to take advantage of the diverse data influx, extensible information integration is required. In this paper, we develop an integrated Smart Grid information model using Semantic Web techniques and present case studies of using semantic information for dynamic DR. We show the semantic model facilitates information integration and knowledge representation for developing the next generation Smart Grid applications. | ||||||
BibTeX:
@inproceedings{Zhou:itng:2012, author = {Qunzhi Zhou and Sreedhar Natarajan and Yogesh Simmhan and Viktor Prasanna}, title = {Semantic Information Modeling for Emerging Applications in Smart Grid}, booktitle = {IEEE International Conference on Information Technology : New Generations (ITNG)}, year = {2012}, pages = {775--782}, url = {http://dx.doi.org/10.1109/ITNG.2012.150}, doi = {https://doi.org/10.1109/ITNG.2012.150} } |
||||||
Simmhan:sciencecloud:2012 | Simmhan, Y.; Antoniu, G.; Goble, C. & Ramakrishnan, L. Simmhan, Y.; Antoniu, G.; Goble, C. & Ramakrishnan, L. (Hrsg.) |
Proceedings of the 3rd International Workshop on Scientific Cloud Computing (ScienceCloud)
[BibTeX] |
2012 | proceedings | editorial, usc | |
BibTeX:
@proceedings{Simmhan:sciencecloud:2012, author = {Yogesh Simmhan and Gabriel Antoniu and Carole Goble and Lavanya Ramakrishnan}, title = {Proceedings of the 3rd International Workshop on Scientific Cloud Computing (ScienceCloud)}, publisher = {ACM}, year = {2012} } |
||||||
Simmhan:fgcs:2011 | Simmhan, Y. & Barga, R. Simmhan, Y.; Groth, P. & Moreau, L. (Hrsg.) |
Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench
|
2011 |
Future Generation Computer Systems (FGCS) Vol. 27 , pp. 790-796 |
article | msr, provenance, opm, trident, workflow, inter-operability, provenance challenge, peer reviewed |
Abstract: The Trident workbench is a platform for composing, executing and managing scientific workflows. While Trident collects provenance in its native provenance model, the third provenance challenge was an opportunity to build support for the Open Provenance Model into Trident. There are several possible approaches to harmonize our native model with OPM, and such choices are also available to other existing provenance and workflow systems working towards OPM compatibility. We identify and analyze the relative merits of these approaches in an effort to inform practitioners planning to support OPM in their existing provenance/workflow systems. Further, we describe our experience with using the integration approach we choose to interoperate with other teams as part of the challenge. | ||||||
BibTeX:
@article{Simmhan:fgcs:2011, author = {Yogesh Simmhan and Roger Barga}, title = {Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench}, journal = {Future Generation Computer Systems (FGCS)}, publisher = {Elsevier}, year = {2011}, volume = {27}, pages = {790--796}, note = {[IF 2.43, CORE A]}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-fgcs-2011.pdf}, doi = {https://doi.org/10.1016/j.future.2010.10.005} } |
||||||
Zhao:ijca:2011 | Zhao, J.; Simmhan, Y.; Gomadam, K. & Prasanna, V.K. |
Querying Provenance Information in Distributed Environments
|
2011 |
International Journal of Computers and Their Applications (IJCA) Vol. 18 (3) , pp. 196-215 |
article | usc, smart oilfield, provenance, peer reviewed, special issue |
Abstract: The growing recognition of the importance of provenance for data intensive and multidisciplinary domains is leading to careful collection of provenance. One consequence of this is the proliferation of provenance repositories hosted for individual organization or communities, with limited ability to reconstruct and query for and on provenance across them. Community standards like the Open Provenance Model (OPM) allow uniform interpretation and exchange of provenance metadata but do not prescribe query or service specifications to access provenance. If data reuse and sharing across institutions is not accompanied by passing provenance at the time of data exchange, we need to track the provenance and query for them or over them across distributed provenance repositories. In this article, we present approaches for querying over distributed provenance information, and address two common provenance query models that we formalize: provenance retrieval query and provenance filter query. Our problem is motivated by Smart Oilfield applications in the energy informatics domain, and we evaluate the performance of our algorithms using synthetic workflows based on the domain. | ||||||
BibTeX:
@article{Zhao:ijca:2011, author = {Jing Zhao and Yogesh Simmhan and Karthik Gomadam and Viktor K. Prasanna}, title = {Querying Provenance Information in Distributed Environments}, journal = {International Journal of Computers and Their Applications (IJCA)}, publisher = {ISCA}, year = {2011}, volume = {18}, number = {3}, pages = {196--215}, url = {http://ceng.usc.edu/ simmhan/pubs/zhao-ijca-2011.pdf} } |
||||||
Moreau:fgcs:2011 | Moreau, L.; Clifford, B.; Freire, J.; Futrelle, J.; Gil, Y.; Groth, P.; Kwasnikowska, N.; Miles, S.; Missier, P.; Myers, J.; Plale, B.; Simmhan, Y.; Stephan, E. & den Bussche, J.V. Simmhan, Y.; Groth, P. & Moreau, L. (Hrsg.) |
The Open Provenance Model core specification (v1.1)
|
2011 |
Future Generation Computer Systems (FGCS) Vol. 27 , pp. 743-756 |
article | msr, provenance, opm, representation, inter-operability, peer reviewed |
Abstract: The Open Provenance Model is a model of provenance that is designed to meet the following requirements: (1) Allow provenance information to be exchanged between systems, by means of a compatibility layer based on a shared provenance model. (2) Allow developers to build and share tools that operate on such a provenance model. (3) Define provenance in a precise, technology-agnostic manner. (4) Support a digital representation of provenance for any “thing”, whether produced by computer systems or not. (5) Allow multiple levels of description to coexist. (6) Define a core set of rules that identify the valid inferences that can be made on provenance representation. This document contains the specification of the Open Provenance Model (v1.1) resulting from a community effort to achieve inter-operability in the Provenance Challenge series. | ||||||
BibTeX:
@article{Moreau:fgcs:2011, author = {Luc Moreau and Ben Clifford and Juliana Freire and Joe Futrelle and Yolanda Gil and Paul Groth and Natalia Kwasnikowska and Simon Miles and Paolo Missier and Jim Myers and Beth Plale and Yogesh Simmhan and Eric Stephan and Jan Van den Bussche}, title = {The Open Provenance Model core specification (v1.1)}, journal = {Future Generation Computer Systems (FGCS)}, publisher = {Elsevier}, year = {2011}, volume = {27}, pages = {743--756}, note = {[IF 2.43, CORE A]}, url = {http://ceng.usc.edu/ simmhan/pubs/moreau-fgcs-2011.pdf}, doi = {https://doi.org/10.1016/j.future.2010.07.005} } |
||||||
Simmhan:ijca:2011 | Simmhan, Y. & Plale, B. |
Using Provenance for Personalized Quality Ranking of Scientific Datasets
|
2011 |
International Journal of Computers and Their Applications (IJCA) Vol. 18 (3) , pp. 180-195 |
article | usc, provenance, iu, peer reviewed, karma, special issue |
Abstract: The rapid growth of eScience has led to an explosion in the creation and availability of scientific datasets that includes raw instrument data and derived datasets from model simulations. A large number of these datasets are surfacing online in public and private catalogs, often annotated with XML metadata, as part of community efforts to foster open research. With this rapid expansion comes the challenge of filtering and selecting datasets that best match the needs of scientists. We address a key aspect of the scientific data discovery process by ranking search results according to a personalized data quality score based on a declarative quality profile to help scientists select the most suitable data for their applications. Our quality model is resilient to missing metadata using a novel strategy that uses provenance in its absence. Intuitively, our premise is that the quality score for a dataset depends on its provenance – the scientific task and its inputs that created the dataset – and it is possible to define a quality function based on provenance metadata that predicts the same quality score as one evaluated using the user’s quality profile over the complete metadata. Here, we present a model and architecture for data quality scoring, apply machine learning techniques to construct a quality function that uses provenance as proxy for missing metadata, and empirically test the prediction power of our quality function. Our results show that for some scientific tasks, quality scores based on provenance closely track the quality scores based on complete metadata properties, with error margins between 1 – 29%. | ||||||
BibTeX:
@article{Simmhan:ijca:2011, author = {Yogesh Simmhan and Beth Plale}, title = {Using Provenance for Personalized Quality Ranking of Scientific Datasets}, journal = {International Journal of Computers and Their Applications (IJCA)}, publisher = {ISCA}, year = {2011}, volume = {18}, number = {3}, pages = {180--195}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-ijca-2011.pdf} } |
||||||
Simmhan:greenit:2011 | Simmhan, Y.; Zhou, Q. & Prasanna, V.K. Kim, J.H. & Lee, M.J. (Hrsg.) |
Semantic Information Integration for Smart Grid Applications (
Green IT: Technologies and Applications
)
|
2011 | Green IT: Technologies and Applications , pp. 361-380 | inbook | usc, smart grid, semantic, information integration, peer reviewed |
Abstract: The Los Angeles Smart Grid Project aims to use informatics techniques to bring about a quantum leap in the way demand response load optimization is performed in utilities. Semantic information integration, from sources as diverse as Internet-connected smart meters and social networks, is a linchpin to support the advanced analytics and mining algorithms required for this. In association with it, semantic complex event processing system will allow consumer and utility managers to easily specify and enact energy policies continuously. We present the information systems architecture for the project that is under development, and discuss research issues that emerge from having to design a system that supports 1.4 million customers and a rich ecosystem of Smart Grid applications from users, third party vendors, the utility and regulators. | ||||||
BibTeX:
@inbook{Simmhan:greenit:2011, author = {Yogesh Simmhan and Qunzhi Zhou and Viktor K. Prasanna}, title = {Green IT: Technologies and Applications}, publisher = {Springer Berlin Heidelberg}, year = {2011}, pages = {361--380}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-greenit-2011.pdf}, doi = {https://doi.org/10.1007/978-3-642-22179-8_19} } |
||||||
Simmhan:sciencecloud:2011 | Simmhan, Y.; Cao, B.; Giakkoupis, M. & Prasanna, V.K. |
Adaptive rate stream processing for smart grid applications on clouds
|
2011 | ACM International Workshop on Scientific Cloud Computing (ScienceCloud) , pp. 33-38 | inproceedings | usc, smart grid, cloud, streaming, peer reviewed, short paper |
Abstract: Pervasive smart meters that continuously measure power usage by consumers within a smart (power) grid are providing utilities and power systems researchers with unprecedented volumes of information through streams that need to be processed and analyzed in near realtime. We introduce the use of Cloud platforms to perform scalable, latency sensitive stream processing for eEngineering applications in the smart grid domain. One unique aspect of our work is the use of adaptive rate control to throttle the rate of generation of power events by smart meters, which meets accuracy requirements of smart grid applications while consuming 50% lesser bandwidth resources in the Cloud. | ||||||
BibTeX:
@inproceedings{Simmhan:sciencecloud:2011, author = {Yogesh Simmhan and Baohua Cao and Michail Giakkoupis and Viktor K. Prasanna}, title = {Adaptive rate stream processing for smart grid applications on clouds}, booktitle = {ACM International Workshop on Scientific Cloud Computing (ScienceCloud)}, year = {2011}, pages = {33--38}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-sciencecloud-2011.pdf}, doi = {https://doi.org/10.1145/1996109.1996116} } |
||||||
Simmhan:cloud:2011 | Simmhan, Y.; Kumbhare, A.; Cao, B. & Prasanna, V.K. |
An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds
|
2011 | IEEE International Cloud Computing Conference (CLOUD) , pp. 582-589 | inproceedings | usc, cloud, security, privacy, smart grid, peer reviewed |
Abstract: Power utilities globally are increasingly upgrading to Smart Grids that use bi-directional communication with the consumer to enable an information-driven approach to distributed energy management. Clouds offer features well suited for Smart Grid software platforms and applications, such as elastic resources and shared services. However, the security and privacy concerns inherent in an informationrich Smart Grid environment are further exacerbated by their deployment on Clouds. Here, we present an analysis of security and privacy issues in a Smart Grids software architecture operating on different Cloud environments, in the form of a taxonomy. We use the Los Angeles Smart Grid Project that is underway in the largest U.S. municipal utility to drive this analysis that will benefit both Cloud practitioners targeting Smart Grid applications, and Cloud researchers investigating security and privacy. | ||||||
BibTeX:
@inproceedings{Simmhan:cloud:2011, author = {Yogesh Simmhan and Alok Kumbhare and Baohua Cao and Viktor K. Prasanna}, title = {An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds}, booktitle = {IEEE International Cloud Computing Conference (CLOUD)}, publisher = {IEEE}, year = {2011}, pages = {582--589}, note = {[CORE B]}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cloud-2011.pdf}, doi = {https://doi.org/10.1109/CLOUD.2011.107} } |
||||||
Kumbhare:datacloud:2011 | Kumbhare, A.; Simmhan, Y. & Prasanna, V. |
Designing a Secure Storage Repository for Sharing Scientific Datasets using Public Clouds
|
2011 | ACM International Workshop on Data Intensive Computing in the Clouds (DataCloud-SC11) , pp. 31-40 | inproceedings | peer reviewed, cloud, azure, security, smart grid, usc |
BibTeX:
@inproceedings{Kumbhare:datacloud:2011, author = {Alok Kumbhare and Yogesh Simmhan and Viktor Prasanna}, title = {Designing a Secure Storage Repository for Sharing Scientific Datasets using Public Clouds}, booktitle = {ACM International Workshop on Data Intensive Computing in the Clouds (DataCloud-SC11)}, year = {2011}, pages = {31--40}, url = {http://ceng.usc.edu/ simmhan/pubs/kumbhare-datacloud-2011.pdf}, doi = {https://doi.org/10.1145/2087522.2087530} } |
||||||
Aman:dddm:2011 | Aman, S.; Simmhan, Y. & Prasanna, V.K. |
Improving Energy Use Forecast for Campus Micro-grids using Indirect Indicators
|
2011 | International Workshop on Domain Driven Data Mining (DDDM) , pp. 389-397 | inproceedings | usc, smart grid, machine learning, peer reviewed |
BibTeX:
@inproceedings{Aman:dddm:2011, author = {Saima Aman and Yogesh Simmhan and Viktor K. Prasanna}, title = {Improving Energy Use Forecast for Campus Micro-grids using Indirect Indicators}, booktitle = {International Workshop on Domain Driven Data Mining (DDDM)}, year = {2011}, pages = {389--397}, url = {http://ceng.usc.edu/ simmhan/pubs/aman-dddm-2011.pdf}, doi = {https://doi.org/10.1109/ICDMW.2011.95} } |
||||||
Redekopp:pargraph:2011 | Redekopp, M.; Simmhan, Y. & Prasanna, V.K. |
Performance Analysis of Vertex-centric Graph Algorithms on the Azure Cloud Platform
|
2011 | IEEE Workshop on Parallel Algorithms and Software for Analysis of Massive Graphs (ParGraph) , pp. 1-8 | inproceedings | graphs, azure, cloud, peer reviewed, usc |
Abstract: Finding key vertices in large graphs is an important problem in many applications such as social networks, bioinformatics, and distribution networks. Betweenness centrality is a popular algorithm for finding such vertices and has been studied extensively, yielding several parallel formulations suitable to supercomputers and clusters. In this paper we implement and study betweenness centrality in the context of cloud-based platforms using Microsoft Windows Azure as our case study. We demonstrate scalable parallel performance and investigate key issues related to a cloud-based implementation including mitigating penalties associated with VM failures as well as the impact of communication overheads in the cloud. We use a combination of empirical and analytical evaluation using both synthetic small-world and real-world social interaction graphs. | ||||||
BibTeX:
@inproceedings{Redekopp:pargraph:2011, author = {Mark Redekopp and Yogesh Simmhan and Viktor K. Prasanna}, title = {Performance Analysis of Vertex-centric Graph Algorithms on the Azure Cloud Platform}, booktitle = {IEEE Workshop on Parallel Algorithms and Software for Analysis of Massive Graphs (ParGraph)}, year = {2011}, pages = {1--8}, url = {http://halcyon.usc.edu/ pk/prasannawebsite/papers/2011/redekopp-pargraph-2011.pdf} } |
||||||
Simmhan:hpcdb:2011 | Simmhan, Y.; van Ingen, C.; Heasley, J. & Szalay, A. |
Stargazing through a Digital Veil: Managing a Large Scale Sky Survey using Distributed Databases on HPC Clusters
|
2011 | Workshop on High-Performance Computing meets Databases (HPCDB) , pp. 33-36 | inproceedings | usc, msr, escience, data management, hpc, graywulf, panstarrs, databases, peer reviewed |
BibTeX:
@inproceedings{Simmhan:hpcdb:2011, author = {Yogesh Simmhan and Catharine van Ingen and Jim Heasley and Alex Szalay}, title = {Stargazing through a Digital Veil: Managing a Large Scale Sky Survey using Distributed Databases on HPC Clusters}, booktitle = {Workshop on High-Performance Computing meets Databases (HPCDB)}, year = {2011}, pages = {33--36}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-hpcdb-2011.pdf}, doi = {https://doi.org/10.1145/2125636.2125648} } |
||||||
Zhou:debs:2011 | Zhou, Q.; Simmhan, Y. & Prasanna, V.K. |
Towards an inexact semantic complex event processing framework
|
2011 | International Conference on Distributed Event-Based System (DEBS) , pp. 401-402 | inproceedings | usc, smart grid. cep, semantic, peer reviewed, poster |
Abstract: Complex event processing (CEP) deals with detecting real-time situations, represented as event patterns, from among an event cloud. The state-of-the-art CEP systems process events as plain data tuples and are limited to detect precisely defined patterns. Emerging application areas like optimization in smart power grids require CEP to incorporate semantic knowledge of the domain for easier pattern specification, and detect inexact patterns in the presence of uncertainties. In this paper, we present motivating use cases, discuss limitations of existing CEP systems and describe our work towards an Inexact Semantic Complex Event Processing (InSCEP) framework. | ||||||
BibTeX:
@inproceedings{Zhou:debs:2011, author = {Qunzhi Zhou and Yogesh Simmhan and Viktor K. Prasanna}, title = {Towards an inexact semantic complex event processing framework}, booktitle = {International Conference on Distributed Event-Based System (DEBS)}, publisher = {ACM}, year = {2011}, pages = {401--402}, note = {Poster}, url = {http://ceng.usc.edu/ simmhan/pubs/zhou-debs-2011.pdf}, doi = {https://doi.org/10.1145/2002259.2002331} } |
||||||
Simmhan:buildsys:2011 | Simmhan, Y.; Prasanna, V.; Aman, S.; Natarajan, S.; Yin, W. & Zhou, Q. |
Towards Data-driven Demand-Response Optimization in a Campus Microgrid
|
2011 | Workshop On Embedded Sensing Systems For Energy-Efficiency In Buildings (BuildSys) , pp. 41-42 | inproceedings | usc, smart grid. information integration, cep, machine learning, peer reviewed, demo |
Abstract: We describe and demonstrate a prototype software architecture to support data-driven demand response optimization (DR) in the USC campus microgrid, as part of the Los Angeles Smart Grid Demonstration Project. The architecture includes a semantic information repository that integrates diverse data sources to support DR, demand forecasting using scalable machine-learned models, and detection of load curtailment opportunities by matching complex event patterns. | ||||||
BibTeX:
@inproceedings{Simmhan:buildsys:2011, author = {Yogesh Simmhan and Viktor Prasanna and Saima Aman and Sreedhar Natarajan and Wei Yin and Qunzhi Zhou}, title = {Towards Data-driven Demand-Response Optimization in a Campus Microgrid}, booktitle = {Workshop On Embedded Sensing Systems For Energy-Efficiency In Buildings (BuildSys)}, publisher = {ACM}, year = {2011}, pages = {41--42}, note = {Demo}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-buildsys-2011.pdf}, doi = {https://doi.org/10.1145/2434020.2434032} } |
||||||
Zinn:ccgrid:2011 | Zinn, D.; Hart, Q.; McPhillips, T.M.; Ludäscher, B.; Simmhan, Y.; Giakkoupis, M. & Prasanna, V.K. |
Towards Reliable, Performant Workflows for Streaming-Applications on Cloud Platforms
|
2011 | IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) , pp. 235-244 | inproceedings | usc, smart grid, cloud, streaming, peer reviewed, escience |
Abstract: Scientific workflows are commonplace in eScience applications. Yet, the lack of integrated support for data models, including streaming data, structured collections and files, is limiting the ability of workflows to support emerging applications in energy informatics that are stream oriented. This is compounded by the absence of Cloud data services that support reliable and performant streams. In this paper, we propose and present a scientific workflow framework that supports streams as first-class data, and is optimized for performant and reliable execution across desktop and Cloud platforms. The workflow framework features and its empirical evaluation on a private Eucalyptus Cloud are presented. | ||||||
BibTeX:
@inproceedings{Zinn:ccgrid:2011, author = {Daniel Zinn and Quinn Hart and Timothy M. McPhillips and Bertram Ludäscher and Yogesh Simmhan and Michail Giakkoupis and Viktor K. Prasanna}, title = {Towards Reliable, Performant Workflows for Streaming-Applications on Cloud Platforms}, booktitle = {IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)}, publisher = {IEEE}, year = {2011}, pages = {235--244}, note = {[CORE A]}, url = {http://ceng.usc.edu/ simmhan/pubs/zinn-ccgrid-2011.pdf}, doi = {https://doi.org/10.1109/CCGrid.2011.74} } |
||||||
Simmhan:HiPC:2011 | Simmhan, Y. & Srinivasan, A. Simmhan, Y. & Srinivasan, A. (Hrsg.) |
HiPC 2011 Student Research Symposium: Message from the co-chairs
[BibTeX] |
2011 | High Performance Computing Conference (HiPC) | proceedings | editorial, usc |
BibTeX:
@proceedings{Simmhan:HiPC:2011, author = {Yogesh Simmhan and Ashok Srinivasan}, title = {HiPC 2011 Student Research Symposium: Message from the co-chairs}, booktitle = {High Performance Computing Conference (HiPC)}, year = {2011} } |
||||||
Raicu:ScienceCloud2011 | Raicu, I.; Beckman, P.; Foster, I.T. & Simmhan, Y. Raicu, I.; Beckman, P.; Foster, I.T. & Simmhan, Y. (Hrsg.) |
Proceedings of the 2nd International Workshop on Scientific Cloud Computing (ScienceCloud)
|
2011 | proceedings | editorial, usc | |
BibTeX:
@proceedings{Raicu:ScienceCloud2011, author = {Ioan Raicu and Pete Beckman and Ian T. Foster and Yogesh Simmhan}, title = {Proceedings of the 2nd International Workshop on Scientific Cloud Computing (ScienceCloud)}, publisher = {ACM}, year = {2011}, url = {http://dx.doi.org/10.1145/1996109} } |
||||||
Barga:deb:2010 | Barga, R.; Simmhan, Y.; Withana, E.C.; Sahoo, S.; Jackson, J. & Araujo, N. Tan, W.-C. (Hrsg.) |
Provenance for Scientific Workflows: Towards Reproducible Research
|
2010 |
Data Engineering Bulletin (DEB) Vol. 33 (3) , pp. 50-59 |
article | msr, provenance, trident, workflow, peer reviewed |
BibTeX:
@article{Barga:deb:2010, author = {Roger Barga and Yogesh Simmhan and Eran Chinthaka Withana and Satya Sahoo and Jared Jackson and Nelson Araujo}, title = {Provenance for Scientific Workflows: Towards Reproducible Research}, journal = {Data Engineering Bulletin (DEB)}, publisher = {IEEE}, year = {2010}, volume = {33}, number = {3}, pages = {50--59}, url = {http://sites.computer.org/debull/A10sept/barga.pdf} } |
||||||
Simmhan:works:2010 | Simmhan, Y.; Soroush, E.; van Ingen, C.; Agarwal, D. & Ramakrishnan, L. |
BReW: Blackbox resource selection for e-Science workflows
|
2010 | IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS) , pp. 1-10 | inproceedings | msr, escience, workflow, cloud, scheduling, peer reviewed |
Abstract: Workflows are commonly used to model data intensive scientific analysis. As computational resource needs increase for eScience, emerging platforms like clouds present additional resource choices for scientists and policy makers. We introduce BReW, a tool enables users to make rapid, highlevel platform selection for their workflows using limited workflow knowledge. This helps make informed decisions on whether to port a workflow to a new platform. Our analysis of synthetic and real eScience workflows shows that using just total runtime length, maximum task fanout, and total data used and produced by the workflow, BReW can provide platform predictions comparable to whitebox models with detailed workflow knowledge. | ||||||
BibTeX:
@inproceedings{Simmhan:works:2010, author = {Yogesh Simmhan and Emad Soroush and Catharine van Ingen and Deb Agarwal and Lavanya Ramakrishnan}, title = {BReW: Blackbox resource selection for e-Science workflows}, booktitle = {IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS)}, year = {2010}, pages = {1--10}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-works-2010.pdf}, doi = {https://doi.org/10.1109/WORKS.2010.5671857} } |
||||||
Simmhan:cloud:2010 | Simmhan, Y.; van Ingen, C.; Subramanian, G. & Li, J. |
Bridging the Gap between Desktop and the Cloud for eScience Applications
|
2010 | IEEE International Cloud Computing Conference (CLOUD) , pp. 474-481 | inproceedings | msr, cloud, workflow, escience, generic worker, genomics, peer reviewed |
Abstract: The widely discussed scientific data deluge creates a need to computationally scale out eScience applications beyond the local desktop and cope with variable loads over time. Cloud computing offers a scalable, economic, on-demand model well matched to these needs. Yet cloud computing creates gaps that must be crossed to move existing science applications to the cloud. In this article, we propose a Generic Worker framework to deploy and invoke science applications in the cloud with minimal user effort and predictable cost-effective performance. Our framework addresses three distinct challenges posed by the cloud: the complexity of application deployment, invocation of cloud applications from desktop clients, and efficient transparent data transfers across desktop and the cloud. We present an implementation of the Generic Worker for the Microsoft Azure Cloud and evaluate its use for a genomics application. Our evaluation shows that the user complexity to port and scale the application is substantially reduced while introducing a negligible performance overhead of of <; 5% for the genomics application when scaling to 20 VM instances. | ||||||
BibTeX:
@inproceedings{Simmhan:cloud:2010, author = {Yogesh Simmhan and Catharine van Ingen and Girish Subramanian and Jie Li}, title = {Bridging the Gap between Desktop and the Cloud for eScience Applications}, booktitle = {IEEE International Cloud Computing Conference (CLOUD)}, publisher = {IEEE}, year = {2010}, pages = {474--481}, note = {[CORE B]}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-cloud-2010.pdf}, doi = {https://doi.org/10.1109/CLOUD.2010.72} } |
||||||
Simmhan:sciencecloud:2010 | Simmhan, Y. & Ramakrishnan, L. |
Comparison of resource platform selection approaches for scientific workflows
|
2010 | International Workshop on Scientific Cloud Computing (ScienceCloud) , pp. 445-450 | inproceedings | msr, cloud, escience, hpc, resource management, workflows, azure, scheduling, peer reviewed, short paper |
Abstract: Cloud computing is increasingly considered as an additional computational resource platform for scientific workflows. The cloud offers opportunity to scale-out applications from desktops and local cluster resources. Each platform has different properties (e.g., queue wait times in high performance systems, virtual machine startup overhead in clouds) and characteristics (e.g., custom environments in cloud) that makes choosing from these diverse resource platforms for a workflow execution a challenge for scientists. Scientists are often faced with deciding resource platform selection trade-offs with limited information on the actual workflows. While many workflow planning methods have explored resource selection or task scheduling, these methods often require fine-scale characterization of the workflow that is onerous for a scientist. In this paper, we describe our early exploratory work in using blackbox characteristics for a cost-benefit analysis of using different resource platforms. In our blackbox method, we use only limited high-level information on the workflow length, width, and data sizes. The length and width are indicative of the workflow duration and parallelism. We compare the effectiveness of this approach to other resource selection models using two exemplar scientific workflows on desktop, local cluster, HPC center, and cloud platforms. Early results suggest that the blackbox model often makes the same resource selections as a more fine-grained whitebox model. We believe the simplicity of the blackbox model can help inform a scientist on the applicability of a new resource platform, such as cloud resources, even before porting an existing workflow. | ||||||
BibTeX:
@inproceedings{Simmhan:sciencecloud:2010, author = {Yogesh Simmhan and Lavanya Ramakrishnan}, title = {Comparison of resource platform selection approaches for scientific workflows}, booktitle = {International Workshop on Scientific Cloud Computing (ScienceCloud)}, publisher = {ACM}, year = {2010}, pages = {445--450}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-sciencecloud-2010.pdf}, doi = {https://doi.org/10.1145/1851476.1851541} } |
||||||
Simmhan:cloudcom:2010 | Simmhan, Y.; Giakkoupis, M.; Cao, B. & Prasanna, V.K. |
On Using Cloud Platforms in a Software Architecture for Smart Energy Grids
|
2010 | International Conference on Cloud Computing Technology and Science (CloudCom) , pp. 1-3 | inproceedings | usc, energy informatics, smart grid, cloud, poster, peer reviewed |
Abstract: Increasing concern about energy consumption is leading to infrastructure that continuously monitors consumer energy usage and allow power utilities to provide dynamic feedback to curtail peak power load. Smart Grid infrastructure being deployed globally needs scalable software platforms to rapidly integrate and analyze information streaming from millions of smart meters, forecast power usage and respond to operational events. Cloud platforms are well suited to support such data and compute intensive, always-on applications. We examine opportunities and challenges of using cloud platforms for such applications in the emerging domain of energy informatics. | ||||||
BibTeX:
@inproceedings{Simmhan:cloudcom:2010, author = {Yogesh Simmhan and Michail Giakkoupis and Baohua Cao and Viktor K. Prasanna}, title = {On Using Cloud Platforms in a Software Architecture for Smart Energy Grids}, booktitle = {International Conference on Cloud Computing Technology and Science (CloudCom)}, publisher = {IEEE}, year = {2010}, pages = {1--3}, note = {Poster [CORE C]}, url = {http://salsahpc.indiana.edu/CloudCom2010/EPoster/cloudcom2010_submission_269.pdf} } |
||||||
Simmhan:ipaw:2010 | Simmhan, Y. & Gomadam, K. McGuinness, D.; Michaelis, J. & Moreau, L. (Hrsg.) |
Social Web-Scale Provenance in the Cloud
|
2010 |
Vol. 6378 International Provenance and Annotation Workshop (IPAW) , pp. 298-300 |
inproceedings | msr, provenance, social network, cloud, poster, peer reviewed, short paper |
Abstract: The lower barrier to entry for users to create and share resources through applications like Facebook and Twitter, and the commoditization of social Web data has heightened issues of privacy, attribution, and copyright. These make it important to track the provenance of social Web data. We outline and discuss key engineering, privacy, and monetization challenges in collecting and analyzing provenance of social Web resources. | ||||||
BibTeX:
@inproceedings{Simmhan:ipaw:2010, author = {Yogesh Simmhan and Karthik Gomadam}, title = {Social Web-Scale Provenance in the Cloud}, booktitle = {International Provenance and Annotation Workshop (IPAW)}, publisher = {Springer Berlin / Heidelberg}, year = {2010}, volume = {6378}, pages = {298--300}, url = {http://ceng.usc.edu/ simmhan/pubs/simmhan-ipaw-2010.pdf}, doi = {https://doi.org/10.1007/978-3-642-17819-1_39} } |
||||||
Zinn:works:2010 | Zinn, D.; Hart, Q.; Ludascher, B. & Simmhan, Y. |
Streaming satellite data to cloud workflows for on-demand computing of environmental data products
|
2010 | Workshop on Workflows in Support of Large-Scale Science (WORKS) , pp. 1-8 | inproceedings | usc, streaming, workflow, cloud, escience, peer reviewed |
Abstract: Environmental data arriving constantly from satellites and weather stations are used to compute weather coefficients that are essential for agriculture and viticulture. For example, the reference evapotranspiration (ET0) coefficient, overlaid on regional maps, is provided each day by the California Department of Water Resources to local farmers and turf managers to plan daily water use. Scaling out single-processor compute/data intensive applications operating on realtime data to support more users and higher-resolution data poses data engineering challenges. Cloud computing helps data providers expand resource capacity to meet growing needs besides supporting scientific needs like reprocessing historic data using new models. In this article, we examine migration of a legacy script used for daily ET |
||||||
BibTeX:
@inproceedings{Zinn:works:2010, author = {Daniel Zinn and Quinn Hart and Bertram Ludascher and Yogesh Simmhan}, title = {Streaming satellite data to cloud workflows for on-demand computing of environmental data products}, booktitle = {Workshop on Workflows in Support of Large-Scale Science (WORKS)}, publisher = {IEEE}, year = {2010}, pages = {1--8}, url = {http://ceng.usc.edu/ simmhan/pubs/zinn-works-2010.pdf}, doi = {https://doi.org/10.1109/WORKS.2010.5671841} } |
||||||
Simmhan:escience:2009 | Simmhan, Y.; van Ingen, C.; Szalay, A.; Barga, R. & Heasley, J. |
Building Reliable Data Pipelines for Managing Community Data Using Scientific Workflows
|
2009 | IEEE International Conference on eScience (eScience) , pp. 321-328 | inproceedings | msr, workflows, data management, cloud, panstarrs, escience, peer reviewed |
Abstract: The growing amount of scientific data from sensors and field observations is posing a challenge to ᅢツᅡdata valetsᅢツᅡ responsible for managing them in data repositories. These repositories built on commodity clusters need to reliably ingest data continuously and ensure its availability to a wide user community. Workflows provide several benefits to modeling data-intensive science applications and many of these benefits can help manage the data ingest pipelines too. But using workflows is not panacea in itself and data valets need to consider several issues when designing workflows that behave reliably on fault prone hardware while retaining the consistency of the scientific data. In this paper, we propose workflow designs for reliable data ingest in a distributed environment and identify workflow framework features to support resilience. We illustrate these using the data pipeline for the Pan-STARRS repository, one of the largest digital surveys that accumulates 100TB of data annually to support 300 astronomers. | ||||||
BibTeX:
@inproceedings{Simmhan:escience:2009, author = {Yogesh Simmhan and Catharine van Ingen and Alex Szalay and Roger Barga and Jim Heasley}, title = {Building Reliable Data Pipelines for Managing Community Data Using Scientific Workflows}, booktitle = {IEEE International Conference on eScience (eScience)}, publisher = {IEEE}, year = {2009}, pages = {321--328}, note = {[CORE A]}, doi = {https://doi.org/10.1109/e-Science.2009.52} } |
||||||
Simmhan:advcomp:2009 | Simmhan, Y.; Barga, R.; van Ingen, C.; Lazowska, E. & Szalay, A. |
Building the Trident Scientific Workflow Workbench for Data Management in the Cloud
|
2009 | Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP) , pp. 41-50 | inproceedings | msr, workflows, escience, data management, cloud, hpc, trident, panstarrs, peer reviewed |
Abstract: Scientific workflows have gained popularity for modeling and executing in silico experiments by scientists for problem-solving. These workflows primarily engage in computation and data transformation tasks to perform scientific analysis in the Science Cloud. Increasingly workflows are gaining use in managing the scientific data when they arrive from external sensors and are prepared for becoming science ready and available for use in the Cloud. While not directly part of the scientific analysis, these workflows operating behind the Cloud on behalf of the -data valetsᅢツᅡ play an important role in end-to-end management of scientific data products. They share several features with traditional scientific workflows: both are data intensive and use Cloud resources. However, they also differ in significant respects, for example, in the reliability required, scheduling constraints and the use of provenance collected. In this article, we investigate these two classes of workflows - Science Application workflows and Data Preparation workflows - and use these to drive common and distinct requirements from workflow systems for eScience in the Cloud. We use workflow examples from two collaborations, the NEPTUNE oceanography project and the Pan-STARRS astronomy project, to draw out our comparison. Our analysis of these workflows classes can guide the evolution of workflow systems to support emerging applications in the Cloud and the Trident Scientific Workbench is one such workflow system that has directly benefitted from this to meet the needs of these two eScience projects. | ||||||
BibTeX:
@inproceedings{Simmhan:advcomp:2009, author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Ed Lazowska and Alex Szalay}, title = {Building the Trident Scientific Workflow Workbench for Data Management in the Cloud}, booktitle = {Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP)}, publisher = {IEEE}, year = {2009}, pages = {41--50}, doi = {https://doi.org/10.1109/ADVCOMP.2009.14} } |
||||||
Simmhan:hicss:2009 | Simmhan, Y.; Barga, R.; van Ingen, C.; Nieto-Santisteban, M.; Dobos, L.; Li, N.; Shipway, M.; Szalay, A.S.; Werner, S. & Heasley, J. |
GrayWulf: Scalable Software Architecture for Data Intensive Computing
|
2009 | Hawaii International Conference on System Sciences (HICSS) , pp. 1-10 | inproceedings | msr, workflows, escience, data management, cloud, hpc, trident, graywulf, panstarrs, peer reviewed |
Abstract: Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf. These services are intended for deployment on a cluster of commodity servers similar to the well-known Beowulf clusters. We use the Pan-STARRS system currently under development as an example of the architecture and principles in action. | ||||||
BibTeX:
@inproceedings{Simmhan:hicss:2009, author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Maria Nieto-Santisteban and Lazslo Dobos and Nolan Li and Michael Shipway and Alexander S. Szalay and Sue Werner and Jim Heasley}, title = {GrayWulf: Scalable Software Architecture for Data Intensive Computing}, booktitle = {Hawaii International Conference on System Sciences (HICSS)}, publisher = {IEEE}, year = {2009}, pages = {1--10}, note = {[CORE A]}, doi = {https://doi.org/10.1109/HICSS.2009.235} } |
||||||
Cao:swf:2009 | Cao, B.; Plale, B.; Subramanian, G.; Robertson, E. & Simmhan, Y. |
Provenance Information Model of Karma Version 3
|
2009 | International Workshop on Scientific Workflows (SWF) , pp. 348-351 | inproceedings | msr, karma, provenance, workflow, peer reviewed |
Abstract: Provenance that captures e-Science activity has long term value only if the right amount and kind of information is collected. In this paper, we propose a two-layer model for representing provenance information capable of representing both execution information and higher level process details. The information model forms the basis for efficient relational database storage and query, and sets the stage for investigation of the necessary and sufficient information for long-term preservation. | ||||||
BibTeX:
@inproceedings{Cao:swf:2009, author = {Bin Cao and Beth Plale and Girish Subramanian and Ed Robertson and Yogesh Simmhan}, title = {Provenance Information Model of Karma Version 3}, booktitle = {International Workshop on Scientific Workflows (SWF)}, publisher = {IEEE}, year = {2009}, pages = {348--351}, doi = {https://doi.org/10.1109/SERVICES-I.2009.54} } |
||||||
Cao:swpm:2009 | Cao, B.; Plale, B.; Subramanian, G.; Missier, P.; Goble, C. & Simmhan, Y. Freire, J.; Missier, P. & Sahoo, S.S. (Hrsg.) |
Semantically Annotated Provenance in the Life Science Grid
|
2009 |
Vol. 526 International Workshop on the role of Semantic Web in Provenance Management (SWPM) , pp. 1-6 |
inproceedings | msr, provenance, karma, lsg, semantic web, life sciences, escience, peer reviewed |
Abstract: Selected semantic annotation on raw provenance data can help bridge the gap between low level provenance events (e.g., service invocations, data creation, message passing) and the high-level view that the user has of his/her investigation (e.g., data retrieval and analysis). In this initial investigation we added semantically annotated provenance to the Life Science Grid, a cyber-infrastructure framework supporting interactive data exploration and automated data analysis tools, through (i) automated data provenance collection and (ii) automated semantic enrichment of the collected provenance metadata. We use a paradigmatic life sciences use case of interactive data exploration to show that semantically annotated provenance can help users recognize the occurrence of specific patterns of investigation from an otherwise low-level sequence of elementary interaction events. | ||||||
BibTeX:
@inproceedings{Cao:swpm:2009, author = {Bin Cao and Beth Plale and Girish Subramanian and Paolo Missier and Carole Goble and Yogesh Simmhan}, title = {Semantically Annotated Provenance in the Life Science Grid}, booktitle = {International Workshop on the role of Semantic Web in Provenance Management (SWPM)}, publisher = {CEUR-WS.org}, year = {2009}, volume = {526}, pages = {1--6}, url = {http://ceur-ws.org/Vol-526/paper_5.pdf} } |
||||||
Simmhan:ijwsr:2008 | Simmhan, Y.L.; Plale, B. & Gannon, D. |
Karma2: Provenance Management for Data-Driven Workflows
|
2008 |
International Journal of Web Services Research (IJWSR) Vol. 5 (2) , pp. 1-22 |
article | msr, provenance, karma, workflow, escience, peer reviewed |
Abstract: The increasing ability for the sciences to sense the world around us is resulting in a growing need for datadriven e-Science applications that are under the control of workflows composed of services on the Grid. The focus of our work is on provenance collection for these workflows that are necessary to validate the work-flow and to determine quality of generated data products. The challenge we address is to record uniform and usable provenance metadata that meets the domain needs while minimizing the modification burden on the service authors and the performance overhead on the workflow engine and the services. The framework is based on generating discrete provenance activities during the lifecycle of a workflow execution that can be aggregated to form complex data and process provenance graphs that can span across workflows. The implementation uses a loosely coupled publish-subscribe architecture for propagating these activities, and the capabilities of the system satisfy the needs of detailed provenance collection. A performance evaluation of a prototype finds a minimal performance overhead (in the range of 1% for an eight-service workflow using 271 data products). | ||||||
BibTeX:
@article{Simmhan:ijwsr:2008, author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon}, title = {Karma2: Provenance Management for Data-Driven Workflows}, journal = {International Journal of Web Services Research (IJWSR)}, publisher = {IGI Publishing}, year = {2008}, volume = {5}, number = {2}, pages = {1--22}, note = {[IF 0.371, CORE C]}, doi = {https://doi.org/10.4018/jwsr.2008040101} } |
||||||
Simmhan:cpe:2008 | Simmhan, Y.L.; Plale, B. & Gannon, D. |
Query capabilities of the Karma provenance framework
|
2008 |
Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge Vol. 20 , pp. 441-451 |
article | iu, provenance, data provenance, process provenance, provenance queries, workflows, karma, escience, provenance challenge, peer reviewed |
Abstract: Provenance metadata in e-Science captures the derivation history of data products generated from scientific workflows. Provenance forms a glue linking workflow execution with associated data products, and finds use in determining the quality of derived data, tracking resource usage, and for verifying and validating scientific experiments. In this article, we discuss the scope of provenance collected in the Karma provenance framework used in the LEAD Cyberinfrastructure project, distinguishing provenance metadata from generic annotations. We further describe our approaches to querying for different forms of provenance in Karma in the context of queries in the first provenance challenge. We use an incremental, building-block method to construct provenance queries based on the fundamental querying capabilities provided by the Karma service centered on the provenance data model. This has the advantage of keeping the Karma service generic and simple, and yet supports a wide range of queries. Karma successfully answers all but one challenge query. Copyright © 2007 John Wiley & Sons, Ltd. | ||||||
BibTeX:
@article{Simmhan:cpe:2008, author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon}, title = {Query capabilities of the Karma provenance framework}, journal = {Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge}, publisher = {John Wiley and Sons Ltd.}, year = {2008}, volume = {20}, pages = {441--451}, note = {[IF 0.636, CORE A]}, doi = {https://doi.org/10.1002/cpe.v20:5} } |
||||||
Moreau:cpe:2008 | Moreau, L.; Ludäscher, B.; Altintas, I.; Barga, R.S.; Bowers, S.; Callahan, S.; George Chin, J.; Clifford, B.; Cohen, S.; Cohen-Boulakia, S.; Davidson, S.; Deelman, E.; Digiampietri, L.; Foster, I.; Freire, J.; Frew, J.; Futrelle, J.; Gibson, T.; Gil, Y.; Goble, C.; Golbeck, J.; Groth, P.; Holland, D.A.; Jiang, S.; Kim, J.; Koop, D.; Krenek, A.; McPhillips, T.; Mehta, G.; Miles, S.; Metzger, D.; Munroe, S.; Myers, J.; Plale, B.; Podhorszki, N.; Ratnakar, V.; Santos, E.; Scheidegger, C.; Schuchardt, K.; Seltzer, M.; Simmhan, Y.L.; Silva, C.; Slaughter, P.; Stephan, E.; Stevens, R.; Turi, D.; Vo, H.; Wilde, M.; Zhao, J. & Zhao, Y. |
Special Issue: The First Provenance Challenge
|
2008 |
Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge Vol. 20 , pp. 409-418 |
article | iu, provenance, provenance challenge |
Abstract: The first Provenance Challenge was set up in order to provide a forum for the community to understand the capabilities of different provenance systems and the expressiveness of their provenance representations. To this end, a functional magnetic resonance imaging workflow was defined, which participants had to either simulate or run in order to produce some provenance representation, from which a set of identified queries had to be implemented and executed. Sixteen teams responded to the challenge, and submitted their inputs. In this paper, we present the challenge workflow and queries, and summarize the participants' contributions. Copyright © 2007 John Wiley & Sons, Ltd. | ||||||
BibTeX:
@article{Moreau:cpe:2008, author = {Luc Moreau and Bertram Ludäscher and Ilkay Altintas and Roger S. Barga and Shawn Bowers and Steven Callahan and George Chin, Jr. and Ben Clifford and Shirley Cohen and Sarah Cohen-Boulakia and Susan Davidson and Ewa Deelman and Luciano Digiampietri and Ian Foster and Juliana Freire and James Frew and Joe Futrelle and Tara Gibson and Yolanda Gil and Carole Goble and Jennifer Golbeck and Paul Groth and David A. Holland and Sheng Jiang and Jihie Kim and David Koop and Ales Krenek and Timothy McPhillips and Gaurang Mehta and Simon Miles and Dominic Metzger and Steve Munroe and Jim Myers and Beth Plale and Norbert Podhorszki and Varun Ratnakar and Emanuele Santos and Carlos Scheidegger and Karen Schuchardt and Margo Seltzer and Yogesh L. Simmhan and Claudio Silva and Peter Slaughter and Eric Stephan and Robert Stevens and Daniele Turi and Huy Vo and Mike Wilde and Jun Zhao and Yong Zhao}, title = {Special Issue: The First Provenance Challenge}, journal = {Concurrency and Computation: Practice & Experience, Special Issue on The First Provenance Challenge}, publisher = {John Wiley and Sons Ltd.}, year = {2008}, volume = {20}, pages = {409-418}, note = {[CORE A]}, doi = {https://doi.org/10.1002/cpe.v20:5} } |
||||||
Gannon:hpcbook:2008 | Gannon, D.; Plale, B.; Christie, M.; Huang, Y.; Jensen, S.; Liu, N.; Marru, S.; Pallickara, S.; Perera, S.; Shirasuna, S.; Simmhan, Y.; Slominski, A.; Sun, Y. & Vijayakumar, N. Grandinetti, L. (Hrsg.) |
Building Grid Portals for e-Science: A Service Oriented Architecture (
High Performance Computing and Grids in Action
)
|
2008 |
High Performance Computing and Grids in Action Vol. 16 , pp. 149-166 |
inbook | iu,escience, portal, web service, lead, peer reviewed |
Abstract: Grids are built by communities who need a shared cyberinfrastructure to make progress on the critical problems they are currently confronting. An e-science portal is a conventional Web portal that sits on top of a rich collection of web services that allow a community of users access to shared data and application resources without exposing them to the details of Grid computing. In this chapter we describe a service-oriented architecture to support this type of portal. | ||||||
BibTeX:
@inbook{Gannon:hpcbook:2008, author = {Dennis Gannon and Beth Plale and Marcus Christie and Yi Huang and Scott Jensen and Ning Liu and Suresh Marru and Sangmi Pallickara and Srinath Perera and Satoshi Shirasuna and Yogesh Simmhan and Aleksander Slominski and Yiming Sun and Nithya Vijayakumar}, title = {High Performance Computing and Grids in Action}, publisher = {IOS Press}, year = {2008}, volume = {16}, pages = {149--166}, url = {http://www.booksonline.iospress.nl/Content/View.aspx?piid=8567} } |
||||||
Barga:clade:2008 | Barga, R.S.; Fay, D.; Guo, D.; Newhouse, S.; Simmhan, Y. & Szalay, A. |
Efficient scheduling of scientific workflows in a high performance computing cluster
|
2008 | International Workshop on Challenges of Large Applications in Distributed Environments (CLADE) , pp. 63-68 | inproceedings | msr, data intensive, escience, scheduling, workflow, hpc, peer reviewed |
Abstract: The scientific computing community, especially academia is clearly in need of technology to handle and organize the 1-100+ Terabyte datasets coming from computer simulations and scientific instrumentation. In this paper we briefly describe GrayWulf, an exemplar cluster for data intensive applications using SQL Server and HPC Clusters. One of the key software components of GrayWulf is Trident, a scientific workflow workbench that performs automatic scheduling of workflows across the cluster. We examine the challenges of scheduling workflows on GrayWulf, algorithms to improve performance, and present early results from applying Trident to schedule data loading workflows on GrayWulf for an actual e-Science project | ||||||
BibTeX:
@inproceedings{Barga:clade:2008, author = {Roger S. Barga and Dan Fay and Dean Guo and Steven Newhouse and Yogesh Simmhan and Alex Szalay}, title = {Efficient scheduling of scientific workflows in a high performance computing cluster}, booktitle = {International Workshop on Challenges of Large Applications in Distributed Environments (CLADE)}, publisher = {ACM}, year = {2008}, pages = {63--68}, note = {[CORE C]}, doi = {https://doi.org/10.1145/1383529.1383545} } |
||||||
Simmhan:escience:2008 | Simmhan, Y.; Barga, R.; van Ingen, C.; Lazowska, E. & Szalay, A. |
On Building Scientific Workflow Systems for Data Management in the Cloud
|
2008 | IEEE International Conference on eScience (eScience) , pp. 434-435 | inproceedings | msr, workflows, escience, data management, cloud, hpc, trident, panstarrs, poster, peer reviewed |
Abstract: Scientific workflows have become an archetype to model in silico experiments in the Cloud by scientists. There is a class of workflows that are used to by "data valets" to prepare raw data from scientific instruments into a science-ready form for use by scientists. These share data-intensive traits with traditional scientific workflows, yet differ significantly, for example, in the required degree of reliability and the type of provenance collected. We compare and contrast science application and data valet workflows through exemplar eScience projects to drive shared and unique requirements for scientific workflows across diverse users in a Science Cloud. | ||||||
BibTeX:
@inproceedings{Simmhan:escience:2008, author = {Yogesh Simmhan and Roger Barga and Catharine van Ingen and Ed Lazowska and Alex Szalay}, title = {On Building Scientific Workflow Systems for Data Management in the Cloud}, booktitle = {IEEE International Conference on eScience (eScience)}, publisher = {IEEE}, year = {2008}, pages = {434--435}, note = {Poster [CORE A]}, doi = {https://doi.org/10.1109/eScience.2008.150} } |
||||||
Barga:escience:2008 | Barga, R.; Jackson, J.; Araujo, N.; Guo, D.; Gautam, N. & Simmhan, Y. |
The Trident Scientific Workflow Workbench
|
2008 | IEEE International Conference on eScience (eScience) , pp. 317-318 | inproceedings | msr, workflows, escience, trident, panstarrs, neptune, demo, peer reviewed |
Abstract: In our demonstration we present Trident, a scientific workflow workbench built on top of a commercial workflow system to leverage existing functionality to the extent possible. Trident is being developed in collaboration with the scientific computing community for use in a number of ongoing eScience projects that make use of scientific workflows, in particular the Pan-STARRS sky survey project and the Ocean Observatory Initiative. In our demonstration of Trident we will illustrate the ability to utilize both local and cloud resources for storage and execution, as well as services such as provenance, monitoring, logging and scheduling workflows over clusters. Our goal is to release Trident in early 2009 as an open source accelerator for others to use for eScience projects and to continue extending with support for new workflow features and services. | ||||||
BibTeX:
@inproceedings{Barga:escience:2008, author = {Roger Barga and Jared Jackson and Nelson Araujo and Dean Guo and Nitin Gautam and Yogesh Simmhan}, title = {The Trident Scientific Workflow Workbench}, booktitle = {IEEE International Conference on eScience (eScience)}, publisher = {IEEE}, year = {2008}, pages = {317--318}, note = {Demo [CORE A]}, doi = {https://doi.org/10.1109/eScience.2008.126} } |
||||||
Gannon:wfbook:2007 | Gannon, D.; Plale, B.; Marru, S.; Kandaswamy, G.; Simmhan, Y. & Shirasuna, S. Gannon, D.; Deelman, E.; Shields, M. & Taylor, I. (Hrsg.) |
Dynamic, Adaptive Workflows for Mesoscale Meteorology (
Workflows for eScience: Scientific Workflows for Grids
)
|
2007 | Workflows for eScience: Scientific Workflows for Grids , pp. 126-142 | inbook | iu, workflows, grid, escience, peer reviewed |
Abstract: The Linked Environments for Atmospheric Discovery (LEAD) [122] is a National Science Foundation funded1 project to change the paradigm for mesoscale weather prediction from one of static, fixed-schedule computational forecasts to one that is adaptive and driven by weather events. It is a collaboration of eight institutions,2 led by Kelvin Droegemeier of the University of Oklahoma, with the goal of enabling far more accurate and timely predictions of tornadoes and hurricanes than previously considered possible. The traditional approach to weather prediction is a four-phase activity. In the first phase, data from sensors are collected. The sensors include ground instruments such as humidity and temperature detectors, and lightning strike detectors and atmospheric measurements taken from balloons, commercial aircraft, radars, and satellites. The second phase is data assimilation, in which the gathered data are merged together into a set of consistent initial and boundary conditions for a large simulation. The third phase is the weather prediction, which applies numerical equations to measured conditions in order to project future weather conditions. The final phase is the generation of visual images of the processed data products that are analyzed to make predictions. Each phase of activity is performed by one or more application components. | ||||||
BibTeX:
@inbook{Gannon:wfbook:2007, author = {Dennis Gannon and Beth Plale and Suresh Marru and Gopi Kandaswamy and Yogesh Simmhan and Satoshi Shirasuna}, title = {Workflows for eScience: Scientific Workflows for Grids}, publisher = {Springer London}, year = {2007}, pages = {126--142}, doi = {https://doi.org/10.1007/978-1-84628-757-2_9} } |
||||||
Simmhan:gbpse:2006 | Simmhan, Y.; Pallickara, S.; Vijayakumar, N. & Plale, B. Gaffney, P. & Pool, J. (Hrsg.) |
Data Management in Dynamic Environment-driven Computational Science
|
2007 |
Vol. 239 Grid-Based Problem Solving Environments , pp. 317-333 |
inproceedings | iu, data management, lead, provenance, portal, mylead, karma, calder, escience, peer reviewed |
Abstract: Advances in numerical modeling, computational hardware and problem solving environments have driven the growth of computational science over the past decades. Science gateways, based on service oriented architectures and scientific workflows, provide yet another step in democratizing access to advanced numerical and scientific tools, computational resource and massive data storage, and fostering collaborations. Dynamic, data-driven applications, such as those found in weather forecasting, present interesting challenges to Science Gateways, which are being addressed as part of the LEAD Cyberinfrastructure project. In this article, we discuss three important data related problems faced by such adaptive data-driven environments: managing a user’s personal workspace and metadata on the Grid, tracking the provenance of scientific workflows and data products, and continuous data mining over observational weather data. | ||||||
BibTeX:
@inproceedings{Simmhan:gbpse:2006, author = {Yogesh Simmhan and Sangmi Pallickara and Nithya Vijayakumar and Beth Plale}, title = {Data Management in Dynamic Environment-driven Computational Science}, booktitle = {Grid-Based Problem Solving Environments}, publisher = {Springer Boston}, year = {2007}, volume = {239}, pages = {317--333}, doi = {https://doi.org/10.1007/978-0-387-73659-4_17} } |
||||||
Ramakrishnan:iccs:2007 | Ramakrishnan, L.; Simmhan, Y. & Plale, B. Shi, Y.; van Albada, G.; Dongarra, J. & Sloot, P. (Hrsg.) |
Realization of Dynamically Adaptive Weather Analysis and Forecasting in LEAD: Four Years Down the Road
|
2007 |
Vol. 4487 International Conference on Computational Science (ICCS) , pp. 1122-1129 |
inproceedings | iu, lead, escience, workflow, peer reviewed |
Abstract: Linked Environments for Atmospheric Discovery (LEAD) is a large-scale cyberinfrastructure effort in support of mesoscale meteorology. One of the primary goals of the infrastructure is support for real-time dynamic, adaptive response to severe weather. In this paper we revisit the conception of dynamic adaptivity as appeared in our 2005 DDDAS workshop paper, and discuss changes since the original conceptualization, and lessons learned in working with a complex service oriented architecture in support of data driven science. | ||||||
BibTeX:
@inproceedings{Ramakrishnan:iccs:2007, author = {Ramakrishnan, Lavanya and Simmhan, Yogesh and Plale, Beth}, title = {Realization of Dynamically Adaptive Weather Analysis and Forecasting in LEAD: Four Years Down the Road}, booktitle = {International Conference on Computational Science (ICCS)}, publisher = {Springer Berlin / Heidelberg}, year = {2007}, volume = {4487}, pages = {1122--1129}, note = {[CORE A]}, doi = {https://doi.org/10.1007/978-3-540-72584-8_147} } |
||||||
Simmhan:icws:2006 | Simmhan, Y.L.; Plale, B. & Gannon, D. |
A Framework for Collecting Provenance in Data-Centric Scientific Workflows
|
2006 | International Conference on Web Services (ICWS) , pp. 427-436 | inproceedings | iu, provenance, escience, karma, workflows, peer reviewed |
Abstract: The increasing ability for the earth sciences to sense the world around us is resulting in a growing need for data-driven applications that are under the control of data-centric workflows composed of grid- and web- services. The focus of our work is on provenance collection for these workflows, necessary to validate the workflow and to determine quality of generated data products. The challenge we address is to record uniform and usable provenance metadata that meets the domain needs while minimizing the modification burden on the service authors and the performance overhead on the workflow engine and the services. The framework, based on a loosely-coupled publish-subscribe architecture for propagating provenance activities, satisfies the needs of detailed provenance collection while a performance evaluation of a prototype finds a minimal performance overhead (in the range of 1% for an eight service workflow using 271 data products). | ||||||
BibTeX:
@inproceedings{Simmhan:icws:2006, author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon}, title = {A Framework for Collecting Provenance in Data-Centric Scientific Workflows}, booktitle = {International Conference on Web Services (ICWS)}, publisher = {IEEE}, year = {2006}, pages = {427--436}, note = {[CORE A]}, doi = {https://doi.org/10.1109/ICWS.2006.5} } |
||||||
Simmhan:ipaw:2006 | Simmhan, Y.L.; Plale, B. & Gannon, D. Moreau, L. & Foster, I. (Hrsg.) |
Performance Evaluation of the Karma Provenance Framework for Scientific Workflows
|
2006 |
Vol. 4145 International Provenance and Annotation Workshop (IPAW) , pp. 222-236 |
inproceedings | iu, provenance, escience, karma, workflows, peer reviewed |
Abstract: Provenance about workflow executions and data derivations in scientific applications help estimate data quality, track resources, and validate in silico experiments. The Karma provenance framework provides a means to collect workflow, process, and data provenance from data-driven scientific workflows and is used in the Linked Environments for Atmospheric Discovery (LEAD) project. This paper presents a performance analysis of the Karma service as compared against the contemporary PReServ provenance service. Our study finds that Karma scales exceedingly well for collecting and querying provenance records, showing linear or sub-linear scaling with increasing number of provenance records and clients when tested against workloads in the order of 10,000 application-service invocations and over 36 concurrent clients. | ||||||
BibTeX:
@inproceedings{Simmhan:ipaw:2006, author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon}, title = {Performance Evaluation of the Karma Provenance Framework for Scientific Workflows}, booktitle = {International Provenance and Annotation Workshop (IPAW)}, publisher = {Springer Berlin / Heidelberg}, year = {2006}, volume = {4145}, pages = {222--236}, doi = {https://doi.org/10.1007/11890850_23} } |
||||||
Simmhan:sciflow:2006 | Simmhan, Y.L.; Plale, B. & Gannon, D. |
Towards a Quality Model for Effective Data Selection in Collaboratories
|
2006 | Workshop on Workflow and Data Flow for Scientific Applications (SciFlow) , pp. 1-4 | inproceedings | iu, provenance, escience, karma, workflows, short paper, peer reviewed |
Abstract: Data-driven scientific applications utilize workflow frameworks to execute complex dataflows, resulting in derived data products of unknown quality. We discuss our on-going research on a quality model that provides users with an integrated estimate of the data quality that is tuned to their application needs, and is available as a numerical quality score that enables uniform comparison of datasets, and increases community’s trust in derived data. | ||||||
BibTeX:
@inproceedings{Simmhan:sciflow:2006, author = {Yogesh L. Simmhan and Beth Plale and Dennis Gannon}, title = {Towards a Quality Model for Effective Data Selection in Collaboratories}, booktitle = {Workshop on Workflow and Data Flow for Scientific Applications (SciFlow)}, publisher = {IEEE}, year = {2006}, pages = {1--4}, doi = {https://doi.org/10.1109/ICDEW.2006.150} } |
||||||
Simmhan:record:2005 | Simmhan, Y.; Plale, B. & Gannon, D. |
A Survey of Data Provenance in e-Science
|
2005 |
SIGMOD Record Vol. 34 (3) , pp. 31-36 |
article | iu, provenance, escience, peer reviewed |
Abstract: Data management is growing in complexity as large-scale applications take advantage of the loosely coupled resources brought together by grid middleware and by abundant storage capacity. Metadata describing the data products used in and generated by these applications is essential to disambiguate the data and enable reuse. Data provenance, one kind of metadata, pertains to the derivation history of a data product starting from its original sources. In this paper we create a taxonomy of data provenance characteristics and apply it to current research efforts in e-science, focusing primarily on scientific workflow approaches. The main aspect of our taxonomy categorizes provenance systems based on why they record provenance, what they describe, how they represent and store provenance, and ways to disseminate it. The survey culminates with an identification of open research problems in the field. | ||||||
BibTeX:
@article{Simmhan:record:2005, author = {Yogesh Simmhan and Beth Plale and Dennis Gannon}, title = {A Survey of Data Provenance in e-Science}, journal = {SIGMOD Record}, publisher = {ACM}, year = {2005}, volume = {34}, number = {3}, pages = {31--36}, note = {[IF 0.667]}, doi = {https://doi.org/10.1145/1084805.1084812} } |
||||||
Gannon:ieee:2005 | Gannon, D.; Alameda, J.; Chipara, O.; Christie, M.; Dukle, V.; Fang, L.; Farellee, M.; Fox, G.; Hampton, S.; Kandaswamy, G.; Kodeboyina, D.; Moad, C.; Pierce, M.; Plale, B.; Rossi, A.; Simmhan, Y.; Sarangi, A.; Slominski, A.; Shirasauna, S. & Thomas, T. |
Building Grid Portal Applications from a Web-Service Component Architecture
|
2005 |
Proceedings of the IEEE, Special issue on Grid Computing Vol. 93 (3) , pp. 551-563 |
article | iu,grid, portal,web service, peer reviewed |
Abstract: This paper describes an approach to building Grid applications based on the premise that users who wish to access and run these applications prefer to do so without becoming experts on Grid technology. We describe an application architecture based on wrapping user applications and application workflows as web services and web service resources.These services are visible to the users and to resource providers through a family of Grid portal components that can be used to configure, launch and monitor complex applications in the scientific language of the end user. The applications in this model are instantiated by an application factory service. The layered design of the architecture makes it possible for an expert to configure an application factory service with a custom user interface client that may be dynamical loaded into the portal. | ||||||
BibTeX:
@article{Gannon:ieee:2005, author = {Dennis Gannon and Jay Alameda and Octav Chipara and Marcus Christie and Vinayak Dukle and Liang Fang and Matthew Farellee and Geoffrey Fox and Shawn Hampton and Gopi Kandaswamy and Deepti Kodeboyina and Charlie Moad and Marlon Pierce and Beth Plale and Albert Rossi and Yogesh Simmhan and Anuraag Sarangi and Aleksander Slominski and Satoshi Shirasauna and Thomas Thomas}, title = {Building Grid Portal Applications from a Web-Service Component Architecture}, journal = {Proceedings of the IEEE, Special issue on Grid Computing}, publisher = {IEEE}, year = {2005}, volume = {93}, number = {3}, pages = {551--563}, note = {[IF 6.81]}, doi = {https://doi.org/10.1109/JPROC.2004.842756} } |
||||||
Gannon:icsoc:2005 | Gannon, D.; Plale, B.; Christie, M.; Fang, L.; Huang, Y.; Jensen, S.; Kandaswamy, G.; Marru, S.; Pallickara, S.L.; Shirasuna, S.; Simmhan, Y.; Slominski, A. & Sun, Y. Benatallah, B.; Casati, F. & Traverso, P. (Hrsg.) |
Service Oriented Architectures for Science Gateways on Grid Systems
|
2005 |
Vol. 3826 International Conference on Service-Oriented Computing (ICSOC) , pp. 21-32 |
inproceedings | iu, portal, web service, grid, peer reviewed |
Abstract: Grid computing is about allocating distributed collections of resources including computers, storage systems, networks and instruments to form a coherent system devoted to a “virtual organization” of users who share a common interest in solving a complex problem or building an efficient agile enterprise. Service oriented architectures have emerged as the standard way to build Grids. This paper provides a brief look at the Open Grid Service Architecture, a standard being proposed by the Global Grid Forum, which provides the foundational concepts of most Grid systems. Above this Grid foundation is a layer of application-oriented services that are managed by workflow tools and “science gateway” portals that provide users transparent access to the applications that use the resources of a Grid. In this paper we will also describe these Gateway framework services and discuss how they relate to and use Grid services. | ||||||
BibTeX:
@inproceedings{Gannon:icsoc:2005, author = {Dennis Gannon and Beth Plale and Marcus Christie and Liang Fang and Yi Huang and Scott Jensen and Gopi Kandaswamy and Suresh Marru and Sangmi Lee Pallickara and Satoshi Shirasuna and Yogesh Simmhan and Aleksander Slominski and Yiming Sun}, title = {Service Oriented Architectures for Science Gateways on Grid Systems}, booktitle = {International Conference on Service-Oriented Computing (ICSOC)}, publisher = {Springer Berlin / Heidelberg}, year = {2005}, volume = {3826}, pages = {21--32}, note = {[CORE A]}, doi = {https://doi.org/10.1007/11596141_3} } |
||||||
Gannon:clade:2004 | Gannon, D.; Krishnan, S.; Fang, L.; Kandaswamy, G.; Simmhan, Y. & Slominski, A. IEEE |
On Building Parallel and Grid Applications: Component Technology and Distributed Services
|
2004 | International Workshop on Challenges of Large Applications in Distributed Environments (CLADE) , pp. 44-51 | inproceedings | iu, grid, web service, escience, component, peer reviewed |
Abstract: Software Component Frameworks are well known in the commercial business application world and now this technology is being explored with great interest as a way to build large-scale scientific application on parallel computers. In the case of Grid systems, the current architectural model is based on the emerging web services framework. In this paper we describe progress that has been made on the Common Component Architecture model (CCA) and discuss its success and limitations when applied to problems in Grid computing. Our primary conclusion is that a component model fits very well with a services-oriented Grid, but the model of composition must allow for a very dynamic (both in space and it time) control of composition. We note that this adds a new dimension to conventional service workflow and it extends the “Inversion of Control” aspects of must component systems. | ||||||
BibTeX:
@inproceedings{Gannon:clade:2004, author = {Dennis Gannon and Sriram Krishnan and Liang Fang and Gopi Kandaswamy and Yogesh Simmhan and Aleksander Slominski}, title = {On Building Parallel and Grid Applications: Component Technology and Distributed Services}, booktitle = {International Workshop on Challenges of Large Applications in Distributed Environments (CLADE)}, year = {2004}, pages = {44--51}, note = {[CORE C]}, doi = {https://doi.org/10.1109/CLADE.2004.1309091} } |
||||||
Gannon:dbgs:2003 | Gannon, D.; Christie, M.; Chipara, O.; Fang, L.; Farrellee, M.; Kandaswamy, G.; Lu, W.; Plale, B.; Slominski, A.; Sarangi, A. & Simmhan, Y.L. |
Building Grid Services for User Portals
|
2003 | Workshop on Designing and Building Grid Services (DBGS) | inproceedings | iu, portal, grid, web service, escience, peer reviewed |
BibTeX:
@inproceedings{Gannon:dbgs:2003, author = {Dennis Gannon and Marcus Christie and Octav Chipara and Liang Fang and Matthew Farrellee and Gopi Kandaswamy and Wei Lu and Beth Plale and Aleksander Slominski and Anuraag Sarangi and Yogesh L. Simmhan}, title = {Building Grid Services for User Portals}, booktitle = {Workshop on Designing and Building Grid Services (DBGS)}, publisher = {GGF}, year = {2003}, url = {http://www.mcs.anl.gov/ keahey/DBGS/DBGS_files/dbgs_papers/gannon.pdf} } |
||||||
Gannon:cluster:2002 | Gannon, D.; Bramley, R.; Fox, G.; Smallen, S.; Rossi, A.; Ananthakrishnan, R.; Bertrand, F.; Chiu, K.; Farrellee, M.; Govindaraju, M.; Krishnan, S.; Ramakrishnan, L.; Simmhan, Y.; Slominski, A.; Ma, Y.; Olariu, C. & Rey-Cenvaz, N. |
Programming the Grid: Distributed Software Components, P2P and Grid Web Services for Scientific Applications
|
2002 |
Cluster Computing Vol. 5 (3) , pp. 325-336 |
article | iu, component, grid, web service, escience, peer reviewed |
Abstract: Computational Grids have become an important asset in large-scale scientific and engineering research. By providing a set of services that allow a widely distributed collection of resources to be tied together into a relatively seamless computing framework, teams of researchers can collaborate to solve problems that they could not have attempted before. Unfortunately the task of building Grid applications remains extremely difficult because there are few tools available to support developers. To build reliable and re-usable Grid applications, programmers must be equipped with a programming framework that hides the details of most Grid services and allows the developer a consistent, non-complex model in which applications can be composed from well tested, reliable sub-units. This paper describes experiences with using a software component framework for building Grid applications. The framework, which is based on the DOE Common Component Architecture (CCA), allows individual components to export function/service interfaces that can be remotely invoked by other components. The framework also provides a simple messaging/event system for asynchronous notification between application components. The paper also describes how the emerging Web-Services model fits with a component-oriented application design philosophy. To illustrate the connection between web services and Grid application programming we describe a simple design pattern for application factory services which can be used to simplify the task of building reliable Grid programs. Finally we address several issues of Grid programming that better understood from the perspective of Peer-to-Peer (P2P) systems. In particular we describe how models for collaboration and resource sharing fit well with many grid application scenarios. | ||||||
BibTeX:
@article{Gannon:cluster:2002, author = {Dennis Gannon and Randall Bramley and Geoffrey Fox and Shava Smallen and Al Rossi and Rachana Ananthakrishnan and Felipe Bertrand and Kenneth Chiu and Matt Farrellee and Madhusudhan Govindaraju and Sriram Krishnan and Lavanya Ramakrishnan and Yogesh Simmhan and Aleksander Slominski and Yu Ma and Caroline Olariu and Nicolas Rey-Cenvaz}, title = {Programming the Grid: Distributed Software Components, P2P and Grid Web Services for Scientific Applications}, journal = {Cluster Computing}, publisher = {Springer Netherlands}, year = {2002}, volume = {5}, number = {3}, pages = {325--336}, note = {[IF 0.519]}, doi = {https://doi.org/10.1023/A:1015633507128} } |
||||||
Krishnan:sciprog:2002 | Krishnan, S.; Bramley, R.; Gannon, D.; Ananthakrishnan, R.; Govindaraju, M.; Slominski, A.; Simmhan, Y.; Alameda, J.; Alkire, R.; Drews, T. & Webb, E. |
The XCAT Science Portal
|
2002 |
Scientific Programming Vol. 10 (4) , pp. 303--317 |
article | iu, component, portal, escience, peer reviewed |
Abstract: This paper describes the design and prototype implementation of the XCAT Grid Science Portal. The portal lets grid application programmers script complex distributed computations and package these applications with simple interfaces for others to use. Each application is packaged as a notebook which consists of webpages and editable parameterized scripts. The portal is a workstation-based specialized personal web server, capable of executing the application scripts and launching remote grid applications for the user. The portal server can receive event streams published by the application and grid resource information published by Network Weather Service(NWS) or Autopilot sensors. Notebooks can be published and stored in web based archives for others to retrieve and modify. The XCAT Grid Science Portal has been tested with various applications, including the distributed simulation of chemical processes in semiconductor manufacturing and collaboratory support for X-ray crystallographers. | ||||||
BibTeX:
@article{Krishnan:sciprog:2002, author = {Sriram Krishnan and Randall Bramley and Dennis Gannon and Rachana Ananthakrishnan and Madhusudhan Govindaraju and Aleksander Slominski and Yogesh Simmhan and Jay Alameda and Richard Alkire and Timothy Drews and Eric Webb}, title = {The XCAT Science Portal}, journal = {Scientific Programming}, publisher = {IOS Press}, year = {2002}, volume = {10}, number = {4}, pages = {303---317}, note = {[IF 0.967]}, url = {https://content.iospress.com/articles/scientific-programming/spr00107} } |
Created by JabRef on 09/10/2020.