ICDE 2013: 29th IEEE International Conference on Data Engineering
Sponsored by the IEEE Computer Society
Sofitel Brisbane Central Hotel, Brisbane, Australia, April 8-11, 2013
Program
This PDF of the conference booklet is for your online reference only. Printed hard copies of this booklet will be available for all delegates at the Registration Desk at the conference.
The ICDE program-at-a-glance is also available.
Monday 8 April
9am - 5pm Workshops
Data Engineering Meets the Semantic Web - DESWEB
Self-Managing Database Systems - SMDB
Privacy-Preserving Data Publication and Analysis - PrivDB
Mobile Data Analytics - MoDA
Data-Driven Decision Guidance and Support Systems - DGSS
Graph Data Management: Techniques and Applications - GDM
Data Management in the Cloud - DMC
Tuesday 9 April
9 - 10:30am
Keynote 1 (Ballroom 1-2)
Re-thinking the Performance of Information Processing Systems
11am - 12:30pm
R1 - Main Memory Databases (Ballroom 1)
CPU and Cache Efficient Management of Memory-Resident Databases
- Holger Pirk (CWI)
- Florian Funke (TU München)
- Martin Grund (HPI)
- Thomas Neumann (Technische Universität München)
- Ulf Leser (HU Berlin)
- Stefan Manegold (CWI)
- Alfons Kemper (Technische Universität München)
- Martin Kersten (CWI)
Identifying Hot and Cold Data in Main-Memory Databases
- Justin Levandoski (Microsoft Research)
- Paul Larson (Microsoft Research)
- Radu Stoica
The Adaptive Radix Tree: ARTful Indexing for Main-Memory Databases
- Viktor Leis (Technische Universität München)
- Alfons Kemper (TUM)
- Thomas Neumann (TUM)
11am - 12:30pm
R2 - MapReduce algorithms (St Germaine)
Finding Connected Components on Map-reduce in Logarithmic Rounds
- Vibhor Rastogi (Google)
- Ashwin Machanavajjhala (Duke University)
- Laukik Chitnis (Google)
- Anish Das Sarma (Google)
Enumerating Subgraph Instances Using Map-Reduce
- Foto Afrati (National Technical University of Athens)
- Dimitris Fotakis
- Jeffrey Ullman
Scalable Maximum Clique Computation Using MapReduce
- Jingen Xiang (University of Waterloo)
- Cong Guo (University of Waterloo)
- Ashraf Aboulnaga (University of Waterloo)
11am - 12:30pm
R3 - Time Travel in Database (Bastille 1)
Ficklebase: Looking into the Future to Erase the Past
- Sumeet Bajaj (Stony Brook University)
- Radu Sion
Time Travel in a Scientific Array Database
- Emad Soroush (University of Washington)
- Magdalena Balazinska (Univ of Washington)
Time Travel in Column Stores
- Martin Kaufmann (ETH Zürich)
- Amin Amiri Manjili (ETH Zürich)
- Stefan Hildenbrand
- Donald Kossmann (ETH Zurich)
- Andreas Tonder (SAP AG)
11am - 12:30pm
R4 - Top-k query in uncertain data (Bastille 2)
Top-k Query Processing in Probabilistic Databases with Non-Materialized Views
- Maximilian Dylla
- Iris Miliaraki (Max-Planck-Institut)
- Martin Theobald
Cleaning Uncertain Data for Top-k Queries
- Luyi Mo (University of Hong Kong)
- Reynold Cheng (University of Hong Kong)
- David Cheung
- Xiang Li
- Xuan Yang (The University of Hong Kong)
Top-K Oracle: A New Way to Present Top-K Tuples for Uncertain Data
- Chunyao Song (University of Massachusetts, Lowell)
- Zheng Li (University of Massachusetts, Lowell)
- Tingjian Ge (University of Massachusetts, Lowell)
11am - 12:30pm
Seminar 1 (Ballroom 2)
Machine Learning on Big Data
- Tyson Condie (Microsoft, USA)
- Paul Mineiro (Microsoft, USA)
- Neoklis Polyzotis (University of California, Santa Cruz)
- Markus Weimer (Microsoft, USA)
11am - 12:30pm
Seminar 2 (Odeon)
Big Data Integration
- Xin Luna Dong (Google Mountain View)
- Divesh Srivastava (AT&T Labs)
11am - 12:30pm
Industry 1 (Concorde)
Big Data Analytics at Facebook
Data Services for E-tailers Leveraging Search Engine Assets
- Tao Cheng (Microsoft Research)
- Kaushik Chakrabarti (Microsoft Research)
- Surajit Chaudhuri (Microsoft Research)
- Vivek Narasayya (Microsoft Research)
- Manoj Syamala (Microsoft Research)
SAP HANA Distributed In-Memory Database System: Transaction, Session, and Metadata Management
- Juchang Lee (SAP Labs, Korea)
- Yong Sik Kwon (SAP Labs, Korea)
- Franz Färber (SAP Labs, Korea)
- Michael Muehle (SAP AG, Germany)
- Chulwon Lee (SAP Labs, Korea)
- Christian Bensberg (SAP AG, Germany)
- Joo Yeon Lee (SAP Labs, Korea)
- Arthur H. Lee (SAP & Claremont McKenna College, USA)
- Wolfgang Lehner (Dresden University of Technology)
2pm - 3:30pm
R5 - Uncertainty in spatial data (Ballroom 1)
Voronoi-based Nearest Neighbor Search for Multi-Dimensional Uncertain Databases
- Peiwu Zhang (The University of Hong Kong)
- Reynold Cheng (The University of Hong Kong)
- Nikos Mamoulis (The University of Hong Kong)
- Matthias Renz (Ludwig-Maximilians-Universität München)
- Andreas Züfle (Ludwig-Maximilians-Universität München)
- Yu Tang (The University of Hong Kong)
- Tobias Emrich (Ludwig-Maximilians-Universität München)
Interval Reverse Nearest Neighbor Queries on Uncertain Data with Markov Correlations
- Chuanfei Xu (Northeastern University)
- Yu Gu
- Lei Chen (Hong Kong University of Science and Technology)
- Jianzhong Qiao
- Ge Yu (Northeastern University)
Efficient Tracking and Querying for Coordinated Uncertain Mobile Objects
- Nicholas Larusso (UCSB)
- Ambuj Singh (UCSB)
2pm - 3:30pm
R6 - Data extraction (St Germaine)
Attribute Extraction and Scoring: A Probabilistic Approach
- Taesung Lee (POSTECH)
- Zhongyuan Wang (Microsoft)
- Haixun Wang (Microsoft)
- Seung-won Hwang (POSTECH)
TYPifier: Inferring the Type Semantics of Structured Data
- Yongtao Ma (AIFB, Karlsruhe Institute of Technology)
- Thanh Tran (AIFB)
- Veli Bizer (IBM)
SUSIE: Search Using Services and Information Extraction
- Nicoleta Preda (University of Versailles)
- Fabian Suchanek (Max Planck Institute for Informatics)
- Wenjun Yuan (Université de Versailles)
- Gerhard Weikum (Max Planck Institute for Informatics)
2pm - 3:30pm
R7 - Trajectory databases (Bastille 1)
Towards Efficient Search for Activity Trajectories
- Kai Zheng (The University of Queensland)
- Shuo Shang (Aalborg University)
- Jing Yuan (Microsoft Research Asia)
- Yi Yang (Carnegie Mellon University)
On Discovery of Gathering Patterns from Trajectories
- Kai Zheng (The University of Queensland)
- Yu Zheng (Microsoft Research Asia)
- Jing Yuan (Microsoft Research Asia)
- Shuo Shang (Aalborg University)
Destination Prediction by Sub-Trajectory Synthesis and Privacy Protection Against Such Prediction
- Andy Yuan Xue (The University of Melbourne)
- Rui Zhang (University of Melbourne)
- Yu Zheng (Microsoft Research Asia)
- Xing Xie (Microsoft Research Asia)
- Jin Huang (The University of Melbourne)
- Zhenghua Xu (The University of Melbourne)
2pm - 3:30pm
R8 - Social networks (Bastille 2)
Scalable and Parallelizable Processing of Influence Maximization for Large-Scale Social Networks
- Jinha Kim (POSTECH)
- Seung-Keol Kim
- Hwanjo Yu (Pohang University of Science and Technology)
SociaLite: Datalog Extensions for Efficient Social Network Analysis
- Jiwon Seo (Stanford)
- Stephen Guo (Stanford)
- Monica Lam (Stanford)
LinkProbe: Probabilistic Inference on Large-Scale Social Networks
- Haiquan Chen (Valdosta State University)
- Wei-Shinn Ku (Auburn University)
- Haixun Wang (Microsoft)
- Liang Tang (Auburn University)
- Min-Te Sun (National Central University)
2pm - 3:30pm
Seminar 3 (Odeon)
Workload Management for Big Data Analytics
- Ashraf Aboulnaga (University of Waterloo)
- Shivnath Babu (Duke University)
2pm - 3:30pm
Industry 2 (Concorde)
HFMS: Managing the Lifecycle and Complexity of Hybrid Analytic Data Flows
- Alkis Simitsis (Hewlett-Packard Laboratories)
- Kevin Wilkinson (Hewlett-Packard Laboratories)
- Umeshwar Dayal (Hewlett-Packard Laboratories)
- Meichun Hsu (Hewlett-Packard Laboratories)
KuaFu: Closing the Parallelism Gap in Database Replication
- Chuntao Hong (Microsoft Research Asia)
- Mao Yang (Microsoft Research Asia)
- Lintao Zhang (Microsoft Research Asia)
- Lidong Zhou (Microsoft Research Asia)
- Dong Zhou (Tsinghua University)
- Carbo Kuo (Tsinghua University)
Materialization Strategies in the Vertica Analytic Database: Lessons Learned
- Lakshmikant Shrinivas (Vertica)
- Sreenath Bodagala (Vertica)
- Ramakrishna Varadarajan (Vertica)
- Ariel Cary (Vertica)
- Vivek Bharathan (Vertica)
- Chuck Bear (Vertica)
2pm - 3:30pm
Demo Groups 1 & 2 (Ballroom 2)
Twitter+: Build Personalized Newspaper For Twitter
- Chen Liu (National University of Singapore)
- Anthony K. H. Tung (National University of Singapore)
A Generic Database Benchmarking Service
- Martin Kaufmann (ETH Zürich/SAP AG)
- Peter M. Fischer (Albert-Ludwigs-Universität Freiburg)
- Donald Kossmann (ETH Zürich)
- Norman May (SAP AG)
Aeolus: An Optimizer for Distributed Intra-Node-Parallel Streaming Systems
- Matthias J. Sax (Humboldt-Universität zu Berlin)
- Malu Castellanos (Hewlett-Packard Laboratories)
- Qiming Chen (Hewlett-Packard Laboratories)
- Meichun Hsu (Hewlett-Packard Laboratories)
Crowd-Answering System via Microblogging
- Xianke Zhou (Zhejiang University)
- Ke Chen (Zhejiang University)
- Sai Wu (Zhejiang University)
- Bingbing Zhang (Zhejiang University)
With a Little Help from My Friends
- Arnab Nandi (The Ohio State University)
- Stelios Paparizos (Microsoft Research)
- John Shafer (Microsoft Research)
- Rakesh Agrawal (Microsoft Research)
Peeking into the Optimization of Data Flow Programs with MapReduce-style UDFs
- Fabian Hueske (Technische Universität Berlin)
- Mathias Peters (Humboldt Universität zu Berlin)
- Aljoscha Krettek (Technische Universität Berlin)
- Matthias Ringwald (Technische Universität Berlin)
- Kostas Tzoumas (Technische Universität Berlin)
- Volker Markl (Technische Universität Berlin)
- Johann-Christoph Freytag (Humboldt Universität zu Berlin)
Very Fast Estimation for Result and Accuracy of Big Data Analytics: the EARL System
- Nikolay Laptev (University of California, Los Angeles)
- Kai Zeng (University of California, Los Angeles)
- Carlo Zaniolo (University of California, Los Angeles)
Road Network Mix-zones for Anonymous Location Based Services
- Balaji Palanisamy (Georgia Institute of Technology)
- Sindhuja Ravichandran (Georgia Institute of Technology)
- Ling Liu (Georgia Institute of Technology)
- Binh Han (Georgia Institute of Technology)
- Kisung Lee (Georgia Institute of Technology)
Query Time Scaling of Attribute Values in Interval Timestamped Databases
- Anton Dignös
- Michael Böhlen (University of Zurich)
- Johann Gamper (Free University of Bozen-Bolzano)
Extracting Interesting Related Context-dependent Concepts from Social Media Streams using Temporal Distributions
- Craig P. Sayers (Hewlett-Packard Labs)
- Meichun Hsu (Hewlett-Packard Labs)
VERDICT: Privacy-Preserving Authentication of Range Queries in Location-based Services
- Haibo Hu (Hong Kong Baptist University)
- Qian Chen (Hong Kong Baptist University)
- Jianliang Xu (Hong Kong Baptist University)
ExpFinder: Finding Experts by Graph Pattern Matching
- Wenfei Fan (University of Edinburgh / Harbin Institute of Technology)
- Xin Wang (Harbin Institute of Technology)
- Yinghui Wu (UC Santa Barbara)
Tajo: A Distributed Data Warehouse System on Large Clusters
- Hyunsik Choi
- Jihoon Son
- Haemi Yang
- Hyoseok Ryu
- Byungnam Lim
- Soohyung Kim
- Yon Dohn Chung (Korea University)
4pm - 5:30pm
R9 - Indexing structures (Ballroom 1)
The Bw-Tree: A B-tree for New Hardware Platforms
- Justin Levandoski (Microsoft Research)
- David Lomet (Microsoft Research)
- Sudipta Sengupta (Microsoft Research)
Secure and Efficient Range Queries on Outsourced Databases Using $\widehat{R}$-trees
- Peng Wang (UC Riverside)
- Chinya Ravishankar (UC Riverside)
An Efficient and Compact Indexing Scheme for Large-scale Data Store
- Peng Lu (National University of Singapore)
- Sai Wu (Zhejiang University)
- Lidan Shou (Zhejiang University)
- Kian-Lee Tan (National University of Singapore)
4pm - 5:30pm
R10 - Main memory query processing (St Germaine)
Recycling in Pipelined Query Evaluation
- Fabian Nagel (The University of Edinburgh)
- Peter Boncz (CWI)
- Stratis Viglas (The University of Edinburgh)
Efficient Many-Core Query Execution in Main Memory Column-Stores
- Jonathan Dees (SAP)
- Peter Sanders (KIT - Karlsruher Institut für Technologie)
Main-Memory Hash Joins on Multi-Core CPUs: Tuning to the Underlying Hardware
- Cagri Balkesen (ETH Zurich)
- Jens Teubner (ETH Zurich)
- Gustavo Alonso (ETH Zurich)
- M. Tamer Özsu (University of Waterloo)
4pm - 5:30pm
R11 - Data mining I (Bastille 1)
Coupled Clustering Ensemble: Incorporating Coupling Relationships Both between Base Clusterings and Objects
- Can Wang (AAI, UTS)
- Zhong She (AAI, UTS)
- Longbing Cao (UTS)
Focused Matrix Factorization For Audience Selection in Display Advertising
- Bhargav Kanagal (Google)
- Amr Ahmed (Yahoo! Research)
- Sandeep Pandey (Twitter)
- Vanja Josifovski (Google)
- Lluis Garcia-Pueyo (Yahoo! Research)
- Jeff Yuan (Yahoo! Research)
Graph Stream Classification using Labeled and Unlabeled Graphs
- Shirui Pan (Univ. Technology Sydney)
- Xingquan Zhu
- Chengqi Zhang (QCIS, FEIT, UTS)
- Philip Yu (UIC)
4pm - 5:30pm
R12 - Moving objects (Bastille 2)
T-Share: A Large-Scale Dynamic Taxi Ridesharing Service
- Shuo Ma (Univ. of Illinois at Chicago)
- Yu Zheng (Microsoft Research Asia)
- Ouri Wolfson (University of Illinois at Chicago)
Efficient Notification of Meeting Points for Moving Groups via Independent Safe Regions
- Jing Li (The University of Hong Kong)
- Man Lung Yiu (Hong Kong Polytechnic University)
- Nikos Mamoulis (University of Hong Kong)
Efficient Distance-Aware Query Evaluation on Indoor Moving Objects
- Xike Xie (AAU)
- Hua Lu (Aalborg University)
- Torben Pedersen (AAU)
4pm - 5:30pm
Seminar 4 (Odeon)
Knowledge Harvesting from Text and Web Sources
- Fabian Suchanek
- Gerhard Weikum (Max Planck Institute for Informatics)
4pm - 5:30pm
Industry 3 (Concorde)
Pipe Break Prediction: A Data Mining Method
- Rui Wang (University of Science & Technology of China)
- Weishan Dong (IBM Research – China)
- Yu Wang (IBM Research – China)
- Ke Tang (University of Science & Technology of China)
- Xin Yao (The University of Birmingham)
SASH: Enabling Continuous Incremental Analytic Workflows on Hadoop
- Manish Sethi
- Narendran Sachindran
- Sriram Raghavan (IBM)
Automating Pattern Discovery for Rule Based Data Standardization Systems
- Snigdha Chaturvedi
- Hima Prasad K
- Tanveer Faruqie
- Bhupesh Chawda
- L Venkata Subramaniam
- Raghuram Krishnapuram (IBM Research-India)
4pm - 5:30pm
Demo Groups 1 & 2 (Ballroom 2)
Twitter+: Build Personalized Newspaper For Twitter
- Chen Liu
- Anthony K. H. Tung (National University of Singapore)
A Generic Database Benchmarking Service
- Martin Kaufmann (ETH Zürich/SAP AG)
- Peter M. Fischer (Albert-Ludwigs-Universität Freiburg)
- Donald Kossmann (ETH Zürich)
- Norman May (SAP AG)
Aeolus: An Optimizer for Distributed Intra-Node-Parallel Streaming Systems
- Matthias J. Sax (Humboldt-Universität zu Berlin)
- Malu Castellanos
- Qiming Chen
- Meichun Hsu (Hewlett-Packard Laboratories)
Crowd-Answering System via Microblogging
- Xianke Zhou
- Ke Chen
- Sai Wu
- Bingbing Zhang (Zhejiang University)
With a Little Help from My Friends
- Arnab Nandi (The Ohio State University)
- Stelios Paparizos
- John Shafer
- Rakesh Agrawal (Microsoft Research)
Peeking into the Optimization of Data Flow Programs with MapReduce-style UDFs
- Fabian Hueske (Technische Universität Berlin)
- Mathias Peters (Humboldt Universität zu Berlin)
- Aljoscha Krettek
- Matthias Ringwald
- Kostas Tzoumas
- Volker Markl (Technische Universität Berlin)
- Johann-Christoph Freytag (Humboldt Universität zu Berlin)
Very Fast Estimation for Result and Accuracy of Big Data Analytics: the EARL System
- Nikolay Laptev
- Kai Zeng
- Carlo Zaniolo (University of California, Los Angeles)
Road Network Mix-zones for Anonymous Location Based Services
- Balaji Palanisamy
- Sindhuja Ravichandran
- Ling Liu
- Binh Han
- Kisung Lee (Georgia Institute of Technology)
Query Time Scaling of Attribute Values in Interval Timestamped Databases
- Anton Dignös
- Michael Böhlen (University of Zurich)
- Johann Gamper (Free University of Bozen-Bolzano)
Extracting Interesting Related Context-dependent Concepts from Social Media Streams using Temporal Distributions
- Craig P. Sayers
- Meichun Hsu (Hewlett-Packard Labs)
VERDICT: Privacy-Preserving Authentication of Range Queries in Location-based Services
- Haibo Hu
- Qian Chen
- Jianliang Xu (Hong Kong Baptist University)
ExpFinder: Finding Experts by Graph Pattern Matching
- Wenfei Fan (University of Edinburgh / Harbin Institute of Technology)
- Xin Wang (Harbin Institute of Technology)
- Yinghui Wu (UC Santa Barbara)
Tajo: A Distributed Data Warehouse System on Large Clusters
- Hyunsik Choi
- Jihoon Son
- Haemi Yang
- Hyoseok Ryu
- Byungnam Lim
- Soohyung Kim
- Yon Dohn Chung (Korea University)
Wednesday 10 April
9am - 10am
Keynote 2 (Ballroom Le Grand)
Recent Advances on Structured Data and the Web
- Alon Halevy (Google Inc.)
10:30am - 12pm
R13 - Data cleaning (St Germaine)
HANDS: A Heuristically Arranged Non-Backup In-line Deduplication System
- Avani Gadani (UC Santa Cruz)
- Ethan Miller
- Ohad Rodeh
Holistic Data Cleaning: Putting Violations Into Context
- Xu Chu (University of Waterloo)
- Ihab Ilyas (Qatar Computing Research Institute)
- Paolo Papotti (QCRI)
Inferring Data Currency and Consistency for Conflict Resolution
- Wenfei Fan
- Floris Geerts
- Nan Tang (Qatar Computing Research Institute)
- Wenyuan Yu (University of Edinburgh)
10:30am - 12pm
R14 - Social media I (Bastille 1)
LSII: An Indexing Structure for Exact Real-Time Search on Microblogs
- Lingkun Wu (Nanyang Technological University)
- Xiaokui Xiao (Nanyang Technological University)
- Yabo Xu (Sun Yat-sen University)
- Wenqing Lin (NTU, Singapore)
Utilizing Social Pressure in Recommender Systems
- Hu Kailun (NUS)
- Wynne Hsu (NUS)
- Mong Li Lee (NUS)
Presenting Diverse Location Views with Real-time Near-duplicate Photo Elimination
- Jiajun Liu (The University of Queensland)
- Zi Huang (The University of Queensland)
- Heng Tao Shen (The University of Queensland)
- Hong Cheng (The Chinese University of Hong Kong)
- Yueguo Chen (Renmin University of China)
10:30am - 12pm
R15 - Data trust (Bastille 2)
Publicly Verifiable Grouped Aggregation Queries on Outsourced Data Streams
- Suman Nath (Microsoft Research)
- Ramarathnam Venkatesan (Microsoft Research)
Trustworthy Data from Untrusted Databases
- Rohit Jain (Purdue University)
- Sunil Prabhakar (Purdue University)
On the Relative Trust between Inconsistent Data and Inaccurate Constraints
- George Beskales (QCRI)
- Ihab Ilyas (Qatar Computing Research Institute, Qatar)
- Lukasz Golab (University of Waterloo)
- Artur Galiullin (University of Waterloo)
10:30am - 12pm
R16 - Data on the cloud (Concorde)
Catch the Wind: Graph Workload Balancing on Cloud
- Zechao Shang (The Chinese University of Hong Kong)
- Jeffrey Xu Yu (Chinese University of Hong Kong)
EAGRE: Towards Scalable I/O Efficient SPARQL Query Evaluation on the Cloud
- Xiaofei Zhang (HKUST)
- Lei Chen (Hong Kong University of Science and Technology)
- Yongxin Tong (HKUST)
- Min Wang (HP Labs)
C-Cube: Elastic Continuous Clustering in the Cloud
- Zhenjie Zhang (ADSC, SG)
- Hu Shu (Huawei Inc.)
- Zhihong Chong (Southeast University, China)
- Hua Lu (Aalborg University)
- Yin Yang (Advanced Digital Sciences Center)
10:30am - 12pm
Seminar 5 (Odeon)
Sorting in Space: Multidimensional, Spatial, and Metric Data Structures for Applications in Spatial Databases, Geographic Information Systems (GIS), and Location-based Services
- Hanan Samet (University of Maryland)
12pm - 2pm
SAP Business Lunch (Ballroom Le Grand)
ICDE Award Presentations
2pm - 3pm
Keynote 4 (Ballroom Le Grand)
10 Year Most Influential Papers
Schema Mediation in Peer Data Management Systems [ICDE 2003]
- Alon Y. Halevy
- Zachary G. Ives
- Dan Suciu
- Igor Tatarinov
Similarity flooding: a versatile graph matching algorithm and its application to schema matching [ICDE 2002]
- S. Melnik
- H. Garcia-Molina
- E. Rahm
3:30pm - 5pm
R17 - Similarity ranking (St Germaine)
Efficient Search Algorithm for SimRank
- Yasuhiro Fujiwara (NTT)
- Makoto Nakatsuji (NTT)
- Hiroaki Shiokawa (NTT)
- Makoto Onizuka
Towards Efficient SimRank Computation on Large Graphs
- Weiren Yu (UNSW)
- Xuemin Lin (University of New South Wales)
- Wenjie Zhang (UNSW)
RoundTripRank: Graph-based Proximity with Importance and Specificity
- Yuan Fang (University of Illinois at Urbana-Champaign)
- Kevin Chang (UIUC)
- Hady Lauw (Singapore Management University)
3:30pm - 5pm
R18 - Spatial databases (Bastille 1)
Finding Distance-Preserving Subgraphs in Large Road Networks
- Da Yan (HKUST)
- James Cheng (CUHK)
- Wilfred Ng (HKUST)
- Kin Sum Liu (HKUST)
Maximum Visibility Queries in Spatial Databases
- Sarah Masud (BUET)
- Farhana Murtaza Choudhury (BUET)
- Mohammed Eunus Ali (BUET)
- Sarana Nutanong (University of Maryland)
Memory-Efficient Algorithms for Spatial Network Queries
- Sarana Nutanong (University of Maryland)
- Hanan Samet (University of Maryland)
3:30pm - 5pm
R19 - Social media II (Bastille 2)
A Unified Model for Stable and Temporal Topic Detection from Social Media Data
- Hongzhi Yin (Peking University)
- Bin Cui (Peking University)
- Hua Lu (Aalborg University)
- Yuxin Huang
- Junjie Yao
Crowdsourced Enumeration Queries
- Katherine Trushkowsky (UC Berkeley)
- Tim Kraska (University of California Berkeley)
- Michael Franklin
- Purnamrita Sarkar
On Incentive-based Tagging
- Xuan Yang (The University of Hong Kong)
- Reynold Cheng (University of Hong Kong)
- Luyi Mo (University of Hong Kong)
- Benjamin Kao (The University of Hong Kong)
- David Cheung
3:30pm - 5pm
R20 - Trees and XML (Concorde)
Ontology-based subgraph querying
- Yinghui Wu (UCSB)
- Shengqi Yang (University of California, Santa Barbara)
- Xifeng Yan (UC Santa Barbara)
Stratification Driven Placement of Complex Data: A Framework for Distributed Data Analytics
- Ye Wang (Ohio State)
- Srinivasan Parthasarathy (The Ohio State University)
- P Sadayappan (Ohio State)
Optimizing Approximations of Query Lineage in Probabilistic XML
- Asma Souihli (Institut Mines-Telecom)
- Pierre Senellart (Institut Mines-Telecom)
3:30pm - 5pm
Seminar 6 (Odeon)
Triples in the clouds
- Zoi Kaoudi
- Ioana Manolescu (Inria Saclay - Île de France)
7pm - 10pm
Conference Banquet (Ballroom Le Grand)
Thursday 11 April
9am - 10am
Keynote 3 (Ballroom 1-2)
Hardware Killed the Software Star
- Gustavo Alonso (ETH Zurich)
10:30am - 12:00pm
R21 - Security and privacy (St Germaine)
Secure Nearest Neighbor Revisited
- Bin Yao (Shanghai Jiao Tong University)
- Feifei Li (University of Utah)
- Xiaokui Xiao (Nanyang Technological University)
Accurate and Efficient Private Release of Datacubes and Contingency Tables
- Graham Cormode (AT&T Labs Research)
- Cecilia Procopiuc (AT&T Labs-Research)
- Divesh Srivastava (AT&T Labs)
- Grigory Yaroslavtsev
Differentially Private Grids for Geospatial Data
- Wahbeh Qardaji (Purdue)
- Weining Yang (Purdue University)
- Ninghui Li (Purdue University)
10:30am - 12:00pm
R22 - Randomized algorithms for graphs (Bastille 1)
Faster Random Walks By Rewiring Online Social Networks On-The-Fly
- Zhuojie Zhou (George Washington University)
- Nan Zhang (George Washington University)
- Zhiguo Gong (University of Macau)
- Gautam Das (UT Arlington and QCRI)
Sampling Node Pairs Over Graphs
- Pinghui Wang (CUHK)
- Junzhou Zhao (Xi'an Jiaotong University)
- John Chi Shing Lui (CUHK)
- Don Towsley (University of Massachusetts Amherst)
- Xiaohong Guan (Xi'an Jiaotong University)
Link Prediction across Networks by Biased Cross-Network Sampling
- Guo-Jun Qi (UIUC)
- Charu Aggarwal (IBM)
- Thomas Huang (ECE UIUC)
10:30am - 12:00pm
R23 - Distributed data processing (Bastille 2)
Interval Indexing and Querying on Key-Value Cloud Stores
- George Sfakianakis (University of Patras)
- Ioannis Patlakas (University of Patras)
- Nikos Ntarmos (University of Patras)
- Peter Triantafillou (University of Patras)
Robust Distributed Stream Processing
- Chuan Lei (WPI)
- Elke Rundensteiner (WPI)
- Joshua Guttman (WPI)
10:30am - 12:00pm
R24 - Data mining II (Concorde)
Learning to Rank from Distant Supervision: Exploiting
- Mianwei Zhou (UIUC)
- Hongning Wang (UIUC)
- Kevin Chang (UIUC)
AFFINITY: Efficiently Querying Statistical Measures on Time-Series Data
- Saket Sathe (EPFL)
- Karl Aberer (EPFL)
Forecasting the Data Cube: A Model Configuration Advisor for Multi-Dimensional Data Sets
- Ulrike Fischer (TU Dresden)
- Christopher Schildt
- Claudio Hartmann
- Wolfgang Lehner (Dresden University of Technology)
10:30am - 12pm
Seminar 7 (Odeon)
Querying Encrypted Data
- Arvind Arasu
- Ken Eguro
- Raghav Kaushik
- Ravi Ramamurthy (Microsoft Research)
10:30am - 12pm
Panel (Ballroom 1-2)
Big Data for the Public
Moderator:
- Dimitrios Georgakopoulos (CSIRO, Australia)
Panelists:
- Karl Aberer (EPFL, Switzerland)
- Ashraf Aboulnaga (U. Waterloo, Canada)
- Kevin Chang (UIUC, USA)
- Xin Luna Dong (Google Mountain View)
10:30am - 12pm
Demo Groups 3 & 4 (Ballroom 3)
Πgora: An Integration System for Probabilistic Data
- Dan Olteanu
- Lampros Papageorgiou
- Sebastiaan J. van Schaik (Oxford)
Complex Pattern Matching in Complex Structures: the XSeq Approach
- Kai Zeng
- Mohan Yang (University of California, Los Angeles)
- Barzan Mozafari (Massachusetts Institute of Technology)
- Carlo Zaniolo (University of California, Los Angeles)
T-Music: A Melody Composer based on Frequent Pattern Mining
- Cheng Long
- Raymond Chi-Wing Wong
- Raymond Ka Wai Sze (The Hong Kong University of Science and Technology)
SHARE: Secure information sHaring frAmework for emeRgency management
- Barbara Carminati
- Elena Ferrari
- Michele Guglielmi (University of Insubria)
KORS: Keyword-aware Optimal Route Search System
- Xin Cao
- Lisi Chen
- Gao Cong (Nanyang Technological University)
- Jihong Guan (Tongji University)
- Nhan-Tue Phan
- Xiaokui Xiao (Nanyang Technological University)
CrowdPlanr: Planning Made Easy with Crowd
- Ilia Lotosh
- Tova Milo
- Slava Novgorodov (Tel-Aviv University)
ASVTDECTOR: A Practical Near Duplicate Video Retrieval System
- Xiangmin Zhou (CSIRO)
- Lei Chen (Hong Kong University of Science and Technology)
YumiInt - A Deep Web Integration System for Local Search Engines for Geo-referenced Objects
- Eduard Dragut (Purdue University)
- B. P. Beirne
- A. Neyestani
- B. Atassi
- Clement Yu
- Bhaskar DasGupta (University of Illinois at Chicago)
- Weiyi Meng (Binghamton University)
A Demonstration of the G* Graph Database System
- Sean R. Spillane
- Jeremy Birnbaum
- Daniel Bokser
- Daniel Kemp
- Alan Labouseur
- Paul W. Olsen Jr.
- Jayadevan Vijayan
- Jeong-Hyon Hwang (University at Albany - State University of New York)
RECODS: Replica Consistency-On-Demand Store
- Yuqing Zhu (Tsinghua University)
- Philip S. Yu (University of Illinois at Chicago)
- Jianmin Wang (Tsinghua University)
SODIT: An Innovative System for Outlier Detection using Multiple Localized Thresholding and Interactive Feedback
- Ji Zhang
- Hua Wang
- Xiaohui Tao (University of Southern Queensland)
COLA: A Cloud-based System for Online Aggregation
- Yantao Gan
- Xiaofeng Meng
- Yingjie Shi (Renmin University of China)
RoadAlarm: a Spatial Alarm System on Road Networks
- Kisung Lee
- Ling Liu
- Binh Han
- Balaji Palanisamy (Georgia Institute of Technology)
Real-time Abnormality Detection System for Intensive Care Management
- Jing He
- Guangyan Huang
- Zhi Qiao (Victoria University)
- Michael Steyn
- Kersi Taraporewalla (Royal Brisbane and Women's Hospital)
- Jie Cao (Nanjing University of Finance and Economics)
1:30pm - 3pm
R25 - Lineage and provenance (St Germaine)
SubZero: a Fine-Grained Lineage System for Scientific Databases
- Eugene Wu (MIT)
- Samuel Madden (MIT)
- Michael Stonebraker
Logical Provenance in Data-Oriented Workflows
- Robert Ikeda (Stanford University)
- Akash Das Sarma
- Jennifer Widom
Revision Provenance in Text Documents of Asynchronous Collaboration
- Jing Zhang (University of Michigan)
- H.V. Jagadish
1:30pm - 3pm
R26 - Similarity search (Bastille 1)
Inverted Linear Quadtree : Efficient Top K Spatial Keyword Search
- Chengyuan Zhang (UNSW)
- Ying Zhang (University of New South Wales)
- Wenjie Zhang (University of New South Wales)
- Xuemin Lin (University of New South Wales)
Similarity Query Processing for Probabilistic Sets
- Ming Gao (ECNU)
- Cheqing Jin
- Wei Wang
- Xuemin Lin (University of New South Wales)
- Aoying Zhou
Top-k String Similarity Search with Edit-Distance Constraints
- Dong Deng (Tsinghua University)
- Guoliang Li (Tsinghua University)
- Jianhua Feng (Tsinghua University)
- Wen-Syan Li
1:30pm - 3pm
R27 - Shortest and direct query (Bastille 2)
On Shortest Unique Substring Queries
- Jian Pei (Simon Fraser University)
- Wush Chi-Hsuan Wu (Academia Sinica)
- Mi-Yen Yeh (Academia Sinica)
Engineering Generalized Shortest Path Queries
- Michael Rice (UCR)
- Vassilis Tsotras (UCR)
Efficient Direct Search on Compres
- Xiaochun Yang (Northeastern University)
- Bin Wang (Northeastern University)
- Chen Li (UC Irvine)
- Jiaying Wang (Northeastern University)
- Xiaohui Xie (UCI)
1:30pm - 3pm
R28 - Skyline and snapshot query (Concorde)
On Answering Why-not Questions in Reverse Skyline Queries
- Md. Saiful Islam (Swinburne Univ. of Technology)
- Rui Zhou (Swinburne Univ. Technology)
- Chengfei Liu (Swinburne Univ. Technology)
Layered Processing of Skyline-Window-Join (SWJ) Queries using Iteration-Fabric
- Mithila Nagendra (Arizona State University)
- K. Selcuk Candan (Arizona State University)
Efficient Snapshot Retrieval over Historical Graph Data
- Udayan Khurana (University of Maryland)
- Amol Deshpande (University of Maryland)
1:30pm - 3pm
Seminar 8 (Odeon)
Shallow Information Extraction for the Knowledge Web
- Denilson Barbosa (University of Alberta)
- Haixun Wang (Microsoft Research Asia)
- Cong Yu (Google Research New York)
1:30pm - 3pm
Demo Groups 3 & 4 (Ballroom 3)
Πgora: An Integration System for Probabilistic Data
- Dan Olteanu
- Lampros Papageorgiou
- Sebastiaan J. van Schaik (Oxford)
Complex Pattern Matching in Complex Structures: the XSeq Approach
- Kai Zeng
- Mohan Yang (University of California, Los Angeles)
- Barzan Mozafari (Massachusetts Institute of Technology)
- Carlo Zaniolo (University of California, Los Angeles)
T-Music: A Melody Composer based on Frequent Pattern Mining
- Cheng Long
- Raymond Chi-Wing Wong
- Raymond Ka Wai Sze (The Hong Kong University of Science and Technology)
SHARE: Secure information sHaring frAmework for emeRgency management
- Barbara Carminati
- Elena Ferrari
- Michele Guglielmi (University of Insubria)
KORS: Keyword-aware Optimal Route Search System
- Xin Cao
- Lisi Chen
- Gao Cong (Nanyang Technological University)
- Jihong Guan (Tongji University)
- Nhan-Tue Phan
- Xiaokui Xiao (Nanyang Technological University)
CrowdPlanr: Planning Made Easy with Crowd
- Ilia Lotosh
- Tova Milo
- Slava Novgorodov (Tel-Aviv University)
ASVTDECTOR: A Practical Near Duplicate Video Retrieval System
- Xiangmin Zhou (CSIRO)
- Lei Chen (Hong Kong University of Science and Technology)
YumiInt - A Deep Web Integration System for Local Search Engines for Geo-referenced Objects
- Eduard Dragut (Purdue University)
- B. P. Beirne
- A. Neyestani
- B. Atassi
- Clement Yu
- Bhaskar DasGupta (University of Illinois at Chicago)
- Weiyi Meng (Binghamton University)
A Demonstration of the G* Graph Database System
- Sean R. Spillane
- Jeremy Birnbaum
- Daniel Bokser
- Daniel Kemp
- Alan Labouseur
- Paul W. Olsen Jr.
- Jayadevan Vijayan
- Jeong-Hyon Hwang (University at Albany - State University of New York)
RECODS: Replica Consistency-On-Demand Store
- Yuqing Zhu (Tsinghua University)
- Philip S. Yu (University of Illinois at Chicago)
- Jianmin Wang (Tsinghua University)
SODIT: An Innovative System for Outlier Detection using Multiple Localized Thresholding and Interactive Feedback
- Ji Zhang
- Hua Wang
- Xiaohui Tao (University of Southern Queensland)
COLA: A Cloud-based System for Online Aggregation
- Yantao Gan
- Xiaofeng Meng
- Yingjie Shi (Renmin University of China)
RoadAlarm: a Spatial Alarm System on Road Networks
- Kisung Lee
- Ling Liu
- Binh Han
- Balaji Palanisamy (Georgia Institute of Technology)
Real-time Abnormality Detection System for Intensive Care Management
- Jing He
- Guangyan Huang
- Zhi Qiao (Victoria University)
- Michael Steyn
- Kersi Taraporewalla (Royal Brisbane and Women's Hospital)
- Jie Cao (Nanjing University of Finance and Economics)
3:30pm - 5pm
R29 - Large graph indexing (St Germaine)
FERRARI: Flexible and Efficient Reachability Range Assignment for Graph Indexing
- Stephan Seufert (Max Planck Institute for Informatics)
- Avishek Anand (Max Planck Institute for Informatics)
- Srikanta Bedathur (IIIT Delhi)
- Gerhard Weikum (Max Planck Institute for Informatics)
gIceberg: Towards Iceberg Analysis in Large Graphs
- Nan Li (UCSB)
- Ziyu Guan (UC Santa Barbara)
- Lijie Ren
- Jian Wu
- Jiawei Han (University of Illinois at Urbana-Champaign)
- Xifeng Yan (UC Santa Barbara)
Top-k Graph Pattern Matching over Large Graphs
- Jiefeng Cheng (SIAT, China)
- Xianggang Zeng (SIAT, China)
- Jeffrey Xu Yu (Chinese University of Hong Kong)
3:30pm - 5pm
R30 - Web data (Bastille 1)
Breaking the Top-k Barrier of Hidden Web Databases
- Saravanan Thirumuruganathan (University of Texas at Arlington)
- Nan Zhang (George Washington University)
- Gautam Das (UT Arlington and QCRI)
Automatic Extraction of Top-k Lists from the Web
- Zhixian Zhang (Shanghai Jiao Tong University)
- Kenny Zhu (Shanghai Jiao Tong University)
- Haixun Wang (Microsoft)
- Hongsong Li (Microsoft Research Asia)
Finding Interesting Correlations with Conditional Heavy Hitters
- Katsiaryna Mirylenka
- Graham Cormode (AT&T Labs Research)
- Themis Palpanas (University of Trento, Italy)
- Divesh Srivastava (AT&T Labs)
3:30pm - 5pm
R31 - Query optimization (Bastille 2)
Predicting Query Execution Time: Are Optimizer Cost Models Really Unusable?
- Wentao Wu (University of Wisconsin)
- Yun Chi (NEC Laboratories America)
- Shenghuo Zhu
- Junichi Tatemura
- Hakan Hacigumus (NEC Labs)
- Jeffrey Naughton (University of Wisconsin Madison)
Query Optimization for Differentially Private Data Management Systems
- Shangfu Peng (Shanghai Jiao Tong University)
- Yin Yang (Advanced Digital Sciences Center)
- Zhenjie Zhang (ADSC, SG)
- Marianne Winslett (University of Illinois at Urbana-Champaign)
- Yong Yu (Shanghai Jiao Tong University)
Top Down Plan Generation: From Theory to Practice
- Pit Fender (University of Mannheim)
- Guido Moerkotte (University of Mannheim)
3:30pm - 5pm
R32 - Data storage (Concorde)
TBF: A Memory-Efficient Replacement Policy for Flash-based Caches
- Cristian Ungureanu (NEC Labs)
- Biplob Debnath (NEC Labs)
- Steve Rago (NEC Labs)
- Akshat Aranya (NEC Labs)
Fast Peak-to-Peak Behavior with SSD Buffer Pool
- Jaeyoung Do (UW-Madison)
- Donghui Zhang (Paradigm4)
- Jignesh Patel (University of Wisconsin)
- David DeWitt (Microsoft Jim Gray Systems Lab)
SELECT Triggers for Data Auditing
- Daniel Fabbri (University of Michigan)
- Ravi Ramamurthy (Microsoft)
- Raghav Kaushik (Microsoft)
3:30pm - 5pm
Seminar 9 (Odeon)
Secure and Privacy-Preserving Database Services in the Cloud
- Divyakant Agrawal
- Amr El Abbadi
- Shiyuan Wang (University of California, Santa Barbara)
3:30pm - 6pm
Poster Session & Closing Drinks (Ballroom 1 & 2)