Seminar "Organizing and Searching Information with XML"
Seminar |
Dr.-Ing. Ralf Schenkel Chair for databases and information systems |
The extended markup language XML is commonly used as a universal format for data storage and data exchange in many application domains, so more and more data is available in XML. Efficient storage and organization of such information, but also the migration of existing relational data are key elements for future applications. Additionally, algorithms for efficient and effective search in XML documents as well as in the Semantic Web of the future are highly up-to-date and significant topics in research.The seminar covers selected contributions from the most important international research conferences of the last years.
Tue, April 29th: XML for Beginners
Speaker: Ralf Schenkel Slides (ppt) Slides(pdf)
Tue, May 27th: Index Structures for XML Documents
Speakers: Benedikt Fries Sarah Schmidt Alexander Walz Tutor: Hanglin Pan Slides Elaboration
Literature:
Brian Cooper et al: A Fast Index for Semistructured Data. In: Proceedings of the 27th International Conference on Very Large Databases (VLDB), Roma, Italy, 2001
Quanzhong Li and Bongki Moon: Indexing and Querying XML Data for Regular Path Expressions. In: Proceedings of the 27th International Conference on Very Large Databases (VLDB), Roma, Italy, 2001
Torsten Grust: Accelerating XPath Location Steps. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, WA, June 3-6, 2002.
Chin-Wan Chung, Jun-Ki Min, Kyuseok Shim: APEX: An Adaptive Path Index for XML data. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, WA, June 3-6, 2002.
Raghav Kaushik et al.: Covering Indexes for Branching Path Queries. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, WA, June 3-6, 2002.
Raghav Kaushik et al.: Updates for Structure Indexes. In: Proceedings of the 28th International Conference on Very Large Databases (VLDB), Hongkong, China, 2002.
Tue, June 3rd: Updating XML Data & Transaction Support for XML
Speakers: Stefan Bender Christian Fuchs Michael Schmidt Tutor: Ralf Schenkel Slides Elaboration
Literature:
Y. Wang, D. DeWitt, J-Y Cai: X-Diff: An Efficient Change-Detection Algorithm for XML Documents. In: Proceedings of the 19th International Conference on Data Engineering (ICDE), Bangalore, India, 2003.
G. Cobena, S. Abiteboul, A. Marian: Detecting Changes in XML documents. In: Proceedings of the 18th International Conference on Data Engineering (ICDE), San Jose, CA, 2002.
Igor Tatarinov et al.: Updating XML. In: Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, Santa Barbara, CA, 2001.
Torsten Grabs, Klemens Böhm, Hans-Jörg Schek: XMLTM: Efficient Transaction Management for XML Documents. In: Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, McLean, VA, USA, November 4-9, 2002. (see this Technical Report for a more detailed description of XMLTM.)
Sven Helmer, Carl-Christian Kanne, Guido Moerkotte: Isolation in XML Bases. Technical Report, University of Mannheim, Germany, 2001.
Tue, June 17th: Selectivity Estimation for XML Queries
Speakers: Thomas Beer Mostafa Khabouze Christian Linz Tutor: Stefan Siersdorfer Slides Elaboration
Literature:
Ashraf Aboulnaga and Alaa R. Alameldeen and Jeffrey F. Naughton: Estimating the Selectivity of XML Path Expressions for Internet Scale Applications. In: Proceedings of the 27th International Conference on Very Large Databases (VLDB), Roma, Italy, 2001
L. Lim et al.: XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path Selectivity Estimation. In: Proceedings of the 28th International Conference on Very Large Databases (VLDB), Hongkong, China, 2002.
Yuqing Wu, Jignesh M. Patel, H.V. Jagadish: Estimating Answer Sizes for XML Queries. In: Proceedings of the 8th International Conference on Extending Database Technology (EDBT), Prague, Czech Republic, March 25-27, 2002.
Minos N. Garofalakis: Statistical Synopses for Graph-Structured XML Databases. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison, WA, June 3-6, 2002.
Minos N. Garofalakis: Structure and Value Synopses for XML Data Graphs. In: Proceedings of the 28th International Conference on Very Large Databases (VLDB), Hongkong, China, 2002.
Tue, June 24th: Schema & Schema Integration
Speaker: Thorsten Dollmann Carsten Karl Dennis Schade Tutor: Ralf Schenkel Slides Elaboration
Literature:
Tue, July 1st: XML Systems & Benchmarks
Speakers: Peter Chiv Christoph Staudt Tutor: Sergej Sizov Slides (PDF) Elaboration
Literature:
H.V. Jagadish et al.: Timber: A Native XML Database. Technical report, University of Michigan, April 2002.
Thorsten Fiebig et al.: Natix - ein natives XML-DBMS. In: Datenbank-Spektrum 1(1), 2001. See also the Natix home page.
Albrecht Schmidt et al.: XMark: A Benchmark for XML Data Management. In: Proceedings of the 28th International Conference on Very Large Databases (VLDB), Hongkong, China, 2002.
Albrecht Schmidt et al.: Why And How To Benchmark XML Databases. In: SIGMOD Record 30(3), 2001.
Norbert Fuhr, Norbert Gövert, Gabriella Kazai, Mounia Lalmas: INEX: Initiative for the Evaluation of XML Retrieval. In: Proceedings of the ACM SIGIR 2002 Workshop on XML and Information Retrieval.
Tue, July8th: Ranked Information Retrieval on XML Data
Speakers: Bernadette Blum Christian Nicolaus Markus Uhl Tutor: Stefan Siersdorfer Slides Elaboration
Literature:
Lin Guo, Feng Shao, Chavdar Botev, Jayavel Shanmugasundaram: XRANK: Ranked Keyword Search over XML Documents. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, CA, USA, June 10-12, 2003.
Torsten Grabs, Klemens Böhm, Hans-Jörg Schek: PowerDB-IR - Information Retrieval on Top of a Database Cluster. In: Proceedings of the 2001 ACM CIKM International Conference on Information and Knowledge Management, Atlanta, Georgia, November 5-10, 2001.
Norbert Fuhr, Kai Großjohann: XIRQL: A Query Language for Information Retrieval in XML Documents. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, September 9-13, 2001. (see this paper for a more detailed description of XIRQL)
Taurai Tapiwa Chinenyanga, Nicholas Kushmerick: An expressive and efficient language for XML information retrieval. In: Journal of the Americal Society for Information Science & Technology 53(6) (special issue on XML and Information Retrieval), 2002.
Tue, July 15th: Semantic Web & Ontologies
Speakers: Jun Cai Vladimir Eske Xueqiang Wang Tutor: Jens Graupmann Slides Elaboration
Literature:
Peter F. Patel-Schneider and Jerome Simeon: Building the Semantic Web on XML. In: Proceedings of the First International Semantic Web Conference, Sardinia, Italy, June 9-12, 2002
.Urvi Shah et al.: Information Retrieval on the Semantic Web. In: Proceedings of the 2002 ACM CIKM International Conference on Information and Knowledge Management, McLean, VA, USA, November 4-9, 2002
Nigel Collier: Machine Learning for Information Extraction from XML marked-up text on the Semantic Web. In: Proceedings of the Second International Workshop on the Semantic Web - SemWeb'2001, Hongkong, China, May 1, 2001.
AnHai Doan et al.: Learning to map between ontologies on the Semantic Web. In: Proceedings of the Eleventh International World Wide Web Conference, WWW2002, Honolulu, Hawaii, USA, 7-11 May 2002. ACM, 2002.
Tue, July 22nd: Similarity Search
Speakers: Christian Bering Carsten Greiveldinger Regis Newo Tutor: Martin Theobald Slides Elaboration
Literature:
last change: Ralf Schenkel, August 18th, 2003