Institut für Informatik
FB 08 - Physik, Mathematik und Informatik / Johannes Gutenberg-Universität Mainz
- 06131/39-23378
- 06131/39-23534
Gottron, Thomas
Bridging the gap: from multi document template detection to single document content extraction. Proceedings of the IASTED International Conference on Internet & Multimedia Systems & Applications with Special Sessions on Visual Communications (EuroIMSA), March 17-19, 2008, Innsbruck, Austria. Anaheim, Calif. et al.: Acta Press, 2008, pp. 66-71
Gottron, Thomas
Clustering template based web documents. In: MacDonald, Craig (ed.). Advances in information retrieval: 30th European Conference on IR Research, ECIR 2008, Glasgow, UK, March 30 - April 3, 2008; proceedings. Berlin et al.: Springer, 2008, pp. 40-51
Gottron, Thomas
Combining content extraction heuristics: the CombinE system. In: Kotsis, G. (ed.). The 10th International Conference on Information Integration and Web-based Applications & Services (iiWAS 2008), November 24-26, 2008, Linz, Austria. New York, NY: ACM, 2008, pp. 591-595
Gottron, Thomas
Content Code Blurring: a new approach to content extraction. In: Tjoa, A. Min (ed.). Proceedings / DEXA 2008, 19th International Conference on Database and Expert Systems Applications, 1-5 September 2008, Turin, Italy [workshop papers]. Piscataway, NJ: IEEE, 2008, pp. 29-33
Gottron, Thomas
Content extraction: identifying the main content in HTML documents. Mainz: Univ., 2008. 252 pp.
Martin, Ludger
Usability Analysis and Visualization of Web 2.0 Applications. In: Chao, Liu (ed.). Proceedings: Tenth IEEE International Symposium on Web Site Evolution, October 3-4, 2008, Beijing, China. Piscataway, NJ: IEEE, 2008, pp. 121-124
Gottron, Thomas
Evaluating content extraction on HTML documents. In: Grout, Vic (ed.). Proceedings of the Second International Conference on Internet Technologies and Applications (ITA 07), 4-7 September 2007, University of Wales, NEWI, Wrexham, UK. Wrexham et al.: NEWI, 2007, pp. 123-132