Research

My research interest is in natural language understanding, knowledge base construction, and web-scale knowledge harvesting.

Publications

2017

  • [PDF] Dominic Seyler, Tatiana Dembelova, Luciano Del Corro, Johannes Hoffart, and Gerhard Weikum. KnowNER – Incremental Multilingual Knowledge in Named Entity Recognition. Corr, cs.CL, 2017.
    [Bibtex]
    @article{Seyler:2017ww,
    author = {Seyler, Dominic and Dembelova, Tatiana and Del Corro, Luciano and Hoffart, Johannes and Weikum, Gerhard},
    title = {{KnowNER - Incremental Multilingual Knowledge in Named Entity Recognition}},
    journal = {CoRR},
    year = {2017},
    volume = {cs.CL}
    }

2016

  • Thomas Rebele, Fabian Suchanek, Johannes Hoffart, Joanna Biega, Erdal Kuzey, and Gerhard Weikum. YAGO: A Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames. In The th international semantic web conferenece, iswc , kobe, japan, 2016.
    [Bibtex]
    @inproceedings{Rebele:2016vx,
    author = {Rebele, Thomas and Suchanek, Fabian and Hoffart, Johannes and Biega, Joanna and Kuzey, Erdal and Weikum, Gerhard},
    title = {{YAGO: A Multilingual Knowledge Base from Wikipedia, Wordnet, and Geonames}},
    booktitle = {The th International Semantic Web Conferenece, ISWC , Kobe, Japan},
    year = {2016}
    }
  • [PDF] Johannes Hoffart, Dragan Milchevski, Gerhard Weikum, Avishek Anand, and Jaspreet Singh. The Knowledge Awakens: Keeping Knowledge Bases Fresh with Emerging Entities. In Proceedings of the 25th international conference companion on world wide web, www 2016, montreal, canada, pages 203-206. International World Wide Web Conferences Steering Committee, 2016.
    [Bibtex]
    @inproceedings{Hoffart:2016bp,
    author = {Hoffart, Johannes and Milchevski, Dragan and Weikum, Gerhard and Anand, Avishek and Singh, Jaspreet},
    title = {{The Knowledge Awakens: Keeping Knowledge Bases Fresh with Emerging Entities}},
    booktitle = {Proceedings of the 25th International Conference Companion on World Wide Web, WWW 2016, Montreal, Canada},
    year = {2016},
    pages = {203--206},
    publisher = {International World Wide Web Conferences Steering Committee},
    month = apr
    }
  • Gerhard Weikum, Johannes Hoffart, and Fabian Suchanek. Ten Years of Knowledge Harvesting: Lessons and Challenges. Ieee data eng. bull., 39(3):41-50, 2016.
    [Bibtex]
    @article{Weikum:2016vn,
    author = {Weikum, Gerhard and Hoffart, Johannes and Suchanek, Fabian},
    title = {{Ten Years of Knowledge Harvesting: Lessons and Challenges}},
    journal = {IEEE Data Eng. Bull.},
    year = {2016},
    volume = {39},
    number = {3},
    pages = {41--50}
    }
  • [PDF] Andreas Schmidt, Johannes Hoffart, Dragan Milchevski, and Gerhard Weikum. Context-Sensitive Auto-Completion for Searching with Entities and Categories. In Proceedings of the 39th international acm sigir conference on research and development in information retrieval – systems demonstrations, pages 1097-1100. ACM Press, 2016.
    [Bibtex]
    @inproceedings{Schmidt:2016kr,
    author = {Schmidt, Andreas and Hoffart, Johannes and Milchevski, Dragan and Weikum, Gerhard},
    title = {{Context-Sensitive Auto-Completion for Searching with Entities and Categories}},
    booktitle = {Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - Systems Demonstrations},
    year = {2016},
    pages = {1097--1100},
    publisher = {ACM Press}
    }
  • [PDF] Patrick Ernst, Amy Siu, Dragan Milchevski, Johannes Hoffart, and Gerhard Weikum. DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences. In Proceedings of the 54th annual meeting of the association for computational linguistics – system demonstrations, pages 19-24, 2016.
    [Bibtex]
    @inproceedings{Ernst:uv,
    author = {Ernst, Patrick and Siu, Amy and Milchevski, Dragan and Hoffart, Johannes and Weikum, Gerhard},
    title = {{DeepLife: An Entity-aware Search, Analytics and Exploration Platform for Health and Life Sciences}},
    booktitle = {Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics - System Demonstrations},
    year = {2016},
    pages = {19--24}
    }
  • [PDF] Jaspreet Singh, Johannes Hoffart, and Avishek Anand. Discovering Entities with Just a Little Help from You. In Proceedings of the 25th acm international on conference on information and knowledge management, cikm 2016, indianapolis, usa, pages 1331-1340. ACM Press, 2016.
    [Bibtex]
    @inproceedings{Singh:2016du,
    author = {Singh, Jaspreet and Hoffart, Johannes and Anand, Avishek},
    title = {{Discovering Entities with Just a Little Help from You}},
    booktitle = {Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM 2016, Indianapolis, USA},
    year = {2016},
    pages = {1331--1340},
    publisher = {ACM Press}
    }

2015

  • [PDF] Johannes Hoffart. Discovering and Disambiguating Named Entities in Text. PhD thesis, 2015.
    [Bibtex]
    @phdthesis{Hoffart:2015wk,
    author = {Hoffart, Johannes},
    title = {{Discovering and Disambiguating Named Entities in Text}},
    year = {2015},
    month = feb
    }
  • Johannes Hoffart, Nicoleta Preda, Fabian M. Suchanek, and Gerhard Weikum. Knowledge Bases for Web Content Analytics. In Tutorial at www 2015, florence, italy, pages 1-1, 2015.
    [Bibtex]
    @inproceedings{Hoffart:2015dr,
    author = {Hoffart, Johannes and Preda, Nicoleta and Suchanek, Fabian M and Weikum, Gerhard},
    title = {{Knowledge Bases for Web Content Analytics}},
    booktitle = {Tutorial at WWW 2015, Florence, Italy},
    year = {2015},
    pages = {1--1}
    }

2014

  • [PDF] Johannes Hoffart, Dragan Milchevski, and Gerhard Weikum. STICS: Searching with Strings, Things, and Cats. In The 37th international acm sigir conference on research and development in information retrieval, sigir 2014, gold coast, qld, australia, pages 1247-1248, 2014.
    [Bibtex]
    @inproceedings{Hoffart:2014dt,
    author = {Hoffart, Johannes and Milchevski, Dragan and Weikum, Gerhard},
    title = {{STICS: Searching with Strings, Things, and Cats}},
    booktitle = {The 37th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2014, Gold Coast, QLD, Australia},
    year = {2014},
    pages = {1247--1248}
    }
  • [PDF] Johannes Hoffart, Dragan Milchevski, and Gerhard Weikum. AESTHETICS: Analytics with Strings, Things, and Cats. In Proceedings of the 23rd acm international conference on conference on information and knowledge management, cikm 2014, shanghai, china, 2014.
    [Bibtex]
    @inproceedings{Hoffart:2014cy,
    author = {Hoffart, Johannes and Milchevski, Dragan and Weikum, Gerhard},
    title = {{AESTHETICS: Analytics with Strings, Things, and Cats}},
    booktitle = {Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, CIKM 2014, Shanghai, China},
    year = {2014}
    }
  • [PDF] Dat Ba Nguyen, Johannes Hoffart, Martin Theobald, and Gerhard Weikum. AIDA-light: High-Throughput Named-Entity Disambiguation. In Linked data on the web at www2014, 2014.
    [Bibtex]
    @inproceedings{Nguyen:2014wl,
    author = {Nguyen, Dat Ba and Hoffart, Johannes and Theobald, Martin and Weikum, Gerhard},
    title = {{AIDA-light: High-Throughput Named-Entity Disambiguation}},
    booktitle = {Linked Data on the Web at WWW2014},
    year = {2014}
    }
  • [PDF] Johannes Hoffart, Yasemin Altun, and Gerhard Weikum. Discovering emerging entities with ambiguous names. In Proceedings of the 23rd international conference on world wide web, www 2014, seoul, south korea, pages 385-396, 2014.
    [Bibtex]
    @inproceedings{Hoffart:2014hp,
    author = {Hoffart, Johannes and Altun, Yasemin and Weikum, Gerhard},
    title = {{Discovering emerging entities with ambiguous names}},
    booktitle = {Proceedings of the 23rd international conference on World wide web, WWW 2014, Seoul, South Korea},
    year = {2014},
    pages = {385--396}
    }

2013

  • Stephan Seufert, Srikanta J. Bedathur, Johannes Hoffart, Andrey Gubichev, and Klaus Berberich. Efficient Computation of Relationship-Centrality in Large Entity-Relationship Graphs. In Posters and demonstrations track of the 12th international semantic web conference, iswc 2013, sydney, australia, pages 1-4, 2013.
    [Bibtex]
    @inproceedings{Seufert:2013tx,
    author = {Seufert, Stephan and Bedathur, Srikanta J and Hoffart, Johannes and Gubichev, Andrey and Berberich, Klaus},
    title = {{Efficient Computation of Relationship-Centrality in Large Entity-Relationship Graphs}},
    booktitle = {Posters and Demonstrations Track of the 12th International Semantic Web Conference, ISWC 2013, Sydney, Australia},
    year = {2013},
    pages = {1--4}
    }
  • [PDF] Yafang Wang, Lili Jian, Johannes Hoffart, and Gerhard Weikum. YaLi: a Crowdsourcing Plug-In for NERD. In Sigir 2013, dublin, ireland, 2013.
    [Bibtex]
    @inproceedings{Wang:2013wx,
    author = {Wang, Yafang and Jian, Lili and Hoffart, Johannes and Weikum, Gerhard},
    title = {{YaLi: a Crowdsourcing Plug-In for NERD}},
    booktitle = {SIGIR 2013, Dublin, Ireland},
    year = {2013}
    }
  • [PDF] Mohamed Amir Yosef, Sandro Bauer, Johannes Hoffart, Marc Spaniol, and Gerhard Weikum. HYENA-live: Fine-Grained Online Entity Type Classification from Natural-language Text. In Proceedings of the 51st annual meeting of the association for computational linguistics, acl 2013, sofia, bulgaria, pages 133-138, 2013.
    [Bibtex]
    @inproceedings{Yosef:2013vb,
    author = {Yosef, Mohamed Amir and Bauer, Sandro and Hoffart, Johannes and Spaniol, Marc and Weikum, Gerhard},
    title = {{HYENA-live: Fine-Grained Online Entity Type Classification from Natural-language Text}},
    booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, ACL 2013, Sofia, Bulgaria},
    year = {2013},
    pages = {133--138}
    }
  • [PDF] Lili Jiang, Yafang Wang, Johannes Hoffart, and Gerhard Weikum. Crowdsourced Entity Markup. In Crowdsem workshop at the 12th international semantic web conference, iswc 2013, sydney, australia, 2013.
    [Bibtex]
    @inproceedings{Jiang:2013tw,
    author = {Jiang, Lili and Wang, Yafang and Hoffart, Johannes and Weikum, Gerhard},
    title = {{Crowdsourced Entity Markup}},
    booktitle = {CrowdSem Workshop at the 12th International Semantic Web Conference, ISWC 2013, Sydney, Australia},
    year = {2013}
    }
  • [PDF] Johannes Hoffart. Discovering and Disambiguating Named Entities in Text. In Phd symposion at acm sigmod international conference on management of data, sigmod 2013, new york city, usa, pages 43-48, 2013.
    [Bibtex]
    @inproceedings{Hoffart:2013wk,
    author = {Hoffart, Johannes},
    title = {{Discovering and Disambiguating Named Entities in Text}},
    booktitle = {PhD Symposion at ACM SIGMOD International Conference on Management of Data, SIGMOD 2013, New York City, USA},
    year = {2013},
    pages = {43--48}
    }
  • [PDF] Fabian M. Suchanek, Johannes Hoffart, Erdal Kuzey, and Edwin Lewis-Kelham. YAGO2s: Modular High-Quality Information Extraction with an Application to Flight Planning. In 15. gi-fachtagung datenbanksysteme für business, technologie und web, 2013.
    [Bibtex]
    @inproceedings{Suchanek:2013vd,
    author = {Suchanek, Fabian M and Hoffart, Johannes and Kuzey, Erdal and Lewis-Kelham, Edwin},
    title = {{YAGO2s: Modular High-Quality Information Extraction with an Application to Flight Planning}},
    booktitle = {15. GI-Fachtagung Datenbanksysteme f{\"u}r Business,
    Technologie und Web},
    year = {2013}
    }
  • [PDF] Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract. In 23rd international joint conference on artificial intelligence, ijcai 2013, beijing, china, pages 3161-3165, 2013.
    [Bibtex]
    @inproceedings{Hoffart:2013ww,
    author = {Hoffart, Johannes and Suchanek, Fabian M and Berberich, Klaus and Weikum, Gerhard},
    title = {{YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia: Extended Abstract}},
    booktitle = {23rd International Joint Conference on Artificial Intelligence, IJCAI 2013, Beijing, China},
    year = {2013},
    pages = {3161--3165}
    }
  • Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. Artificial intelligence, 194:28-61, 2013.
    [Bibtex]
    @article{Hoffart:2013hn,
    author = {Hoffart, Johannes and Suchanek, Fabian M and Berberich, Klaus and Weikum, Gerhard},
    title = {{YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia}},
    journal = {Artificial Intelligence},
    year = {2013},
    volume = {194},
    pages = {28--61},
    month = jan
    }

2012

  • [PDF] Mohamed Amir Yosef, Sandro Bauer, Johannes Hoffart, Marc Spaniol, and Gerhard Weikum. HYENA: Hierarchical Type Classification for Entity Names. In Proceedings of the 24th international conference on computational linguistics, coling 2012, mumbai, india, pages 1361-1370, 2012.
    [Bibtex]
    @inproceedings{Yosef:2012tz,
    author = {Yosef, Mohamed Amir and Bauer, Sandro and Hoffart, Johannes and Spaniol, Marc and Weikum, Gerhard},
    title = {{HYENA: Hierarchical Type Classification for Entity Names}},
    booktitle = {Proceedings of the 24th International Conference on Computational Linguistics, Coling 2012, Mumbai, India},
    year = {2012},
    pages = {1361--1370}
    }
  • [PDF] Johannes Hoffart, Stephan Seufert, Dat Ba Nguyen, Martin Theobald, and Gerhard Weikum. KORE: Keyphrase Overlap Relatedness for Entity Disambiguation. In Proceedings of the 21st acm international conference on information and knowledge management, cikm 2012, hawaii, usa, pages 545-554, 2012.
    [Bibtex]
    @inproceedings{Hoffart:2012vx,
    author = {Hoffart, Johannes and Seufert, Stephan and Nguyen, Dat Ba and Theobald, Martin and Weikum, Gerhard},
    title = {{KORE: Keyphrase Overlap Relatedness for Entity Disambiguation}},
    booktitle = {Proceedings of the 21st ACM International Conference on Information and Knowledge Management, CIKM 2012, Hawaii, USA},
    year = {2012},
    pages = {545--554}
    }
  • Gerhard Weikum, Johannes Hoffart, Ndapandula Nakashole, Marc Spaniol, Fabian Suchanek, and Mohamed Amir Yosef. Big Data Methods for Computational Linguistics. Ieee data eng. bull., 35:46-55, 2012.
    [Bibtex]
    @article{Weikum:2012wb,
    author = {Weikum, Gerhard and Hoffart, Johannes and Nakashole, Ndapandula and Spaniol, Marc and Suchanek, Fabian and Yosef, Mohamed Amir},
    title = {{Big Data Methods for Computational Linguistics}},
    journal = {IEEE Data Eng. Bull.},
    year = {2012},
    volume = {35},
    pages = {46--55}
    }

2011

  • [PDF] Johannes Hoffart, Mohamed Amir Yosef, Ilaria Bordino, Hagen Fürstenau, Manfred Pinkal, Marc Spaniol, Bilyana Taneva, Stefan Thater, and Gerhard Weikum. Robust Disambiguation of Named Entities in Text. In Proceedings of the 2011 conference on empirical methods in natural language processing, emnlp 2011, edinburgh, scotland, pages 782-792, 2011.
    [Bibtex]
    @inproceedings{Hoffart:2011a,
    author = {Hoffart, Johannes and Yosef, Mohamed Amir and Bordino, Ilaria and F{\"u}rstenau, Hagen and Pinkal, Manfred and Spaniol, Marc and Taneva, Bilyana and Thater, Stefan and Weikum, Gerhard},
    title = {{Robust Disambiguation of Named Entities in Text}},
    booktitle = {Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, Edinburgh, Scotland},
    year = {2011},
    pages = {782--792}
    }
  • [PDF] Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, Edwin Lewis-Kelham, Gerard de Melo, and Gerhard Weikum. YAGO2: Exploring and Querying World Knowledge in Space, Context, and Many Languages. In Proceedings of the 20th international conference companion on world wide web, www 2011, hyderabad, india, pages 229-232. ACM, 2011.
    [Bibtex]
    @inproceedings{Hoffart:2011,
    author = {Hoffart, Johannes and Suchanek, Fabian M and Berberich, Klaus and Lewis-Kelham, Edwin and de Melo, Gerard and Weikum, Gerhard},
    title = {{YAGO2: Exploring and Querying World Knowledge in Space, Context, and Many Languages}},
    booktitle = {Proceedings of the 20th International Conference Companion on World Wide Web, WWW 2011, Hyderabad, India},
    year = {2011},
    pages = {229--232},
    publisher = {ACM}
    }
  • Mohamed Amir Yosef, Johannes Hoffart, Ilaria Bordino, Marc Spaniol, and Gerhard Weikum. AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables. In Proceedings of the 37th international conference on very large databases, vldb 2011, seattle, wa, usa, pages 1450-1453, 2011.
    [Bibtex]
    @inproceedings{Yosef:2011,
    author = {Yosef, Mohamed Amir and Hoffart, Johannes and Bordino, Ilaria and Spaniol, Marc and Weikum, Gerhard},
    title = {{AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables}},
    booktitle = {Proceedings of the 37th International Conference on Very Large Databases, VLDB 2011, Seattle, WA, USA},
    year = {2011},
    pages = {1450--1453}
    }

2010

  • [PDF] Johannes Hoffart, Fabian M. Suchanek, Klaus Berberich, and Gerhard Weikum. YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia. Technical Report, Saarbrücken, Germany, 2010.
    [Bibtex]
    @techreport{Hoffart:2010,
    author = {Hoffart, Johannes and Suchanek, Fabian M and Berberich, Klaus and Weikum, Gerhard},
    title = {{YAGO2: A Spatially and Temporally Enhanced Knowledge Base from Wikipedia}},
    year = {2010},
    address = {Saarbr{\"u}cken, Germany},
    month = nov
    }

2009

  • [PDF] Johannes Hoffart, Torsten Zesch, and Iryna Gurevych. An Architecture to Support Intelligent User Interfaces for Wikis by Means of Natural Language Processing. In Proceedings of the 5th international symposium on wikis and open collaboration, wikisym 2009, orlando, fl, usa. ACM, 2009.
    [Bibtex]
    @inproceedings{Hoffart:2009,
    author = {Hoffart, Johannes and Zesch, Torsten and Gurevych, Iryna},
    title = {{An Architecture to Support Intelligent User Interfaces for Wikis by Means of Natural Language Processing}},
    booktitle = {Proceedings of the 5th International Symposium on Wikis and Open Collaboration, WikiSym 2009, Orlando, FL, USA },
    year = {2009},
    publisher = {ACM},
    month = oct
    }
  • Johannes Hoffart, Daniel Bär, Torsten Zesch, and Iryna Gurevych. Discovering Links Using Semantic Relatedness. In Inex 2009 workshop preproceedings, 2009, brisbane, australia, pages 314-325, 2009.
    [Bibtex]
    @inproceedings{Hoffart:2009a,
    author = {Hoffart, Johannes and B{\"a}r, Daniel and Zesch, Torsten and Gurevych, Iryna},
    title = {{Discovering Links Using Semantic Relatedness}},
    booktitle = {INEX 2009 Workshop Preproceedings, 2009, Brisbane, Australia},
    year = {2009},
    pages = {314--325}
    }
DSCF5127
DSCF5177
IMG_0006