Abstract
The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework proposes solutions to overcome the IE challenges in this domain such as the short context, the noisy sparse contents and the uncertain contents. To overcome the challenges facing IE from social media, State-Of-The-Art approaches need to be adapted to suit the nature of social media posts. The key components and aspects of our proposed framework are noisy text filtering, named entity extraction, named entity disambiguation, feedback loops, and uncertainty handling.
Original language | English |
---|---|
Title of host publication | Proceedings of the Third Workshop on Semantic Web and Information Extraction (SWAIE 2014) |
Place of Publication | Dublin |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 9-16 |
Number of pages | 8 |
ISBN (Print) | 978-1-873769-48-5 |
Publication status | Published - Aug 2014 |
Event | 3rd Workshop on Semantic Web and Information Extraction, SWAIE 2014 - Dublin, Ireland Duration: 24 Aug 2014 → 24 Aug 2014 Conference number: 3 |
Publication series
Name | |
---|---|
Publisher | Association for Computational Linguistics |
Volume | W14-62 |
Workshop
Workshop | 3rd Workshop on Semantic Web and Information Extraction, SWAIE 2014 |
---|---|
Abbreviated title | SWAIE |
Country/Territory | Ireland |
City | Dublin |
Period | 24/08/14 → 24/08/14 |
Keywords
- Relation/fact extraction
- Named entity extraction
- Information Extraction
- Named entity disambiguation
- Social media
- Microblogs
- Named entity recognition
- Named entity linking
- Tweets