DocoPool: Discovering hidden knowledge in text

MARKET NEED

Many businesses store pools of information-rich text in their systems, such as reports, customer reviews, user comments and general word processing documents. Analysing these text sources can uncover hidden topics or trends, such as unexpected clusters of words across documents. In practice, however, it is difficult to manually examine all of this information to discover such hidden trends or topics.

TECHNOLOGY SOLUTION

Figure 1. Visualising document groups based on word clusters or topics

DocoPool is a web tool that allows users to explore the content of text documents for hidden knowledge. It:

- identifies and visualises word groupings or “topics” across sets of text documents
- carves each document up into individual words and word frequencies
- uses a probabilistic topic modelling algorithm to discover the spread of word occurrences across a corpus of text documents

KEY FEATURES

- Easy-to-interpret visualisations
- Drill-down on document details for deeper analysis of word clusters
- Specialist or domain-specific word exclusions, to prevent clouding of hidden topics
- Flexible document upload (.txt, .pdf and .docx)
- “Save” facilities to allow revisiting of explorations

Figure 3: Document exploration: An iterative process

RESEARCH TEAM

Dr. Caroline Maillet, Dublin Institute of Technology
Dr. Susan McKeever, Dublin Institute of Technology

By Kate McCarthy | 3 June 2022 | Demonstrators
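
As an illustration of the word-frequency step listed under TECHNOLOGY SOLUTION, the sketch below splits documents into words, applies a word exclusion list, and builds a document-by-word count matrix. This is not DocoPool's implementation: the function names, the tokenisation rule and the exclusion list are all hypothetical, chosen only to make the idea concrete. A count matrix of this kind is the usual input to probabilistic topic models such as latent Dirichlet allocation, which is named here only as an example; the source does not say which algorithm DocoPool uses.

```python
import re
from collections import Counter

# Hypothetical exclusion list. A real deployment would likely combine
# general stop words with specialist or domain-specific terms.
DEFAULT_EXCLUSIONS = {"the", "a", "an", "and", "of", "to", "in", "is", "was"}


def word_frequencies(text, exclusions=DEFAULT_EXCLUSIONS):
    """Split text into lowercase words and count them, skipping exclusions."""
    # Assumed tokenisation rule: runs of letters, with an optional
    # internal apostrophe (so "don't" stays one word).
    words = re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())
    return Counter(w for w in words if w not in exclusions)


def document_term_matrix(documents, exclusions=DEFAULT_EXCLUSIONS):
    """Return (vocabulary, matrix), where matrix[i][j] is the number of
    times vocabulary[j] occurs in documents[i]."""
    counts = [word_frequencies(doc, exclusions) for doc in documents]
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[word] for word in vocabulary] for c in counts]
    return vocabulary, matrix


if __name__ == "__main__":
    docs = [
        "The delivery was late and the box was damaged.",
        "Late delivery, damaged box, poor service.",
    ]
    vocabulary, matrix = document_term_matrix(docs)
    print(vocabulary)
    for row in matrix:
        print(row)
```

Passing a larger `exclusions` set mimics the domain-specific word exclusion feature: words that appear in nearly every document of a specialist corpus are dropped so that they do not dominate the counts.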