Unfamiliar Data Modeller: Discovering structures within unfamiliar data sets
Project Description MARKET NEED ETL (Extract Transform Load) software systems are responsible for the extraction of data from various sources, the cleansing, validation, and reformatting of data, and the insertion of data into a data warehouse. In some cases the source data is unfamiliar to the developer or analyst and has no apparent structure. ETL tools typically do not try to infer relationships between source entities, with the result that the user is forced to configure filters, joins, aggregations manually to define the transformation from source to destination. Doing so can be time-consuming and error-prone. TECHNOLOGY SOLUTION In this project
Large-Scale Sentiment Clustering: Clustering of documents based on sentiment
Project Description MARKET NEED There has been enormous growth in the field of sentiment analysis in recent years. CeADAR recently delivered Next-Generation Sentiment, software which provides a sophisticated measure of sentiment and a better understanding of the emotional composition of sentiment. While that solution allows the user to develop a very nuanced understanding of the sentiment expressed in a document, it lacks a means to compare the results of sentiment analysis from one document to another. We now introduce the complementary software Large-Scale Sentiment Clustering, which allows the clustering of documents based on sentiment and the identification of exceptions to the
Supply Chain Optimisation: A tool to optimise the amount of stock invested in customer inventory
Project Description Market Need An inefficient supply chain has a detrimental effect on the profits achieved by any business. Many companies supply inventory to their customers via a consignment agreement, whereby the customer stores the product but is not invoiced by the supplier unless the product is sold or used. Inefficient consignment inventory policies result in the supplier over investing in customer inventory and can lead to product wastage. The complexity of setting the inventory level is increased for suppliers who guarantee against stock outs. For example, this is the agreement between some medical device suppliers and their customers (hospitals).
Time Series Pattern Search: A tool to extract events from time series data
Project Description MARKET NEED The market need for this technology is for applications which perform analysis and data mining of time series. These applications are ubiquitous in many different different domains: • Body Area Network (BAN) data for medical applications. • Patient monitoring e.g. real time analysis of ECG data, • Telemetry of aircraft flights, • Fluctuations of stock market. These applications use methodologies such as indexing, classification, clustering and approximation of time series. TPS is software tool that is relevant to all of these applications. TECHNOLOGY SOLUTION TPS is a tool that can be used by developers and data
Content Curated Consumer Reviews: Extract and cluster sentences on reviews (Product Fault Finder)
Project Description MARKET NEED Publicly expressed opinions in online reviews, such as found on Amazon.com, are becoming an integral part of the modern consumer’s decision making process. The 2014 version of an annual study by BrightLocal1 revealed that 88% of consumers trust online reviews as much as they trust personal recommendations. This percentage has been steadily increasing since the study began in 2011 (with 67% in 2011, 72% in 2012 and 79% in 2013). Further, the study found that 85% of consumers read fewer than 11 reviews, with 67% only reading up to 6 reviews. Depending on the product, review
Next-Generation Sentiment: A fine-grained understanding of the sentiment expressed in text articles
Project Description MARKET NEED There has been enormous growth in the field of sentiment analysis in recent years. About 94% of the articles in Google Scholar on the subject have been published in the last ten years. Despite that surge in interest, there is still a need for a more sophisticated measure of sentiment, i.e., beyond merely positive, negative, or neutral. There is also a need for a better understanding of the emotional quality (e.g., happy, disappointed, anxious) of sentiment, and of how that can change over time. Finally, we wish to be able to distinguish the sentiment targeted at
Analytics for Inventory and Supply Chain Management: Optimisation of repair supply chain networks
Project Description Market Need To support after-sales warranty services many companies manage a spare parts supply chain network. This involves the purchasing and positioning of spare parts in distribution centres across the supported region. The key objective in the management of this supply chain is to honour the warranty agreement with customers by making replacement parts available to conduct repairs within the agreed timeframes (e.g. Next Business Day, Same Business Day). Having too much stock results in additional storage and obsolescence costs, while having too little stock results in stock out situations and poor customer service. Technology Solution The optimisation
Query-Aware Database Generator: Improve query correctness in complex SQL environments (Query Correctness Platform – I)
Project Description Market Need Database queries are often business-critical. But there are few methods for ensuring their correctness. Businesses need new methods and tools that increase confidence in the correctness of both standard and ad-hoc queries. Query Testing There may be no data at all, e.g. in a green field project; or, there may be data in a legacy system but data migration may not have happened yet; or, there may be data but it may be too sensitive to share with the software development team. When there is data, there is often so much of it that it is
Process Data Analytics: A tool to standardise business process models
Project Description Market Need The field of Business Process Management (BPM) is now an established one, as reflected in a growing research community, a large volume of research publications, and specialised conferences. In spite of that, BPM techniques developed in a research context can falter when confronted with real-world enterprise situations. Large organisations can have hundreds or even thousands of business processes in place, and often those processes are poorly documented and the relationships between them poorly understood. Business processes might be duplicated partially or wholly, with little or no reuse of process fragments. The problem of complexity becomes even
Query Audit Tool: Visualising query result provenance (Query Correctness Platform – II)
Project Description Market Need Database queries are often business-critical. But there are few methods for ensuring their correctness . Businesses need new methods and tools that increase confidence in the correctness of both standard and ad-hoc queries. Technology Solution Process is key to querying with confidence: businesses must use methods that foster query correctness. But, within these methods, technologies can assist the people who write and test queries. We are developing a query correctness platform , whose tools will assist programmers in testing and debugging complex SQL queries. The platform comprises a Query-Aware Database Generator and a Query Audit Tool.
AI for Business Process Modelling: Business process modelling with automated planning
Project Description Market Need Business process modelling is a technique for representing existing or future processes within an organisation, particularly with a view to documenting and improving them. Many business processes manipulate sophisticated data sources to generate a data set that will be subject to subsequent data analytics tasks, and their study falls within the general field of data management. User-friendly editors are available which allow business users to define business processes as a structured collection of tasks towards a specific goal. What they do not provide, however, is the ability to verify and optimise those processes or the data