Text Mining

Links

About Scatter/Gather

http://www.sims.berkeley.edu/~hearst/sg-overview.html

Using text clustering as a way to group document according to the overall similarities in their content. Features background information, examples and technical papers.

About the Cat-a-Cone

http://www.sims.berkeley.edu/~hearst/cac-overview.html

A novel user interface that integrates search and browsing of very large category hierarchies with their associated text collections. Features technical papers.

About TileBars

http://www.sims.berkeley.edu/~hearst/tb-overview.html

An interface which attempts to show the user, graphically, the relationship between the words in the query and the documents retrieved. Features technical papers and commercial and other uses.

Automated Info Solutions

http://www.automated-info-solutions.com/

Products and services offering automated collection of data from public web sites. Features overviews and contact information.

Compris Intelligence

http://www.compris.com/

Offers solutions for understanding textual content and the automatic comprehension of text meaning by a computer. Features products, news and contact information.

Eidetica

http://www.eidetica.com/

Netherlands firm offers search and text mining solutions on a hosting basis. Features services, support, portfolio and contact information.

NetOwl - Intelligent Content Management

http://www.netowl.com/

Features of the tools named Extractor, Summarizer, TextMiner and InstaLink.

Pertinence Mining

http://www.pertinence.net/

Automatic text summarization tools. Includes contact information.

SAS Text Miner

http://www.sas.com/technologies/analytics/datamining/textminer/

Included in the famous SAS set of tools for quantitative data analysis, the module for text analysis includes clustering algorithms, document categorisation and data extraction. Overview and screenshots.

Shailendra Singh

http://shails.150m.com/

User Preferences concerning Search Engines and Teletext in the framework of Rough-Fuzzy Theory : CV and publications.

SPSS Text Analysis for Surveys

http://www.spss.com/textanalysis_surveys/

[Win] Natural language processing (NLP) and WordNet-based software to analyze and recode open-ended surveys.

Synthema

http://www.synthema.it/textmining/

Features, demo and case studies of Twid Expert and Temis Online Miner/Categorizer.

Systems Services - Web Data Retrieval

http://www.logical.btinternet.co.uk/webinf.htm

Develops applications to search out and retrieve data from web pages and e-mail archives and provides turnkey services to find and retrieve information from the web, XML databases or stores of incoming e-mail. Features contact information.

Text Mining at Waikato

http://www.cs.waikato.ac.nz/~nzdl/textmining/

The Text Mining group at the University of Waikato in New Zealand. With a focus on Viterbi search and entropy-based methods the group has a compression feel to it.

Text Mining, Web Mining, Information Retrieval and Extraction from the WWW References

http://filebox.vt.edu/users/wfan/text_mining.html

Links to reviews and analyses of text mining research. Features online presentations, white papers and other projects, papers, people and products.

TextAI: Text Analysis International

http://www.textanalysis.com/

Provides NLP applications based on its proprietary VisualText technology. Product and service information, online software tour and documentation.

TextAnalyst

http://www.megaputer.com/products/ta/index.php3

TextAnalyst is a unique text mining tool, using a semantic network for retrieval, clustering, classification, summarization, and natural language querying.

TextMining.org

http://www.textmining.org/

Information, links, download and faq on text mining and natural language processing.

Untangling Text Data Mining

http://www.sims.berkeley.edu/~hearst/papers/acl99/acl99-tdm.html

Defines data mining, information access, and corpus-based computational linguistics, and then discusses the relationship of these to text data mining. The intent behind these contrasts is to draw attention to exciting new kinds of problems for computational linguists.

WebAnalyst

http://www.megaputer.com/products/wa/index.php3

Profiles the content of a web page, or from a content database, and uses data mining techniques to associate profiled content dynamically during a browsing session.

WordStat

http://www.provalisresearch.com/wordstat/wordstat.html

Module specifically designed to study textual information. Features screen shots and purchasing information.