login/register

Snip!t channels - Alan Dix

Channels > text mining

Order by: date | title | url | snip    Show: just this cat | subcats too

2011-03-02 16:12:33     Docvert - Microsoft Word to Open Standards [current 4.0]

http://holloway.co.nz/docvert/index.html

Docvert takes word processor files (typically .doc) and ...
Web Service receives .doc file and converts it to a Open ...
The resulting OpenDocument is then optionally converted ...
The result is returned in a .zip file.
Docvert has a user-friendly inter

view snip

/Channels/techie/web development
/Channels/text mining

2011-03-02 13:30:49     The Stanford NLP (Natural Language Processing) Group

http://nlp.stanford.edu/software/CRF-NER.shtml

CRFClassifier is a Java implementation of a Named Entity... Entity Recognition (NER) labels sequences of words in a ... names of things, such as person and company names, or ge... names. The software provides a general (arbitrary order)... linear chai

view snip

/Channels/text mining

2011-03-02 11:34:03     Acromine

http://www.chokkan.org/research/acromine/

Acronyms result from a highly productive type of term va... substitutes fully expanded terms (e.g., retinoic acid re... shortened term-forms (e.g., RARA). Even though no generi... patterns have been established for dealing with acronym ...
Acromine is

view snip

/Channels/text mining

2011-03-02 13:36:57     Tagged datasets for named entity recognition tasks

http://www.cs.technion.ac.il/~gabr/resou.../ne_datasets.html

Tagged datasets for named entity recognition tasks

view snip

/Channels/text mining

2011-03-02 13:36:45     Automatic Content Extraction (ACE) Evaluation

http://www.itl.nist.gov/iad/894.01/tests/ace/

The objective of the ACE program is to develop automatic... technology to support automatic processing of human lang... from a variety of sources (such as newswire, broadcast c... weblogs). ACE technology R&D is aimed at supporting various classificati

view snip

/Channels/text mining

Order by: date | title | url | snip    Show: just this cat | subcats too