Bookmarks
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • Daily Feed
  • Weekly Feed
  • Monthly Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Filters

Untagged links
4 results tagged tokens  ✕   ✕
GitHub - chartbeat-labs/textacy https://github.com/chartbeat-labs/textacy
Wed 20 Nov 2019 01:31:50 PM PST archive.org

textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals --- tokenization, part-of-speech tagging, dependency parsing, etc. --- delegated to another library, textacy focuses primarily on the tasks that come before and follow after. Abstracts away the boilerplate for the stuff you actually care about.

Quickstart: https://chartbeat-labs.github.io/textacy/getting_started/quickstart.html

python module exocortex nlp faas tokens tagging dependencies parsing ai ml
spacy · PyPI https://pypi.org/project/spacy/
Wed 08 May 2019 01:05:48 PM PDT archive.org

spaCy is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. spaCy comes with pre-trained statistical models and word vectors, and currently supports tokenization for 45+ languages. It features the fastest syntactic parser in the world, convolutional neural network models for tagging, parsing and named entity recognition and easy deep learning integration. It's commercial open-source software, released under the MIT license.

python nlp exocortex models tokens languages ml betafork module text
pnathan/signifier: finds tokens of significance, in significant files. https://github.com/pnathan/signifier
Mon 19 Mar 2018 11:27:36 PM PDT archive.org

Somebody I know on Mastodon threw together a quick utility that picks keywords out of documents you feed it and throws them into a Neo4j graph database for indexing. Written in rust.

repo tokens github rust keywords
jdkato/prose: A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction. https://github.com/jdkato/prose
Mon 19 Mar 2018 03:43:42 PM PDT archive.org

prose is a natural language processing library (English only, at the moment) in pure Go. It supports tokenization, segmentation, part-of-speech tagging, and named-entity extraction. Parses English text, can also natively extract e-mail addresses, hashtags, @mentions, URLs, and emoticons. Can tag segmented and analyzed text by part of speech, including punctuation marks. Can identify types of entities (people, places). Also has the option to build and train custom models.

foss github golang nlp tokens tagging extraction
4129 links, including 264 private
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn