goru001
released this
This contains addition of new feature - get_similar_sentences - with which you can augment and multiply your data in supported languages
Assets
4
goru001
released this
New Features:
- You can now get 400 dimensional encoding for sentences using
get_sentence_encoding- supported for all languages in iNLTK - You can now get similarity score (cosine similarity) between 2 sentences using
get_sentence_similarity- supported for all languages in iNLTK.
New Model:
- The above features will not work for punjabi language with the old model. Please execute the following code-snippet before using them
from inltk.inltk import reset_models
>> reset_models('pa')
>> setup('pa')
Assets
4
goru001
released this
Added Urdu support to iNLTK - thanks to @anuragshas contributions
Added Windows 10 support - thanks to @ibrahiminfinite contributions
Assets
4
goru001
released this
Added get_embedding_vectors function to allow users to get embedding vectors for their words/sentences/documents
Assets
4
goru001
released this
Added tamil support

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
