Releases: winkjs/wink-nlp
Releases · winkjs/wink-nlp
Added non-breaking space handling capabilities
Version 2.2.0 April 03, 2024
✨ Features
- Detokenization restores both regular and non-breaking spaces to their original positions. 🤓
Introducing cosine similarity for word vectors
Version 2.1.0 March 24, 2024
✨ Features
- You can now use
similarity.vector.cosine( vectorA, vectorB )
to compute similarity between two vectors on a scale of 0 to 1. 🤓
Word embeddings have arrived!
Version 2.0.0 March 24, 2024
✨ Features
- Seamless word embedding integration enhances winkNLP's semantic capabilities. 🎉 👏 🙌
- Pre-trained 100-dimensional word embeddings for over 350,000 English words released: wink-embeddings-sg-100d. 💯
- API remains unchanged — no code updates needed for existing projects. The new APIs include: 🤩
- Obtain vector for a token: Use the
.vectorOf( token )
API. - Compute sentence/document embeddings: Employ the
as.vector
helper: use.out( its.lemma, as.vector )
on tokens of a sentence or document. You can also useits.value
orits.normal
. Tokens can be pre-processed to remove stop words etc using the.filter()
API. Note, theas.vector
helper uses averaging technique. - Generate contextual vectors: Leverage the
.contextualVectors()
method on a document. Useful for pure browser-side applications! Generate custom vectors contextually relevant to your corpus and use them in place of larger pre-trained wink embeddings.
- Obtain vector for a token: Use the
- Comprehensive documentation along with interesting examples is coming up shortly. Stay tuned for updates! 😎
Added Deno example
Version 1.14.3 July 21, 2023
✨ Features
- Added a live example for how to run winkNLP on Deno. 👍
Fixed a bug
Version 1.14.2 July 1, 2023
🐛 Fixes
- Paramteters in
markup()
are optional now in TS code — squashed a typescript declaration bug. 🙌
Squashed a bug
Introducing helper for extracting important sentences from a document
Version 1.14.0 May 20, 2023
✨ Features
- You can now use
its.sentenceWiseImprotance
helper to obtain sentence wise importance (on a scale of 0 to 1) of a document, if it is supported by language model. 📚📊🤓 - Checkout live example How to visualize key sentences in a document? 👀
Operational update
Version 1.13.1 March 27, 2023
⚙️ Updates
- Some behind the scene model improvements. 😎 🤓
- Add clarity on typescript configuration in README. ✅
Improving mark's functionality in custom entities
Version 1.13.0 December 09, 2022
✨ Features
- Mark allows marking w.r.t. the last element of the pattern. For example if a pattern matches
a fluffy cat
thenmark: [-2, -1]
will extractfluffy cat
— especially useful when the match length is unknown. 💃 - Improved error handling while processing mark's arguments. 🙌
Operational update
Version 1.12.3 November 18, 2022
⚙️ Updates
- README is now more informative and links to examples and benchmarks 👍
- Benchmarked on latest machine, browser versions 🖥