Changelog
All notable changes to this project are documented here. The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
v0.5.0 (2024-05-04)
Added
Recursive chunking strategy.
Documentation updated.
Changed
Removed the option to save chunks to disk, since this can be done outside the library.
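Since writing chunks to disk is now left to the caller, the following is a minimal sketch of one way to do it with the standard library alone. It assumes the chunks are available as a plain list of strings; the helper names `save_chunks` and `load_chunks` and the JSON Lines layout are illustrative choices, not part of Langformers.

```python
import json
from pathlib import Path


def save_chunks(chunks, path):
    """Write one JSON object per line so each chunk keeps its position.

    `chunks` is assumed to be a list of strings (hypothetical input shape).
    """
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("w", encoding="utf-8") as f:
        for index, text in enumerate(chunks):
            record = {"index": index, "text": text}
            # json.dumps escapes embedded newlines, so one record stays on one line.
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return path


def load_chunks(path):
    """Read the chunks back in the order they were written."""
    with Path(path).open("r", encoding="utf-8") as f:
        return [json.loads(line)["text"] for line in f if line.strip()]
```

JSON Lines is used here because a chunk may itself contain line breaks, which a plain one-chunk-per-line text file would not round-trip.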
v0.4.0 (2024-04-17)
Changed
Langformers now requires Python 3.10 or higher.
Made the dependencies of the semantic search pipelines optional. Previously, installing Langformers required FAISS, ChromaDB, and Pinecone.
Documentation updated.
v0.3.1 (2024-04-16)
Changed
Fixed a “CUDA error” in the mimicking pipeline caused by incorrect use of the tokenizer’s max_length and the model’s max_position_embeddings. CPU and Apple Silicon devices were not affected.
Documentation updated.
Added
Precision and recall metrics (macro and weighted) added to the classification pipeline.
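For readers unsure how the macro and weighted variants differ, here is a small pure-Python sketch. It is not the code used in the classification pipeline; the function name `precision_recall` and its signature are hypothetical. Macro averaging gives every class equal weight, while weighted averaging weights each class by its number of true examples.

```python
from collections import Counter


def precision_recall(y_true, y_pred, average="macro"):
    """Return (precision, recall) averaged over classes.

    average="macro": every class counts equally.
    average="weighted": each class is weighted by its support in y_true.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and the same length")

    labels = sorted(set(y_true) | set(y_pred))
    support = Counter(y_true)      # true examples per class
    predicted = Counter(y_pred)    # predictions per class
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)

    per_class = {}
    for label in labels:
        # A class with no predictions (or no true examples) scores 0.0.
        p = correct[label] / predicted[label] if predicted[label] else 0.0
        r = correct[label] / support[label] if support[label] else 0.0
        per_class[label] = (p, r)

    if average == "macro":
        weights = {label: 1 / len(labels) for label in labels}
    elif average == "weighted":
        weights = {label: support[label] / len(y_true) for label in labels}
    else:
        raise ValueError("average must be 'macro' or 'weighted'")

    precision = sum(weights[label] * per_class[label][0] for label in labels)
    recall = sum(weights[label] * per_class[label][1] for label in labels)
    return precision, recall
```

On imbalanced data the two averages can diverge noticeably, which is why reporting both is useful; weighted recall always equals plain accuracy.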
v0.3.0 (2024-04-14)
Added
Chunking pipelines: fixed-size chunking and semantic chunking.
Documentation and README.md updated.
v0.2.0 (2024-04-10)
Added
Reranking pipeline.
Documentation and README.md updated.
Changed
Improved the device check logic (cuda, mps, cpu).
v0.1.0 (2024-04-08)
Added
First public release of Langformers.
Core NLP pipelines.
Documentation and README.md.