diff --git a/CHANGELOG.md b/CHANGELOG.md index 9bdf7f624..77b7eeb4c 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -11,9 +11,13 @@ Notable changes between versions. - For full release notes, see: - For detailed commit changes, see: - (select tags to compare) + (select tags to compare) -## Version 5.0.5 -> Dev +## Version 5.1.0 -> Dev + +[WIP] + +## Version 5.0.5 -> 5.1.0 - Add Thai Discourse Treebank postag #910 - Add Thai Universal Dependency Treebank postag #916 @@ -21,6 +25,10 @@ Notable changes between versions. - Add support for list of strings as input to sent_tokenize() #927 - Add pythainlp.tools.safe_print to handle UnicodeEncodeError on console #969 - Fix collate() to consider tonemark in ordering #926 +- Fix nlpo3.load_dict() that never print error msg when not success #979 +- Add Thai Solar Date convert to Thai Lunar Date #998 +- Add Thai pangram text #1045 +- Remove clause_tokenize #1024 ## Version 5.0.4 -> 5.0.5 diff --git a/CITATION.cff b/CITATION.cff index 7113feb53..aa34b65a1 100644 --- a/CITATION.cff +++ b/CITATION.cff @@ -45,5 +45,5 @@ keywords: - "Thai language" - "Thai NLP" license: Apache-2.0 -version: 5.1.0-beta2 -date-released: "2025-02-09" +version: 5.1.0 +date-released: "2025-02-25" diff --git a/README.md b/README.md index 865b6598f..028461e44 100644 --- a/README.md +++ b/README.md @@ -30,8 +30,8 @@ pip install pythainlp | Version | Description | Status | |:------:|:--:|:------:| -| [5.0.5](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/788) | -| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) | +| [5.1.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) | +| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/1080) | ## Getting Started diff --git a/README_TH.md b/README_TH.md index c59dbd746..bb0b2e5cc 100644 --- a/README_TH.md +++ b/README_TH.md @@ -26,8 +26,8 @@ pip install pythainlp | รุ่น | คำอธิบาย | สถานะ | |:------:|:--:|:------:| -| [5.0.5](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/788) | -| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) | +| [5.1.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) | +| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/1080) | ติดตามพวกเราบน [PyThaiNLP Facebook page](https://www.facebook.com/pythainlp/) เพื่อรับข่าวสารเพิ่มเติม diff --git a/SECURITY.md b/SECURITY.md index 97403d404..1e73a929f 100644 --- a/SECURITY.md +++ b/SECURITY.md @@ -4,8 +4,9 @@ | Version | Supported | | ------- | ------------------ | +| 5.1.x | :white_check_mark: | | 5.0.x | :white_check_mark: | -| 4.0.x | :white_check_mark: | +| 4.0.x | :x: | | 3.1.x | :x: | | 3.0.x | :x: | | 2.3.x | :x: | diff --git a/codemeta.json b/codemeta.json index 72328792e..7df85191a 100644 --- a/codemeta.json +++ b/codemeta.json @@ -3,7 +3,7 @@ "@type": "SoftwareSourceCode", "name": "PyThaiNLP", "description": "Thai Natural Language Processing in Python", - "version": "5.1.0-beta2", + "version": "5.1.0", "author": [ { "@type": "Person", diff --git a/docs/api/augment.rst b/docs/api/augment.rst index 39af41c18..c41be8b5a 100644 --- a/docs/api/augment.rst +++ b/docs/api/augment.rst @@ -8,13 +8,6 @@ Introduction The `pythainlp.augment` module is a powerful toolset for text augmentation in the Thai language. Text augmentation is a process that enriches and diversifies textual data by generating alternative versions of the original text. This module is a valuable resource for improving the quality and variety of Thai language data for NLP tasks. -TextAugment Class ------------------ - -The central component of the `pythainlp.augment` module is the `TextAugment` class. This class provides various text augmentation techniques and functions to enhance the diversity of your text data. It offers the following methods: - -.. autoclass:: pythainlp.augment.TextAugment - :members: WordNetAug Class ---------------- diff --git a/docs/api/transliterate.rst b/docs/api/transliterate.rst index 8b71bff1b..6222e9cf7 100644 --- a/docs/api/transliterate.rst +++ b/docs/api/transliterate.rst @@ -37,15 +37,13 @@ Transliteration Engines **thai2rom** -.. automodule:: pythainlp.transliterate.thai2rom.romanize - :members: +.. autofunction:: pythainlp.transliterate.thai2rom.romanize The `thai2rom` engine specializes in transliterating Thai text into romanized form. It's particularly useful for rendering Thai words accurately in an English phonetic format. **royin** -.. automodule:: pythainlp.transliterate.royin.romanize - :members: +.. autofunction:: pythainlp.transliterate.royin.romanize The `royin` engine focuses on transliterating Thai text into English characters. It provides an alternative approach to transliteration, ensuring accurate representation of Thai words. diff --git a/docs/api/word_vector.rst b/docs/api/word_vector.rst index 3550af5a1..107328e88 100644 --- a/docs/api/word_vector.rst +++ b/docs/api/word_vector.rst @@ -13,30 +13,6 @@ Before using this module, you need to ensure that the `numpy` and `gensim` libra Modules ------- -.. autofunction:: doesnt_match - :noindex: - - The `doesnt_match` function is designed to identify the word that does not match a set of words in terms of semantic similarity. It is useful for tasks like word sense disambiguation. - -.. autofunction:: get_model - :noindex: - - The `get_model` function allows you to load a pre-trained word vector model, which can then be used for various word vector operations. This function serves as the entry point for accessing pre-trained word vectors. - -.. autofunction:: most_similar_cosmul - :noindex: - - The `most_similar_cosmul` function finds words that are most similar to a given word in terms of cosine similarity. This function is useful for word analogy tasks and word similarity measurement. - -.. autofunction:: sentence_vectorizer - :noindex: - - The `sentence_vectorizer` function takes a sentence as input and returns a vector representation of the entire sentence based on word vectors. This is valuable for document similarity and text classification tasks. - -.. autofunction:: similarity - :noindex: - - The `similarity` function calculates the cosine similarity between two words based on their word vectors. It helps in measuring the semantic similarity between words. .. autoclass:: WordVector :members: diff --git a/pythainlp/__init__.py b/pythainlp/__init__.py index a7fa3c701..b32052002 100644 --- a/pythainlp/__init__.py +++ b/pythainlp/__init__.py @@ -2,7 +2,7 @@ # SPDX-FileCopyrightText: 2016-2025 PyThaiNLP Project # SPDX-FileType: SOURCE # SPDX-License-Identifier: Apache-2.0 -__version__ = "5.1.0-beta2" +__version__ = "5.1.0" thai_consonants = "กขฃคฅฆงจฉชซฌญฎฏฐฑฒณดตถทธนบปผฝพฟภมยรลวศษสหฬอฮ" # 44 chars diff --git a/setup.cfg b/setup.cfg index adda69820..0d701287c 100644 --- a/setup.cfg +++ b/setup.cfg @@ -1,5 +1,5 @@ [bumpversion] -current_version = 5.1.0-beta2 +current_version = 5.1.0 commit = True tag = True parse = (?P\d+)\.(?P\d+)\.(?P\d+)(\-(?P[a-z]+)(?P\d+))? diff --git a/setup.py b/setup.py index 56e58b316..06c4d755b 100644 --- a/setup.py +++ b/setup.py @@ -150,7 +150,7 @@ setup( name="pythainlp", - version="5.1.0-beta2", + version="5.1.0", description="Thai Natural Language Processing library", long_description=LONG_DESC, long_description_content_type="text/markdown",