Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 10 additions & 2 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,16 +11,24 @@ Notable changes between versions.
- For full release notes, see:
<https://github.com/PyThaiNLP/pythainlp/releases>
- For detailed commit changes, see:
<https://github.com/PyThaiNLP/pythainlp/compare/v5.0.4...dev> (select tags to compare)
<https://github.com/PyThaiNLP/pythainlp/compare/v5.1.0...dev> (select tags to compare)

## Version 5.0.5 -> Dev
## Version 5.1.0 -> Dev

[WIP]

## Version 5.0.5 -> 5.1.0

- Add Thai Discourse Treebank postag #910
- Add Thai Universal Dependency Treebank postag #916
- Add Thai G2P v2 Grapheme-to-Phoneme model #923
- Add support for list of strings as input to sent_tokenize() #927
- Add pythainlp.tools.safe_print to handle UnicodeEncodeError on console #969
- Fix collate() to consider tonemark in ordering #926
- Fix nlpo3.load_dict() that never print error msg when not success #979
- Add Thai Solar Date convert to Thai Lunar Date #998
- Add Thai pangram text #1045
- Remove clause_tokenize #1024

## Version 5.0.4 -> 5.0.5

Expand Down
4 changes: 2 additions & 2 deletions CITATION.cff
Original file line number Diff line number Diff line change
Expand Up @@ -45,5 +45,5 @@ keywords:
- "Thai language"
- "Thai NLP"
license: Apache-2.0
version: 5.1.0-beta2
date-released: "2025-02-09"
version: 5.1.0
date-released: "2025-02-25"
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -30,8 +30,8 @@ pip install pythainlp

| Version | Description | Status |
|:------:|:--:|:------:|
| [5.0.5](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/788) |
| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) |
| [5.1.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) |
| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/1080) |

## Getting Started

Expand Down
4 changes: 2 additions & 2 deletions README_TH.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,8 +26,8 @@ pip install pythainlp

| รุ่น | คำอธิบาย | สถานะ |
|:------:|:--:|:------:|
| [5.0.5](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/788) |
| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.1 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) |
| [5.1.0](https://github.com/PyThaiNLP/pythainlp/releases) | Stable | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/900) |
| [`dev`](https://github.com/PyThaiNLP/pythainlp/tree/dev) | Release Candidate for 5.2 | [Change Log](https://github.com/PyThaiNLP/pythainlp/issues/1080) |

ติดตามพวกเราบน [PyThaiNLP Facebook page](https://www.facebook.com/pythainlp/) เพื่อรับข่าวสารเพิ่มเติม

Expand Down
3 changes: 2 additions & 1 deletion SECURITY.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,8 +4,9 @@

| Version | Supported |
| ------- | ------------------ |
| 5.1.x | :white_check_mark: |
| 5.0.x | :white_check_mark: |
| 4.0.x | :white_check_mark: |
| 4.0.x | :x: |
| 3.1.x | :x: |
| 3.0.x | :x: |
| 2.3.x | :x: |
Expand Down
2 changes: 1 addition & 1 deletion codemeta.json
Original file line number Diff line number Diff line change
Expand Up @@ -3,7 +3,7 @@
"@type": "SoftwareSourceCode",
"name": "PyThaiNLP",
"description": "Thai Natural Language Processing in Python",
"version": "5.1.0-beta2",
"version": "5.1.0",
"author": [
{
"@type": "Person",
Expand Down
7 changes: 0 additions & 7 deletions docs/api/augment.rst
Original file line number Diff line number Diff line change
Expand Up @@ -8,13 +8,6 @@ Introduction

The `pythainlp.augment` module is a powerful toolset for text augmentation in the Thai language. Text augmentation is a process that enriches and diversifies textual data by generating alternative versions of the original text. This module is a valuable resource for improving the quality and variety of Thai language data for NLP tasks.

TextAugment Class
-----------------

The central component of the `pythainlp.augment` module is the `TextAugment` class. This class provides various text augmentation techniques and functions to enhance the diversity of your text data. It offers the following methods:

.. autoclass:: pythainlp.augment.TextAugment
:members:

WordNetAug Class
----------------
Expand Down
6 changes: 2 additions & 4 deletions docs/api/transliterate.rst
Original file line number Diff line number Diff line change
Expand Up @@ -37,15 +37,13 @@ Transliteration Engines

**thai2rom**

.. automodule:: pythainlp.transliterate.thai2rom.romanize
:members:
.. autofunction:: pythainlp.transliterate.thai2rom.romanize

The `thai2rom` engine specializes in transliterating Thai text into romanized form. It's particularly useful for rendering Thai words accurately in an English phonetic format.

**royin**

.. automodule:: pythainlp.transliterate.royin.romanize
:members:
.. autofunction:: pythainlp.transliterate.royin.romanize

The `royin` engine focuses on transliterating Thai text into English characters. It provides an alternative approach to transliteration, ensuring accurate representation of Thai words.

Expand Down
24 changes: 0 additions & 24 deletions docs/api/word_vector.rst
Original file line number Diff line number Diff line change
Expand Up @@ -13,30 +13,6 @@ Before using this module, you need to ensure that the `numpy` and `gensim` libra

Modules
-------
.. autofunction:: doesnt_match
:noindex:

The `doesnt_match` function is designed to identify the word that does not match a set of words in terms of semantic similarity. It is useful for tasks like word sense disambiguation.

.. autofunction:: get_model
:noindex:

The `get_model` function allows you to load a pre-trained word vector model, which can then be used for various word vector operations. This function serves as the entry point for accessing pre-trained word vectors.

.. autofunction:: most_similar_cosmul
:noindex:

The `most_similar_cosmul` function finds words that are most similar to a given word in terms of cosine similarity. This function is useful for word analogy tasks and word similarity measurement.

.. autofunction:: sentence_vectorizer
:noindex:

The `sentence_vectorizer` function takes a sentence as input and returns a vector representation of the entire sentence based on word vectors. This is valuable for document similarity and text classification tasks.

.. autofunction:: similarity
:noindex:

The `similarity` function calculates the cosine similarity between two words based on their word vectors. It helps in measuring the semantic similarity between words.

.. autoclass:: WordVector
:members:
Expand Down
2 changes: 1 addition & 1 deletion pythainlp/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
# SPDX-FileCopyrightText: 2016-2025 PyThaiNLP Project
# SPDX-FileType: SOURCE
# SPDX-License-Identifier: Apache-2.0
__version__ = "5.1.0-beta2"
__version__ = "5.1.0"

thai_consonants = "กขฃคฅฆงจฉชซฌญฎฏฐฑฒณดตถทธนบปผฝพฟภมยรลวศษสหฬอฮ" # 44 chars

Expand Down
2 changes: 1 addition & 1 deletion setup.cfg
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
[bumpversion]
current_version = 5.1.0-beta2
current_version = 5.1.0
commit = True
tag = True
parse = (?P<major>\d+)\.(?P<minor>\d+)\.(?P<patch>\d+)(\-(?P<release>[a-z]+)(?P<build>\d+))?
Expand Down
2 changes: 1 addition & 1 deletion setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -150,7 +150,7 @@

setup(
name="pythainlp",
version="5.1.0-beta2",
version="5.1.0",
description="Thai Natural Language Processing library",
long_description=LONG_DESC,
long_description_content_type="text/markdown",
Expand Down
Loading