Skip to content

Commit c43dc97

Browse files
committed
Merge branch 'doc/classifcation.md' into 'develop'
Added limitations of Text-Based Holistic Classification in the classification.md file See merge request genaiic-reusable-assets/engagement-artifacts/genaiic-idp-accelerator!234
2 parents f6e42aa + 3165907 commit c43dc97

File tree

1 file changed

+12
-0
lines changed

1 file changed

+12
-0
lines changed

docs/classification.md

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -67,6 +67,18 @@ classification:
6767
</document-text>
6868
```
6969
70+
## Limitations of Text-Based Holistic Classification
71+
72+
Despite its strengths in handling full-document context, this method has several limitations:
73+
74+
**Context & Model Constraints:**:
75+
- Long documents can exceed the context window of smaller models, resulting in request failure.
76+
- Lengthy inputs may dilute the model’s focus, leading to inaccurate or inconsistent classifications.
77+
- Requires high-context models such as Amazon Nova Premier, which supports up to 1 million tokens. Smaller models are not suitable for this method.
78+
- For more details on supported models and their context limits, refer to the [Amazon Bedrock Supported Models documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html).
79+
80+
**Scalability Challenges**: Not ideal for very large or visually complex document sets. In such cases, the Multi-Modal Page-Level Classification method is more appropriate.
81+
7082
#### MultiModal Page-Level Classification with Few-Shot Examples
7183
7284
- Classifies each page independently using both text and image data

0 commit comments

Comments
 (0)