You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Merge branch 'doc/classifcation.md' into 'develop'
Added limitations of Text-Based Holistic Classification in the classification.md file
See merge request genaiic-reusable-assets/engagement-artifacts/genaiic-idp-accelerator!234
Copy file name to clipboardExpand all lines: docs/classification.md
+12Lines changed: 12 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -67,6 +67,18 @@ classification:
67
67
</document-text>
68
68
```
69
69
70
+
## Limitations of Text-Based Holistic Classification
71
+
72
+
Despite its strengths in handling full-document context, this method has several limitations:
73
+
74
+
**Context & Model Constraints:**:
75
+
- Long documents can exceed the context window of smaller models, resulting in request failure.
76
+
- Lengthy inputs may dilute the model’s focus, leading to inaccurate or inconsistent classifications.
77
+
- Requires high-context models such as Amazon Nova Premier, which supports up to 1 million tokens. Smaller models are not suitable for this method.
78
+
- For more details on supported models and their context limits, refer to the [Amazon Bedrock Supported Models documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html).
79
+
80
+
**Scalability Challenges**: Not ideal for very large or visually complex document sets. In such cases, the Multi-Modal Page-Level Classification method is more appropriate.
81
+
70
82
#### MultiModal Page-Level Classification with Few-Shot Examples
71
83
72
84
- Classifies each page independently using both text and image data
0 commit comments