CHANGELOG.md
+14 -2 (14 additions & 2 deletions)
@@ -5,14 +5,26 @@ SPDX-License-Identifier: MIT-0
5
5
6
6
## [Unreleased]
7
7
8
+
## [0.3.10]
9
+
10
+
### Added
11
+
12
+
- **Automatic Glue Table Creation for Document Sections**
13
+
- Added automatic creation of AWS Glue tables for each document section type (classification) during processing
14
+
- Tables are created dynamically when new section types are encountered, eliminating manual table creation
15
+
- Consistent lowercase naming convention for tables ensures compatibility with case-sensitive S3 paths
16
+
- Tables are configured with partition projection for efficient date-based queries without manual partition management
17
+
- Automatic schema evolution - tables update when new fields are detected in extraction results
18
+
19
+
20
+
## [0.3.9]
21
+
8
22
### Added
9
23
- **Optional Permissions Boundary Support for Enterprise Deployments**
10
24
- Added `PermissionsBoundaryArn` parameter to all CloudFormation templates for organizations with Service Control Policies (SCPs) requiring permissions boundaries
11
25
- Comprehensive support for both explicit IAM roles and implicit roles created by AWS SAM functions and state machines
12
26
- Conditional implementation ensures backward compatibility - when no permissions boundary is provided, roles deploy normally
13
27
14
-
## [0.3.8]
15
-
16
28
### Added
17
29
- IDP Configuration and Prompting Best Practices documentation [doc](./docs/idp-configuration-best-practices.md)
docs/reporting-database.md
+12 -2 (12 additions & 2 deletions)
@@ -107,11 +107,21 @@ The metering table is particularly valuable for:
107
107
108
108
## Document Sections Tables
109
109
110
-
The document sections tables store the actual extracted data from document sections in a structured format suitable for analytics. These tables are automatically discovered by AWS Glue Crawler and are organized by section type (classification).
110
+
The document sections tables store the actual extracted data from document sections in a structured format suitable for analytics. These tables are automatically created when new section types are encountered during document processing, eliminating the need for manual table creation.
111
+
112
+
### Automatic Table Creation
113
+
114
+
When a document is processed and a new section type (classification) is detected, the system automatically:
115
+
1. Creates a new Glue table for that section type (e.g., `document_sections_invoice`, `document_sections_receipt`, `document_sections_w2`)
116
+
2. Configures the table with appropriate schema based on the extracted data
117
+
3. Sets up partition projection for efficient date-based queries
118
+
4. Updates the table schema if new fields are detected in subsequent documents
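
The four steps above can be sketched roughly as follows. This is an illustrative sketch, not the project's actual implementation: the function names `build_table_input` and `merge_columns`, the Parquet format, the `date` partition column, and the projection range are all assumptions. Only the `document_sections_<type>` table name and the `document_sections/<type>/` path come from the documentation. The `projection.*` and `storage.location.template` keys are standard Athena partition projection table properties.

```python
from typing import Dict, List


def build_table_input(section_type: str, columns: Dict[str, str], bucket: str) -> dict:
    """Build a Glue TableInput dict for one section type (hypothetical helper)."""
    name = section_type.lower()  # lowercase to match the S3 path
    location = f"s3://{bucket}/document_sections/{name}/"
    return {
        "Name": f"document_sections_{name}",
        "TableType": "EXTERNAL_TABLE",
        # Assumption: a single string partition column called "date".
        "PartitionKeys": [{"Name": "date", "Type": "string"}],
        "Parameters": {
            "classification": "parquet",  # assumption: data is stored as Parquet
            "projection.enabled": "true",
            "projection.date.type": "date",
            "projection.date.format": "yyyy-MM-dd",
            "projection.date.range": "2024-01-01,NOW",  # assumed start date
            "storage.location.template": location + "date=${date}/",
        },
        "StorageDescriptor": {
            "Columns": [{"Name": n, "Type": t} for n, t in sorted(columns.items())],
            "Location": location,
            "InputFormat": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat",
            "OutputFormat": "org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat",
            "SerdeInfo": {
                "SerializationLibrary": "org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe"
            },
        },
    }


def merge_columns(existing: List[dict], detected: Dict[str, str]) -> List[dict]:
    """Append newly detected fields to the existing column list (step 4)."""
    known = {c["Name"] for c in existing}
    added = [{"Name": n, "Type": t} for n, t in sorted(detected.items()) if n not in known]
    return existing + added


# With boto3, the dict would be passed along the lines of:
#   glue = boto3.client("glue")
#   glue.create_table(DatabaseName=db, TableInput=table_input)
#   glue.update_table(DatabaseName=db, TableInput=updated_table_input)
```

With partition projection enabled, Athena computes partition locations from the table properties at query time, which is why no per-day partition registration is needed.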
119
+
120
+
**Important:** Section type names are normalized to lowercase for consistency with case-sensitive S3 paths. For example, a section classified as "W2" will create a table named `document_sections_w2` with data stored in `document_sections/w2/`.
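
As a small illustration of the naming rule, the mapping from a classification to its table name and S3 prefix might look like the sketch below. The helper names are hypothetical, and only lowercasing is shown because that is the only normalization the documentation describes.

```python
def section_table_name(section_type: str) -> str:
    """Glue table name for a section type, e.g. "W2" -> "document_sections_w2"."""
    return f"document_sections_{section_type.lower()}"


def section_s3_prefix(section_type: str) -> str:
    """S3 key prefix for a section type, e.g. "W2" -> "document_sections/w2/"."""
    return f"document_sections/{section_type.lower()}/"


# Classifications that differ only by case resolve to the same table and
# prefix, so data is not split across "W2/" and "w2/" paths.
same_table = section_table_name("W2") == section_table_name("w2")
```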
111
121
112
122
### Dynamic Section Tables
113
123
114
-
Document sections are stored in dynamically created tables based on the section classification. Each section type gets its own table (e.g., `document_sections_invoice`, `document_sections_receipt`, `document_sections_bank_statement`, etc.) with the following characteristics:
124
+
Document sections are stored in dynamically created tables based on the section classification. Each section type gets its own table with the following characteristics: