Commit 083f487
Staging To Main (#495)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* Dev (#433)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
* DEV to STAGING (#461)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* DEV to STAGING (#462)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* added upload api
* changed the dropzone error message
* Dev to staging (#466)
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* recent merges
* pdf deletion due to out of diskspace
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* Convert is_cancelled value from string to bool
* added the default page size
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* offset in chunks (#389)
* page number in gcs loader (#393)
* added youtube timestamps (#392)
* chat pop up button (#387)
* expand
* minimize-icon
* css changes
* chat history
* chatbot wider Side Nav
* expand icon
* chatbot UI
* Delete
* merge fixes
* code suggestions
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* chunks create before extraction using is_pre_process variable (#383)
* chunks create before extraction using is_pre_process variable
* Return total pages for Model
* update requirement.txt
* total pages on uplaod API
* added the Confirmation Dialog
* added the selected files into the confirmation modal
* format and lint fixes
* added the stop watch image
* fileselection on alert dialog
* Add timeout in docker for gunicorn workers
* Add cancel icon to info popup (#384)
* Info Modal Changes
* css changes
* recent merges
* Integration_qa test (#375)
* Test IntegrationQA added
* update test cases
* update test
* update node count assertions
* test changes
* update changes
* modification test
* Code refatctor test cases
* Handle allowedlist issue in test
* test changes
* update test
* test case execution
* test chatbot updates
* test case update file
* added file
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* fixed status blank issue
* Rendering the file name instead of link for gcs and s3 sources in the info modal
* added the default page size
* Convert is_cancelled value from string to bool
* Issue fixed Processed chunked as 0 when file re-process again
* Youtube timestamps (#386)
* Wikipedia source to accept all valid urls
* wikipedia url to support multiple languages
* integrated wiki langauge param for extract api
* Youtube video timestamps
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* groq llm integration backend (#286)
* groq llm integration backend
* groq and description in node properties
* added groq in options
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Save Total Pages in DB
* Added total Pages
* file selection when we didn't select anything from Main table
* added the danger icon only for large files
* added the overflow for more files and file selection for all new files
* moved the interface to types
* added the icon accoroding to the source
* set total page for wiki and youtube
* h3 heading
* merge
* updated the alert on basis if total pages
* deleted chunks
* polling based on total pages
* isNan check
* large file based on file size for s3 and gcs
* file source in server side event
* time calculation based on chunks for gcs and s3
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* fixed the layout issue
* Populate graph schema (#399)
* crreate new endpoint populate_graph_schema and update the query for getting lables from DB
* Added main.py changes
* conditionally-including-the-gcs-login-flow-in-gcs-as-source (#396)
* added the condtion
* removed llms
* Fixed issue : Remove extra unused param
* get emb only if used (#278)
* Chatbot chunks (#402)
* Added file name to the content sent to LLM
* added chunk text in the response
* increased the docs parts sent to llm
* Modified graph query
* mardown rendering
* youtube starttime
* icons
* offset changes
* removed the files due to codespace space issue
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user (#405)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* fixed css issue
* fixed status blank issue
* Modified response when no docs is retrived (#413)
* Fixed env/docker-compose for local deployments + README doc (#410)
* Fixed env/docker-compose for local deployments + README doc
* wrong place for ENV in README
* by default, removed langsmith + fixed knn score string to float
* by default, removed langsmith + fixed knn score string to float
* Fixed strings in docker-compose env
* Added requirements (neo4j 5.15 or later, APOC, and instructions for Neo4j Desktop)
* Missed the TIME_PER_PAGE env, was causing NaN issue in the approx time processing notification. fixed that
* Support for all unstructured files (#401)
* all unstructured files
* responsiveness
* added file type
* added the extensions
* spell mistake
* ppt file changes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Settings modal to support generating the labels from the llm by using text given by user with checkbox (#415)
* added the json
* added schema from text dialog
* integrated the schemaAPI
* added the alert
* resize fixes
* Extract schema using direct ChatOpenAI API and Chain
* integrated the checkbox for schema to text dialog
* Update SettingModal.tsx
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* gcs file content read via storage client (#417)
* gcs file content read via storage client
* added the access token the file state
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* pypdf2 to read files from gcs (#420)
* 407 remove driver from frontend (#416)
* removed driver
* removed API
* connecting to database on page refresh
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Css handling of info modal and Tooltips (#418)
* css change
* toolTips
* Sidebar Tooltips
* copy to clip
* css change
* added image types
* added gcs
* type fix
* docker changes
* speech
* added the toolip for dropzone sources
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed retrival bugs (#421)
* yarn format fixes
* changed the delete message
* added the cancel button
* changed the message on tooltip
* added space
* UI fixes
* tooltip for setting
* updated req
* wikipedia URL input (#424)
* accept only wikipedia links
* added wikipedia link
* added wikilink regex
* wikipedia single url only
* changed the alert message
* wording change
* pushed validation state persist error
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* speech and copy (#422)
* speech and copy
* startTime
* added chunk properties
* tooltips
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* Fixed issue for out of range in KNN API
* solved conflicts
* conflict solved
* Remove logging info from update KNN API
* tooltip changes
* format and lint fixes
* responsiveness changes
* Fixed issue for total pages GCS, S3
* UI polishing (#428)
* button and tooltip changes
* checking validation on change
* settings module populate fix
* format fixes
* opening the modal after auth success
* removed the limit
* added the scrobar for dropdowns
* speech state (#426)
* speech state
* Button Details changes
* delete wording change
* Total pages in buckets (#431)
* page number NA for buckets
* added N/A for gcs and s3 pages
* total pages for gcs
* remove unwanted logger
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* removed the max width
* Update FileTable.tsx
* Update the docker file
* Modified prompt (#438)
* Update Dockerfile
* Update Dockerfile
* Update Dockerfile
* rendering Fix
* Local file upload gcs (#442)
* Uplaod file to GCS
* GCS local upload fixed issue and delete file from GCS after processing and failed or cancelled
* Add life cycle rule on uploaded bucket
* pdf upload local and gcs bucket check
* delete files when processed and extract changes
---------
Co-authored-by: Pravesh Kumar <121786590+praveshkumar1988@users.noreply.github.com>
* Modified chat length and entities used (#443)
* metadata for unstructured files (#446)
* Unstructured file metadata (#447)
* metadata for unstructured files
* sleep in gcs upload
* updated
* icons added to chunks (#435)
* icons added to chunks
* info modal icons
* fixed gcs status message issue
* added if check for failed count
* Null issue Fixed from backend for upload API and graph_document when model name mismatch
* added word break issue
* Added neo4j-rust-ext
* processing time estimation based on bytes
* File extension upper case fixed, File delete from GCS or local based on env variable.
* timer per byte
* Update Dockerfile
* Adding sort rows on the table (#451)
* Gcs upload folder hashed (#453)
* implement foldername hashed in GCS bucket uplaod
* Raise exception if invalid model selected
* folder name for gcs upload
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* upload all unstructuredfiles to gcs (#455)
* Mofified chunk query (#454)
* Added libre office for fixing error -- soffice command was not found. Please install libreoffice
on your system and try again.
- Install instructions: https://www.libreoffice.org/get-help/install-howto/
- Mac: https://formulae.brew.sh/cask/libreoffice
- Debian: https://wiki.debian.org/LibreOffice"
* Fix the PARTIAL CONTENT issue
* File-table no data found (#456)
* 'file-table''
* review comment
* Llm format change (#459)
* changed the llm models format to lowercase
* added the error message
* llm model changes
* format fixes
* removed unused import
* added the capitalize method
* delete files from merged_file_path only if source is local file
---------
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
* commented total page code (#460)
* format fixes
* removed the disabled check on dropdown
* Large file env
* added upload api
* changed the dropzone error message
---------
Co-authored-by: abhishekkumar-27 <164544129+abhishekkumar-27@users.noreply.github.com>
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
Co-authored-by: aashipandya <156318202+aashipandya@users.noreply.github.com>
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
Co-authored-by: Prakriti Solankey <156313631+prakriti-solankey@users.noreply.github.com>
Co-authored-by: Ajay Meena <meenajy1996@gmail.com>
Co-authored-by: Morgan Senechal <morgan@neo4j.com>
Co-authored-by: karanchellani <142801957+karanchellani@users.noreply.github.com>
* format fixes
* Close connect when graph object is not none
* Call garbage collector to release the menory
* Change error message
* Added driver config as user_agent
* Updated doc for the LLM_MODELS and GCS_FILE_CACHE (#473)
* Web URLs are user input (#475)
* web url support backend
* added the tabs for input source
* user agent added for Neo4jGraph connection
* Tab view for sources
* extract handling for web ur's
* initial input handling
* chunk creation before processing
* code structure
* format fixes
---------
Co-authored-by: kartikpersistent <101251502+kartikpersistent@users.noreply.github.com>
* changed the regex for web and cancel button naming
* changed the schema dropdown type
* readme updates
* PROD version fix
* changed the alert message for gcs
* Delete unconnected entities from DB (#482)
* 457 add schema before generate graph (#478)
* schema setting from generate graph
* changes
* changes
* badge changes
* bug fix
* Fulltext index and Update similarity graph (#479)
* added full_text index
* added one common function for post_processing
* post processing api
* added tasks param
* modifed logging
* post processing changes
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
* Graph and vector search (#485)
* Modified the retrival query
* added the chatmode toggle component
* Modified to vector search
* Moved the templates to constants
* added the icons
* added chat modes
* code structure changes
* Intergrated the API changges
* Modified retrieval queries,refactored code
* API integration changes
* added the score
* order change
* wording change
* modified constants
* added graph+vector
* added the tooltips
* Modified query
* removed the graph mode
* tooltip camel Case
* added the icon and extern link for web source in the info modal
* added the youtube link in the source used tab
* format fixes
* added the hoverable link
---------
Co-authored-by: vasanthasaikalluri <165021735+vasanthasaikalluri@users.noreply.github.com>
* Update InfoModal.tsx
* removed hover from chunks
* removed page number
* removed page number
* removed pag…1 parent 02ee9b2 commit 083f487
File tree
94 files changed
+3503
-1374
lines changed- POC_Documents/V1
- backend
- src
- document_sources
- docs
- backend
- frontend
- frontend
- src
- HOC
- assets/images
- components
- ChatBot
- Info
- DataSources
- AWS
- GCS
- Local
- Graph
- Layout
- Popups
- ConnectionModal
- DeletePopUpForOrphanNodes
- DeletePopUp
- LargeFilePopUp
- Settings
- UI
- WebSources
- Web
- WikiPedia
- Youtube
- context
- hooks
- services
- utils
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
94 files changed
+3503
-1374
lines changedBinary file not shown.
Binary file not shown.
Binary file not shown.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
2 | 1 | | |
3 | | - | |
4 | | - | |
5 | 2 | | |
6 | | - | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
7 | 24 | | |
8 | 25 | | |
9 | 26 | | |
10 | 27 | | |
11 | 28 | | |
12 | | - | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
13 | 32 | | |
14 | 33 | | |
15 | 34 | | |
| |||
21 | 40 | | |
22 | 41 | | |
23 | 42 | | |
24 | | - | |
| 43 | + | |
25 | 44 | | |
26 | 45 | | |
27 | 46 | | |
28 | 47 | | |
29 | 48 | | |
30 | | - | |
| 49 | + | |
31 | 50 | | |
32 | 51 | | |
33 | 52 | | |
| |||
36 | 55 | | |
37 | 56 | | |
38 | 57 | | |
39 | | - | |
| 58 | + | |
40 | 59 | | |
41 | | - | |
| 60 | + | |
42 | 61 | | |
43 | | - | |
| 62 | + | |
44 | 63 | | |
45 | 64 | | |
46 | 65 | | |
47 | 66 | | |
48 | | - | |
| 67 | + | |
49 | 68 | | |
50 | 69 | | |
51 | 70 | | |
| |||
76 | 95 | | |
77 | 96 | | |
78 | 97 | | |
79 | | - | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
80 | 116 | | |
81 | 117 | | |
82 | 118 | | |
| |||
86 | 122 | | |
87 | 123 | | |
88 | 124 | | |
89 | | - | |
| 125 | + | |
90 | 126 | | |
91 | 127 | | |
92 | 128 | | |
| |||
98 | 134 | | |
99 | 135 | | |
100 | 136 | | |
101 | | - | |
| 137 | + | |
102 | 138 | | |
103 | 139 | | |
104 | | - | |
| 140 | + | |
105 | 141 | | |
| 142 | + | |
106 | 143 | | |
107 | 144 | | |
108 | | - | |
109 | | - | |
110 | | - | |
111 | | - | |
112 | | - | |
113 | | - | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
136 | | - | |
137 | | - | |
138 | | - | |
139 | | - | |
140 | | - | |
141 | | - | |
142 | | - | |
143 | | - | |
144 | | - | |
145 | | - | |
146 | | - | |
147 | | - | |
148 | | - | |
149 | | - | |
150 | | - | |
151 | | - | |
152 | | - | |
153 | | - | |
154 | | - | |
155 | | - | |
156 | | - | |
157 | | - | |
158 | | - | |
159 | | - | |
160 | | - | |
161 | | - | |
162 | | - | |
163 | | - | |
164 | | - | |
165 | | - | |
166 | | - | |
167 | | - | |
168 | | - | |
169 | | - | |
170 | | - | |
171 | | - | |
172 | | - | |
173 | | - | |
174 | | - | |
175 | | - | |
176 | | - | |
177 | | - | |
178 | | - | |
| 145 | + | |
| 146 | + | |
| 147 | + | |
| 148 | + | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
179 | 154 | | |
180 | 155 | | |
181 | | - | |
182 | | - | |
183 | 156 | | |
| 157 | + | |
| 158 | + | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
20 | 20 | | |
21 | 21 | | |
22 | 22 | | |
23 | | - | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
0 commit comments