-
Notifications
You must be signed in to change notification settings - Fork 394
[importer] Introduce new Importer module with separate Configs, API Endpoints, and Dependencies #4089
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
[importer] Introduce new Importer module with separate Configs, API Endpoints, and Dependencies #4089
Changes from all commits
Commits
Show all changes
29 commits
Select commit
Hold shift + click to select a range
75defec
[importer] Add new component and API endpoint with new directory stru…
Harshg999 935ca5d
[importer] Implement file upload API for CSV and Excel formats with v…
Harshg999 1ce2856
Refactor importer API: remove unused import and delete obsolete templ…
Harshg999 c20623e
Refactors file format detection and metadata extraction
Harshg999 db2a4ec
Add file metadata detection and update dependencies
Harshg999 6ba570f
Refactors file upload API for better separation of concerns
Harshg999 5552e78
Refactor file metadata detection API and improve efficiency
Harshg999 b901255
Improves file metadata extraction and error handling
Harshg999 48f07aa
Improves file type detection with graceful magic lib fallback
Harshg999 c7a2a11
Adds file preview API for data import functionality
Harshg999 4810c36
Add coerce_bool handling for has_header parameter
Harshg999 c9a2760
Enhance field separator handling in preview_file API with unicode dec…
Harshg999 880fbb5
Refactor file metadata handling by introducing GuessFileMetadataSeria…
Harshg999 e1af698
Refactor file preview functionality by introducing PreviewFileSeriali…
Harshg999 70e7fd7
Add has_header field to PreviewFileSerializer for explicit header det…
Harshg999 307fe3f
Uncomment old code
Harshg999 c312738
Change variable name to uploaded_file
Harshg999 3732baf
fix docstring
Harshg999 4d163db
Remove unnecessary blank line before api_error_handler function
Harshg999 06b5d0a
Refactor comments
Harshg999 323fd19
Add configurable importer restrictions and settings
Harshg999 1d4ff77
Improves Excel sheet name extraction performance
Harshg999 6baa522
Add API and logic for file header row detection
Harshg999 590f772
Add API for mapping Polars types to SQL types
Harshg999 708e75a
Sort req packages
Harshg999 5a9832b
Add unit tests
Harshg999 e235b65
Refactor import statements for better organization and clarity
Harshg999 dba5f8f
Remove redundant validate methods from GuessFileMetadataSerializer an…
Harshg999 a023370
Change "data file importer" to just "importer"
Harshg999 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
#!/usr/bin/env python | ||
Harshg999 marked this conversation as resolved.
Show resolved
Hide resolved
|
||
# Licensed to Cloudera, Inc. under one | ||
# or more contributor license agreements. See the NOTICE file | ||
# distributed with this work for additional information | ||
# regarding copyright ownership. Cloudera, Inc. licenses this file | ||
# to you under the Apache License, Version 2.0 (the | ||
# "License"); you may not use this file except in compliance | ||
# with the License. You may obtain a copy of the License at | ||
# | ||
# http://www.apache.org/licenses/LICENSE-2.0 | ||
# | ||
# Unless required by applicable law or agreed to in writing, software | ||
# distributed under the License is distributed on an "AS IS" BASIS, | ||
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.