This web application allows users to upload local Parquet files or fetch remote Parquet files, and displays the columns and sample data from these files.
- Upload local Parquet files
- Fetch remote Parquet files via URL
- Display Parquet file schema (columns and their types)
- Show sample data from the Parquet file
- Responsive web interface
- FastAPI
- Python 3.12
- pyarrow
- pandas
- React
- Material-UI
- Axios
-
Navigate to the backend directory:
cd backend
-
Create a virtual environment:
python3 -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate`
-
Install dependencies:
pip install -r requirements.txt
-
Run the backend server:
uvicorn main:app --reload
The backend will be available at
http://localhost:8000
.
-
Navigate to the frontend directory:
cd frontend
-
Install dependencies:
npm install
-
Start the development server:
npm start
The frontend will be available at
http://localhost:3000
.
- Open your web browser and go to
http://localhost:3000
. - To upload a local Parquet file, click on "Choose File" under "Upload Local Parquet File".
- To fetch a remote Parquet file, enter the URL in the input field under "Fetch Parquet File from URL" and click "Fetch File".
- The application will display the file's schema and a sample of the data.
POST /api/upload
: Upload a local Parquet filePOST /api/fetch
: Fetch a remote Parquet file
Contributions are welcome! Please feel free to submit a Pull Request.
This project is open source and available under the MIT License.