Smart CSV/Excel Assistant


Project Technical Information

Project Name:
Smart CSV/Excel Assistant
Project Type:
AI Data Analysis, CSV Processing, Natural Language Queries
Tech Stack:
Python 3.8+, Flask, Pandas, OpenPyXL, HTML/CSS/JS, DataTables, Gunicorn, Python-dotenv, Prompt Engineering, LLM, SLM
AI Features:
Mistral AI (Data Analysis), Natural Language Processing, Intelligent Data Insights, Pattern Detection, Outlier Analysis, Pandas Code Generation

Project Summary

An AI-powered data analysis tool that lets users explore CSV files through natural-language questions. This Flask web application allows users to upload CSV files, get comprehensive data summaries, and ask questions about their data in plain English. The system returns insights immediately, detects patterns and outliers automatically, and includes sample datasets for exploration.
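As an illustration of the data summary and outlier detection described above, here is a minimal sketch in Pandas. The function names `summarize` and `iqr_outliers`, the choice of the interquartile-range rule, and the sample data are assumptions made for this example; the project may detect outliers differently.

```python
import pandas as pd


def summarize(df: pd.DataFrame) -> dict:
    """Collect basic facts about a DataFrame: shape, dtypes, missing values."""
    return {
        "rows": int(df.shape[0]),
        "columns": int(df.shape[1]),
        "dtypes": {col: str(dtype) for col, dtype in df.dtypes.items()},
        "missing": {col: int(n) for col, n in df.isna().sum().items()},
    }


def iqr_outliers(series: pd.Series, k: float = 1.5) -> pd.Series:
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR].

    The IQR rule is an assumed technique, chosen here for simplicity.
    """
    q1 = series.quantile(0.25)
    q3 = series.quantile(0.75)
    iqr = q3 - q1
    mask = (series < q1 - k * iqr) | (series > q3 + k * iqr)
    return series[mask]


# Hypothetical sample data standing in for an uploaded CSV.
df = pd.DataFrame(
    {
        "price": [10, 12, 11, 13, 12, 95],
        "city": ["A", "B", None, "A", "B", "A"],
    }
)
summary = summarize(df)
outliers = iqr_outliers(df["price"])
```

A summary dictionary like this one is also a compact way to describe the dataset to a language model without sending every row.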

Skills Demonstrated

AI Data Analysis, Natural Language Processing, CSV File Processing, Flask Web Development, API Integration, Error Handling & Retry Logic, File Upload & Validation, Interactive Data Tables, Pandas Data Manipulation, Frontend JavaScript, Bootstrap UI Design, Security Implementation

Tools Used

Python 3.8+, Flask, Pandas, OpenPyXL, Requests, HTML/CSS/JS, DataTables, Mistral AI, Gunicorn, Python-dotenv, Prompt Engineering, LLM, SLM

Solution

The Smart CSV/Excel Assistant simplifies data analysis by providing an intuitive interface where users ask questions about their data in natural language. The system processes uploaded CSV files, generates comprehensive statistics, and uses Mistral AI to provide insights. API calls are protected by error handling with exponential backoff and API key rotation, and uploads are guarded by security features such as file size limits and validation.
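The retry behaviour mentioned above can be sketched as follows. This is a minimal example under stated assumptions, not the project's actual code: `TransientError`, `call_with_retry`, the attempt count, and the base delay are all hypothetical, and `request_fn` stands in for whatever function sends the request to the AI API.

```python
import itertools
import time
from typing import Callable, Sequence


class TransientError(Exception):
    """Hypothetical marker for retryable failures (rate limits, timeouts)."""


def call_with_retry(
    request_fn: Callable[[str], str],
    api_keys: Sequence[str],
    max_attempts: int = 4,        # assumed value
    base_delay: float = 1.0,      # assumed value, in seconds
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call request_fn(key), rotating keys and backing off between failures."""
    if not api_keys:
        raise ValueError("at least one API key is required")
    keys = itertools.cycle(api_keys)  # round-robin key rotation
    last_error = None
    for attempt in range(max_attempts):
        key = next(keys)
        try:
            return request_fn(key)
        except TransientError as exc:
            last_error = exc
            # Wait 1x, 2x, 4x, ... the base delay, but not after the final try.
            if attempt < max_attempts - 1:
                sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"all {max_attempts} attempts failed") from last_error
```

Passing `sleep` as a parameter keeps the function easy to test without real delays. In production, adding a small random jitter to each delay is a common refinement.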

Approach

  1. Frontend: Built a responsive Flask web interface with Bootstrap for file uploads and data display.
  2. File Processing: Implemented secure CSV upload with validation, size limits, and automatic cleanup.
  3. Data Analysis: Used Pandas for comprehensive data statistics, shape analysis, and sample generation.
  4. AI Integration: Integrated the Mistral AI API with retry logic, exponential backoff, and rotation across multiple API keys.
  5. Code Generation: The AI model generates pandas code, which the application executes to return live results for user queries.
  6. Interactive Tables: Added DataTables for searchable, sortable data presentation with pagination.
  7. Sample Datasets: Included real-world CSV samples from GitHub repositories for exploration.
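The upload checks in step 2 might look roughly like the sketch below. The allowed extension set, the 16 MB cap, and the helper names `validate_upload` and `cleanup` are assumptions for illustration; the project's actual limits and implementation are not stated here.

```python
import os
import uuid

ALLOWED_EXTENSIONS = {".csv"}            # assumed: only CSV uploads accepted
MAX_UPLOAD_BYTES = 16 * 1024 * 1024      # assumed 16 MB size limit


def validate_upload(filename: str, size_bytes: int) -> str:
    """Check an upload and return a safe storage name, or raise ValueError."""
    ext = os.path.splitext(filename)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        raise ValueError(f"unsupported file type: {ext or 'none'}")
    if size_bytes <= 0:
        raise ValueError("file is empty")
    if size_bytes > MAX_UPLOAD_BYTES:
        raise ValueError("file exceeds the size limit")
    # A random name avoids trusting path components in the client's filename.
    return f"{uuid.uuid4().hex}{ext}"


def cleanup(path: str) -> None:
    """Delete a processed upload; a file that is already gone is not an error."""
    try:
        os.remove(path)
    except FileNotFoundError:
        pass
```

In a Flask application, a request-level cap can also be set through the `MAX_CONTENT_LENGTH` configuration key, so oversized uploads are rejected before they reach this check.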

Project Link(s)

Designed and Developed by Aradhya Pavan H S