I am a passionate Data Scientist pursuing my Masterβs in Data Science at Florida Atlantic University (GPA: 3.95/4.0). My work spans data analytics, machine learning, and AI, with a focus on educational technology and natural language processing. I have a strong background in building scalable data pipelines, fine-tuning large language models, and developing AI-driven solutions.
- Programming & Scripting: Python, PySpark, SQL, MS Powershell, GIT
- Data Management & Annotation: JSON, YAML, Azure Data Factory, Azure DevOps
- Data Analytics: Pandas, NumPy, Power BI, Databricks, Jupyter Notebook, Google Colab
- Data Modeling: Star Schema, Snowflake Schema
- Databases: SQL and NoSQL
- Data Visualization & BI: Power BI, Tableau
- AI & ML: Machine Learning, Deep Learning, LLM Training, Prompt Engineering
- Thesis (2023 - Present): Developed the "PhysmolLM" language model for enhancing physics education, focusing on custom datasets and instruction-based fine-tuning.
- Graduate Research Assistant (Feb 2024 - Aug 2024): Conducted sentiment analysis, trend analysis, and topic modeling on social media data to assess attitudes toward generative AI in education.
- Data Engineer at Accenture (Aug 2021 - July 2023): Built scalable ETL pipelines, optimized data flows in Azure, and developed data migration strategies.
- Data Engineer at Infosys (Mar 2019 - July 2021): Created ETL pipelines, designed data marts, and engineered data quality checks for large-scale data management.
- Winner (3rd Place), Responsible AI Hackathon: Developed the Academic Responsible Assistant (ARA) to proactively stop academic cheating using RAG architecture.
- President, PACE Club: Led initiatives for professional growth, organized guest lectures, and promoted networking opportunities.
- Prestige Graduate Leadership Award: Recognized for outstanding leadership and contributions to the university community.
- LinkedIn: Akhil Vallala
- GitHub: akhilfau
- Email: avallala2023@fau.edu
Β© 2025 Akhil Vallala. All Rights Reserved.


