About me

Hi. I am an aspiring data scientist.

my_dict = {
    "name": "Kathy Tran",
    "Passion": "Data Science",
    "Projects":
}

My skills

  • visualization icon

    Generative AI with RAG and LLM

    - Develop Retrieval-Augmented Generation (RAG) pipelines that combine vector databases and LLMs to deliver context-aware, domain-specific answers.
    - Integrate open-source LLMs (e.g., LLaMA 3, Mistral) or API-based models (e.g., GPT-4o) into applications for conversational AI, document Q&A, and knowledge assistants.

  • machine learning icon

    Machine Learning

    - Utilize predictive modeling techniques to identify patterns for data-driven decisions using Python Scikit-learn

    - Classification: Logistic regression, K-nearest neighbors, Decision Tree, Random Forest, Gradient Boosting Regression Trees

    - Quantitative: Linear regression

    - Unsupervised: K Means clustering

  • data collection icon

    Data collection

    - Utilize APIs to collect and aggregate data for analysis.

    - Perform web scraping with Python BeautifulSoup and Selenium WebDriver to gather information

  • Data wrangling

    Data wrangling

    - Clean and organize raw data with Python Pandas and Numpy, ensuring high-quality datasets for analysis.

    - Utilize SciPy for solving linear equations & statistical analysis to preprocess complex datasets.

  • database icon

    Database management

    - Manage and maintain databases using MySQL, PostgreSQL, or MongoDB (NoSQL)

    - Design databases with Entity-relationship Diagrams (ERDs) & logical modeling

    - Migrate from on-premises / local data centers to the Cloud with AWS, Microsoft Azure or Google Cloud

  • visualization icon

    Data Visualization

    - Create visualizations with Python Matplotlib and Seaborn

    - Develop interactive dashboards with Tableau & Power BI to effectively communicate insights.

Portfolio

Achievements

Awards

Scroll right to see more

  • KTHack

    Second Place Overall - KT Hack

    Data Engineer
  • Hack RU

    Best Sustainability University Hack - Hack RU

    Data Engineer
  • Hack TCNJ

    Best Dot Tech Domain Website - Hack TCNJ

    Machine Learning Engineer