
Exploratory Data Analysis using LLM and LangChain

In this tutorial, we will explore how to use a large language model (LLM) to perform exploratory data analysis (EDA), an important step in developing machine learning models.

We will use an OpenAI model as the LLM and the LangChain framework to orchestrate the prompts.


EDA steps to be explored using the LLM:

  • Getting metadata about the dataset
  • Finding missing values
  • Univariate analysis: probability distribution, histogram, box plot and outlier analysis
  • Hypothesis testing (t-test to compare the means of two groups)
  • Bivariate analysis: scatter plot, correlation analysis and heatmap
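
To make the steps above concrete, here is a minimal sketch of the computations behind them, written with only the Python standard library. It is not the LangChain workflow this tutorial builds; it only illustrates the kind of result the LLM is asked to produce. The toy dataset and every function name (`metadata`, `missing_counts`, `iqr_outliers`, `welch_t`, `pearson`) are hypothetical, and the plotting steps (histogram, box plot, scatter plot, heatmap) are left out.

```python
import math
import statistics

# Hypothetical toy dataset: one dict per row, None marks a missing value.
rows = [
    {"age": 25, "income": 40.0, "group": "a"},
    {"age": 32, "income": 52.0, "group": "a"},
    {"age": 47, "income": None, "group": "b"},
    {"age": 51, "income": 88.0, "group": "b"},
    {"age": 38, "income": 61.0, "group": "a"},
    {"age": 29, "income": 45.0, "group": "b"},
    {"age": 95, "income": 70.0, "group": "b"},
]


def metadata(rows):
    """Map each column name to the set of Python type names seen in it."""
    cols = {}
    for row in rows:
        for name, value in row.items():
            cols.setdefault(name, set())
            if value is not None:
                cols[name].add(type(value).__name__)
    return cols


def missing_counts(rows):
    """Count None values per column."""
    counts = {}
    for row in rows:
        for name, value in row.items():
            counts[name] = counts.get(name, 0) + (value is None)
    return counts


def iqr_outliers(values):
    """Return values outside the 1.5 * IQR fences (the usual box-plot rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]


def welch_t(a, b):
    """Welch's t statistic for the difference in means of two samples."""
    var_a, var_b = statistics.variance(a), statistics.variance(b)
    std_err = math.sqrt(var_a / len(a) + var_b / len(b))
    return (statistics.mean(a) - statistics.mean(b)) / std_err


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    mean_x, mean_y = statistics.mean(x), statistics.mean(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


if __name__ == "__main__":
    ages = [row["age"] for row in rows]
    complete = [row for row in rows if row["income"] is not None]
    print("metadata:", metadata(rows))
    print("missing:", missing_counts(rows))
    print("age outliers:", iqr_outliers(ages))
    print("t (age, group a vs b):", welch_t(
        [r["age"] for r in rows if r["group"] == "a"],
        [r["age"] for r in rows if r["group"] == "b"],
    ))
    print("corr(age, income):", pearson(
        [r["age"] for r in complete], [r["income"] for r in complete]
    ))
```

In practice a library such as pandas or SciPy would handle these calculations; the plain-Python version is used here only so that each step is visible. Note that `welch_t` returns the test statistic only, not a p-value.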

