Pandas Fundamentals in 40 Minutes

Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.

45 min · Reviewed 2026

Pandas Is the Table API

Pandas was created in 2008 by Wes McKinney at a hedge fund. Today it is the default Python library for tabular data, downloaded over 100 million times per month. Its two main types are Series (a single column) and DataFrame (a table).

Ten verbs you will use constantly

import pandas as pd # 1. Load df = pd.read_csv('data.csv') # 2. Peek df.head() df.info() df.describe() # 3. Select columns df['age'] # one column (Series) df[['age', 'income']] # multiple columns (DataFrame) # 4. Filter rows df[df['age'] > 18] df[(df['age'] > 18) & (df['country'] == 'US')] # 5. Sort df.sort_values('income', ascending=False) # 6. Create columns df['income_per_age'] = df['income'] / df['age'] # 7. Group and aggregate df.groupby('country')['income'].mean() df.groupby(['country', 'gender']).agg({ 'income': ['mean', 'median'], 'age': 'mean' }) # 8. Join tables merged = pd.merge(df, other_df, on='user_id', how='left') # 9. Pivot pd.pivot_table(df, index='country', columns='year', values='income') # 10. Save df.to_csv('clean.csv', index=False) df.to_parquet('clean.parquet')The ten most important pandas operations

Indexing: the most confusing part

# .loc uses labels df.loc[5] # row with index label 5 df.loc[df['age'] > 18, 'name'] # name column, filtered rows # .iloc uses positions df.iloc[5] # 6th row regardless of index label df.iloc[:10, :3] # first 10 rows, first 3 cols # Chained assignment is a trap # df[df.age > 18]['score'] = 100 # DO NOT DO THIS df.loc[df.age > 18, 'score'] = 100 # CORRECTCorrect indexing patterns

Common patterns worth memorizing

# Top N per group top3 = df.groupby('country').apply( lambda g: g.nlargest(3, 'income') ).reset_index(drop=True) # Rolling stats df['7d_avg'] = df['sales'].rolling(window=7).mean() # Replace based on mapping df['country'] = df['country'].replace({'USA': 'US', 'U.S.A.': 'US'}) # One-hot encoding df_encoded = pd.get_dummies(df, columns=['color']) # Handle dates df['date'] = pd.to_datetime(df['date']) df['day_of_week'] = df['date'].dt.day_name()Patterns you will use every week

The big idea: pandas rewards the ten verbs you use 90 percent of the time. Master those before chasing fancier features, and the other 10 percent will come naturally when you need it.

End-of-lesson check

6 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-data-pandas-fundamentals

What is the main idea of "Pandas Fundamentals in 40 Minutes"?
1. Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.
2. Use AI as the final authority for the whole decision
3. Avoid checking the answer once it sounds polished
4. Focus only on speed instead of judgment
Which concept is most central to "Pandas Fundamentals in 40 Minutes"?
1. DataFrame
2. pandas
3. Series
4. indexing
What should a careful learner remember about "The SettingWithCopyWarning"?
1. Use "The SettingWithCopyWarning" as a reminder to verify the AI output before anyone relies on it.
2. Skip the context so the tool can guess faster
3. Treat the output as private even after sharing it online
4. Use the answer without checking the source
You want to use AI after this lesson. What is the safest next step?
1. Act immediately because the AI answer is written clearly
2. Use AI for drafting and comparison, but verify before publishing or relying on it.
3. Hide uncertainty so the final answer looks cleaner
4. Use private or sensitive details before checking permission
How should AI output about pandas be treated?
1. As proof that no other source is needed
2. As a replacement for context, consent, or expert review
3. As a draft or helper output that still needs human judgment and verification
4. As something that becomes correct when it sounds confident
Name one way to verify an AI answer about pandas.

← Back to interactive lesson

Tendril · Creators · AI Foundations

Pandas Fundamentals in 40 Minutes

Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.

45 min · Reviewed 2026

Pandas Is the Table API

Ten verbs you will use constantly

import pandas as pd # 1. Load df = pd.read_csv('data.csv') # 2. Peek df.head() df.info() df.describe() # 3. Select columns df['age'] # one column (Series) df[['age', 'income']] # multiple columns (DataFrame) # 4. Filter rows df[df['age'] > 18] df[(df['age'] > 18) & (df['country'] == 'US')] # 5. Sort df.sort_values('income', ascending=False) # 6. Create columns df['income_per_age'] = df['income'] / df['age'] # 7. Group and aggregate df.groupby('country')['income'].mean() df.groupby(['country', 'gender']).agg({ 'income': ['mean', 'median'], 'age': 'mean' }) # 8. Join tables merged = pd.merge(df, other_df, on='user_id', how='left') # 9. Pivot pd.pivot_table(df, index='country', columns='year', values='income') # 10. Save df.to_csv('clean.csv', index=False) df.to_parquet('clean.parquet')The ten most important pandas operations

Indexing: the most confusing part

# .loc uses labels df.loc[5] # row with index label 5 df.loc[df['age'] > 18, 'name'] # name column, filtered rows # .iloc uses positions df.iloc[5] # 6th row regardless of index label df.iloc[:10, :3] # first 10 rows, first 3 cols # Chained assignment is a trap # df[df.age > 18]['score'] = 100 # DO NOT DO THIS df.loc[df.age > 18, 'score'] = 100 # CORRECTCorrect indexing patterns

Common patterns worth memorizing

# Top N per group top3 = df.groupby('country').apply( lambda g: g.nlargest(3, 'income') ).reset_index(drop=True) # Rolling stats df['7d_avg'] = df['sales'].rolling(window=7).mean() # Replace based on mapping df['country'] = df['country'].replace({'USA': 'US', 'U.S.A.': 'US'}) # One-hot encoding df_encoded = pd.get_dummies(df, columns=['color']) # Handle dates df['date'] = pd.to_datetime(df['date']) df['day_of_week'] = df['date'].dt.day_name()Patterns you will use every week

The big idea: pandas rewards the ten verbs you use 90 percent of the time. Master those before chasing fancier features, and the other 10 percent will come naturally when you need it.

End-of-lesson check

6 questions · take it digitally for instant feedback at tendril.neural-forge.io/learn/quiz/end-data-pandas-fundamentals

What is the main idea of "Pandas Fundamentals in 40 Minutes"?
1. Pandas is the Python library that made data science what it is today. Ten verbs get you through 90 percent of day-to-day data work.
2. Use AI as the final authority for the whole decision
3. Avoid checking the answer once it sounds polished
4. Focus only on speed instead of judgment
Which concept is most central to "Pandas Fundamentals in 40 Minutes"?
1. DataFrame
2. pandas
3. Series
4. indexing
What should a careful learner remember about "The SettingWithCopyWarning"?
1. Use "The SettingWithCopyWarning" as a reminder to verify the AI output before anyone relies on it.
2. Skip the context so the tool can guess faster
3. Treat the output as private even after sharing it online
4. Use the answer without checking the source
You want to use AI after this lesson. What is the safest next step?
1. Act immediately because the AI answer is written clearly
2. Use AI for drafting and comparison, but verify before publishing or relying on it.
3. Hide uncertainty so the final answer looks cleaner
4. Use private or sensitive details before checking permission
How should AI output about pandas be treated?
1. As proof that no other source is needed
2. As a replacement for context, consent, or expert review
3. As a draft or helper output that still needs human judgment and verification
4. As something that becomes correct when it sounds confident
Name one way to verify an AI answer about pandas.

← Back to interactive lesson