The premise
Paste the EXPLAIN ANALYZE output plus the table DDL; the model walks the plan tree, flags hot nodes, and suggests indexes worth trying.
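A minimal sketch of what that paste looks like, assuming a hypothetical `orders` table and query (adjust names and columns to your own schema):

```sql
-- Hypothetical DDL: the model needs column types and existing indexes.
CREATE TABLE orders (
    id          bigint PRIMARY KEY,
    customer_id bigint NOT NULL,
    status      text   NOT NULL,
    created_at  timestamptz NOT NULL
);

-- The plan you paste alongside the DDL; BUFFERS adds cache-vs-disk page counts.
EXPLAIN (ANALYZE, BUFFERS)
SELECT id, created_at
FROM orders
WHERE customer_id = 42 AND status = 'shipped';
```

Sending the DDL with the plan matters: without column types and the list of existing indexes, the model is guessing at what an index could even cover.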
What AI does well here
- Identify seq scans on large tables
- Spot row-estimate vs actual mismatches
- Suggest candidate indexes with rationale
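As an illustration of the kind of node it can flag, here is a hypothetical plan fragment (all numbers invented) with a classic estimate-vs-actual gap:

```
Seq Scan on orders  (cost=0.00..18334.00 rows=120 width=16) (actual time=0.031..412.668 rows=48210 loops=1)
  Filter: ((customer_id = 42) AND (status = 'shipped'))
  Rows Removed by Filter: 951790
```

The planner expected ~120 rows; the node produced 48,210 and discarded nearly a million more. A model can point at that gap and propose a composite index such as `(customer_id, status)`, but only measurement tells you whether it pays off.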
What AI cannot do
- Know your write/read ratio
- Predict planner behavior after VACUUM/ANALYZE
- Replace measurement on production data
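Those limits are why every suggestion stays a hypothesis until measured. A sketch of the verification loop on a production-like clone (table and index names are placeholders):

```sql
-- Refresh planner statistics first; stale estimates invalidate the comparison.
VACUUM ANALYZE orders;

-- Baseline timing before any change.
EXPLAIN (ANALYZE, BUFFERS)
SELECT id FROM orders WHERE customer_id = 42 AND status = 'shipped';

-- Build the suggested index without blocking writes, then re-run the EXPLAIN above.
CREATE INDEX CONCURRENTLY orders_customer_status_idx
    ON orders (customer_id, status);

-- If the read win is marginal, drop it: every INSERT/UPDATE/DELETE now pays to maintain it.
DROP INDEX CONCURRENTLY orders_customer_status_idx;
```

`CONCURRENTLY` avoids locking the table during the build, which makes the experiment cheap to run and cheap to undo.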
End-of-lesson check
15 questions · take it online for instant feedback at tendril.neural-forge.io/learn/quiz/end-ai-coding-AI-sql-explain-plan-review-creators
When you run EXPLAIN ANALYZE on a query in PostgreSQL, what additional information do you get compared to just running EXPLAIN?
- The actual execution time and row counts returned by each plan node
- The exact server hardware specifications being used
- The SQL syntax errors if any exist in the query
- A list of all indexes that exist on every table in the database
Why should you walk through an EXPLAIN ANALYZE plan from bottom to top (reverse order) when analyzing it with an LLM?
- Top nodes always take the longest time regardless of what happens below
- The database optimizer requires this order to validate the plan
- Child nodes feed data into parent nodes, so understanding what each child produces helps explain the parent's behavior
- PostgreSQL stores plans in reverse order for historical reasons
What does it typically mean when the 'rows' estimate in a query plan differs significantly from the actual rows processed?
- The SQL syntax is invalid and needs correction
- The query is definitely returning incorrect results
- The database has detected hardware failure
- The query planner's statistics may be outdated, causing it to choose a suboptimal plan
What is a major limitation of using an LLM to suggest indexes for your production database?
- The LLM will delete existing indexes that conflict with its suggestions
- The LLM doesn't know your application's read/write ratio, so it may suggest indexes that severely slow down writes
- LLMs always suggest incorrect SQL syntax for index creation
- The LLM cannot connect to your database to run EXPLAIN ANALYZE
After running VACUUM ANALYZE on a PostgreSQL database, what might change about future query plans?
- Query plans will always become faster after VACUUM
- The planner's row estimates may become more accurate because fresh statistics were collected
- The database will switch to a different storage engine
- All existing indexes will be automatically rebuilt
Why is it dangerous to apply an LLM's index suggestions directly to a production database without testing?
- The index might significantly slow down INSERT, UPDATE, and DELETE operations while only marginally improving reads
- The LLM might suggest indexes that corrupt data
- The suggestion might use syntax that causes syntax errors
- PostgreSQL prevents index changes without downtime
What specific information should you provide to an LLM along with an EXPLAIN ANALYZE output to get useful index suggestions?
- A list of all queries ever run against the database
- The database server's IP address and password
- The table DDL (CREATE TABLE statements) showing column types and existing indexes
- The complete database backup file
What does the 'actual time' value in each node of an EXPLAIN ANALYZE output represent?
- The network latency between the client and database server
- The measured time in milliseconds that node spent executing, reported per loop (multiply by the loops count for the total)
- The estimated time the planner predicts the node will take
- The time PostgreSQL took to compile the query plan
What is a 'hot node' in the context of query plan analysis?
- A node that is currently being accessed by multiple database connections
- A node that performs encryption on sensitive data
- A plan node that consumes a disproportionate amount of execution time or processes far more rows than expected
- A node that has been cached in PostgreSQL's shared buffer
When an LLM suggests an index, what 'tradeoff' should you evaluate before creating it?
- The index will use additional disk space and memory
- All of the above
- The index might become outdated as data grows
- The index will speed up queries but add overhead to every data-modifying operation
What does an LLM NOT know about your database that could affect whether its index suggestions are appropriate?
- The username of the database administrator
- The exact version of PostgreSQL being used
- The total size of all tables combined
- Your application's typical read-to-write ratio and query patterns
What should you do before applying any index suggestion from an LLM to production?
- Ask the LLM to confirm the suggestion three times
- Apply it during peak traffic to see real impact
- Test the index on a production-like clone and measure the actual performance difference
- Wait exactly 24 hours before applying
What does the 'buffers' information in EXPLAIN ANALYZE output tell you?
- The exact memory allocation for each query operation
- How many pages were read from memory (cache) vs disk for each node
- The network buffer sizes configured for PostgreSQL
- A list of all database connection buffers currently in use
If EXPLAIN shows a 'Nested Loop' join with an estimated 100 rows but actual 50,000 rows, what is likely happening?
- The query returned an error and only partial results
- The database ran out of memory and switched to a different algorithm
- The join is performing far more work than the planner expected, likely due to outdated statistics
- The index on the inner table is being used incorrectly
What is one reason an LLM might incorrectly estimate the effectiveness of an index suggestion?
- The LLM always uses the most recent PostgreSQL documentation
- The LLM performs test queries to verify suggestions
- The LLM cannot know about other concurrent queries and overall system load
- The LLM has access to your actual production data