Generate step-by-step data cleaning instructions for messy datasets.
Act as a data analyst who specializes in data cleaning and preparation. I have a messy dataset that needs cleaning.

Dataset description:
- Source: [WHERE THE DATA CAME FROM]
- Rows: [APPROXIMATE COUNT]
- Columns: [LIST KEY COLUMNS]
- Known issues: [DESCRIBE PROBLEMS: missing values, duplicates, format issues, outliers]
- Tool: [Excel / Python pandas / R / SQL / Google Sheets]

Generate:
1. **Assessment**: What issues exist and their severity
2. **Cleaning Steps** (numbered, in order):
   - Exact code/formulas for each step (in my tool of choice)
   - What each step fixes
   - How to verify it worked
3. **Validation Checks**: How to confirm the data is clean
4. **Documentation**: Summary of changes made (for reproducibility)

Prioritize steps that affect data quality most. Note any decisions that require human judgment.
Get clear, tool-specific instructions for cleaning messy datasets.
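For a sense of what a response might look like when the tool is Python pandas, here is a minimal sketch. The dataset, the column names (`customer_id`, `email`, `signup_date`, `order_total`), the `clean` helper, and the 1.5 × IQR outlier rule are all assumptions made for illustration. They are not part of the prompt, and a real response would depend on the dataset you describe.

```python
import pandas as pd

# Hypothetical messy input; every column name and value here is made up.
raw = pd.DataFrame({
    "customer_id": [1, 2, 2, 3, 4],
    "email": [" A@x.com", "b@x.com", "b@x.com", None, "D@X.COM "],
    "signup_date": ["2023-01-05", "2023-02-10", "2023-02-10",
                    "not a date", "2023-03-01"],
    "order_total": [20.0, 35.5, 35.5, 18.0, 9999.0],
})


def clean(df):
    """Return a cleaned copy of df plus a log of the changes made."""
    log = []
    out = df.copy()  # leave the raw data untouched for reproducibility

    # Step 1: normalize text so equivalent values compare as equal.
    out["email"] = out["email"].str.strip().str.lower()
    log.append("email: trimmed whitespace and lowercased")

    # Step 2: drop exact duplicate rows, then verify none remain.
    before = len(out)
    out = out.drop_duplicates().reset_index(drop=True)
    assert not out.duplicated().any()
    log.append(f"dropped {before - len(out)} exact duplicate rows")

    # Step 3: parse dates; values that do not match the format become NaT.
    out["signup_date"] = pd.to_datetime(
        out["signup_date"], format="%Y-%m-%d", errors="coerce"
    )
    n_bad = int(out["signup_date"].isna().sum())
    log.append(f"signup_date: {n_bad} unparseable values set to NaT")

    # Step 4: flag (not delete) high outliers with the 1.5 * IQR rule.
    # Whether a flagged value is an error is a human-judgment decision.
    q1, q3 = out["order_total"].quantile([0.25, 0.75])
    upper = q3 + 1.5 * (q3 - q1)
    out["order_total_outlier"] = out["order_total"] > upper
    n_out = int(out["order_total_outlier"].sum())
    log.append(f"order_total: flagged {n_out} values above {upper:.2f}")

    # Step 5: report missing values rather than silently filling them.
    n_missing = int(out["email"].isna().sum())
    log.append(f"email: {n_missing} missing values left as-is")

    return out, log


cleaned, log = clean(raw)

# Documentation: print the change log so the cleaning can be reproduced.
for entry in log:
    print("-", entry)
```

The sketch flags outliers and reports missing values instead of removing or filling them, since those are the kinds of decisions the prompt asks to have called out for human judgment.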
Transform raw data into compelling narratives with insights.
Break down financial statements into plain-English insights.
Analyze survey responses and extract actionable insights.