Common Cleaning Risks and How to Avoid Them
Table 1
Table
6 / 9
Table 1
Table
Table 1
| Risk | Example | Prevention |
|---|---|---|
| Overwriting raw variables | Replacing numeric code with label in same column | Create new derived or labeled variable |
| Silent row loss | Filtering without counting exclusions | Record before and after counts |
| Incorrect dates | Treating text dates as real dates | Parse dates explicitly and inspect results |
| Wrong recoding | Reversing `1 = Female`, `2 = Male` | Validate against data dictionary |
| Uncontrolled missing codes | Treating `999` as a real value | Define and convert special missing codes carefully |
| Unreviewed automation | Sending query lists without human review | Require data manager review before action |