
@JesperDramsch

Bug fixes:

  • BUG 1: Use consistent normalization for mapping_dict keys and df_new lookups
  • BUG 2: Handle missing 'variations' key with .get() defaults
  • BUG 3: Check column existence before accessing cfp_ext_x/y merge columns
  • BUG 4: Deduplicate by [conference, year] to preserve multi-year entries
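
For readers without the diff open, here is a rough sketch of the pattern behind BUG 1 and BUG 2. It is not the merged code: `normalize_name`, `build_mapping`, `resolve`, and the shape of the entries (a `name` plus an optional `variations` list) are all assumptions made for illustration.

```python
def normalize_name(name: str) -> str:
    """Hypothetical normalizer: case-fold and collapse whitespace."""
    return " ".join(name.casefold().split())


def build_mapping(entries: list[dict]) -> dict[str, str]:
    """Map every normalized alias to its canonical conference name."""
    mapping: dict[str, str] = {}
    for entry in entries:
        canonical = entry["name"]
        # BUG 2 pattern: entries without a 'variations' key get an empty default
        for alias in [canonical, *entry.get("variations", [])]:
            # BUG 1 pattern: keys are normalized with the same function as lookups
            mapping[normalize_name(alias)] = canonical
    return mapping


def resolve(name: str, mapping: dict[str, str]) -> str:
    """Look up a name using the same normalization as the mapping keys."""
    return mapping.get(normalize_name(name), name)


# Illustrative data only, not taken from the repository
entries = [
    {"name": "PyCon US", "variations": ["Pycon United States"]},
    {"name": "EuroPython"},  # no 'variations' key at all
]
mapping = build_mapping(entries)
```

The point of the sketch is that the dictionary keys and the lookup values pass through one shared function, so a name cannot match on one side and miss on the other because of case or spacing.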

Added 22 regression tests in tests/test_merge_no_data_loss.py covering:

  • Normalization consistency and idempotency
  • No silent data loss during merge
  • Variations key handling
  • cfp_ext column handling
  • Multi-year deduplication
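
As an illustration of what the cfp_ext and multi-year checks might look like, here is a minimal pytest-style sketch. It is not copied from tests/test_merge_no_data_loss.py: the helper `coalesce_cfp_ext`, the test names, and the sample frames are hypothetical, and the column names `conference`, `year`, and `cfp_ext` are inferred from the description above.

```python
import pandas as pd


def coalesce_cfp_ext(merged: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: fold cfp_ext_x / cfp_ext_y into cfp_ext if present."""
    merged = merged.copy()
    has_x = "cfp_ext_x" in merged.columns
    has_y = "cfp_ext_y" in merged.columns
    if has_x and has_y:
        # Prefer the left value, fall back to the right one where it is missing
        merged["cfp_ext"] = merged["cfp_ext_x"].combine_first(merged["cfp_ext_y"])
    elif has_x:
        merged["cfp_ext"] = merged["cfp_ext_x"]
    elif has_y:
        merged["cfp_ext"] = merged["cfp_ext_y"]
    # The suffixed columns only exist when both inputs carried cfp_ext
    return merged.drop(columns=["cfp_ext_x", "cfp_ext_y"], errors="ignore")


def test_cfp_ext_present_on_both_sides():
    left = pd.DataFrame({"conference": ["A"], "year": [2026], "cfp_ext": [None]})
    right = pd.DataFrame({"conference": ["A"], "year": [2026], "cfp_ext": ["2026-02-01"]})
    merged = left.merge(right, on=["conference", "year"], how="outer")
    result = coalesce_cfp_ext(merged)
    assert result["cfp_ext"].tolist() == ["2026-02-01"]


def test_cfp_ext_missing_on_one_side():
    left = pd.DataFrame({"conference": ["A"], "year": [2026], "cfp_ext": ["2026-02-01"]})
    right = pd.DataFrame({"conference": ["A"], "year": [2026]})
    merged = left.merge(right, on=["conference", "year"], how="outer")
    # No suffixes are created here, so the helper must not raise a KeyError
    result = coalesce_cfp_ext(merged)
    assert result["cfp_ext"].tolist() == ["2026-02-01"]


def test_multi_year_entries_survive_dedup():
    df = pd.DataFrame(
        {"conference": ["PyCon US", "PyCon US", "PyCon US"], "year": [2025, 2026, 2026]}
    )
    deduped = df.drop_duplicates(subset=["conference", "year"])
    # Only the exact (conference, year) repeat is dropped
    assert deduped["year"].tolist() == [2025, 2026]
```

Deduplicating on `conference` alone would keep a single PyCon US row and silently drop the other year, which is the kind of loss BUG 4 describes.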

@github-actions github-actions bot added the Tests label Jan 18, 2026
@JesperDramsch JesperDramsch merged commit ad47c66 into main Jan 18, 2026
11 checks passed
@JesperDramsch JesperDramsch deleted the claude/fix-csv-merge-loss-4WPng branch January 18, 2026 01:16