Skip to content

Instantly share code, notes, and snippets.

@sobernaut
Created July 19, 2020 23:30
Show Gist options
  • Select an option

  • Save sobernaut/abc18e4773159a57b41fb9dd1e23e780 to your computer and use it in GitHub Desktop.

Select an option

Save sobernaut/abc18e4773159a57b41fb9dd1e23e780 to your computer and use it in GitHub Desktop.
Enter year1992,1998,1999,2000,2001,2002,2003,2004,2005,2006,2007,2008,2009,2010,2011,2012
------------------------Year 1992------------------------
Processed file ./updated/1992.csv with 666207 lines
Processed file ./data/new/1992new.csv with 666207 lines
Total ids in new 666206
Total ids in old 666206
Same or not? [False, False]
Ids in new that are not in old 33519
Processed file ./data/1992.csv with 666207 lines
original 666206
Same or not? [False, False] [False, False]
Ids in new that are not in original 41797
Ids in old that are not in original 45170
------------------------Year 1998------------------------
Processed file ./updated/1998.csv with 683861 lines
Processed file ./data/new/1998new.csv with 683861 lines
Total ids in new 683860
Total ids in old 683860
Same or not? [False, False]
Ids in new that are not in old 22601
Processed file ./data/1998.csv with 683861 lines
original 683860
Same or not? [False, False] [False, False]
Ids in new that are not in original 39894
Ids in old that are not in original 48997
------------------------Year 1999------------------------
Processed file ./updated/1999.csv with 688416 lines
Processed file ./data/new/1999new.csv with 688416 lines
Total ids in new 688415
Total ids in old 688415
Same or not? [False, False]
Ids in new that are not in old 22772
Processed file ./data/1999.csv with 688416 lines
original 688415
Same or not? [False, False] [False, False]
Ids in new that are not in original 40491
Ids in old that are not in original 49819
------------------------Year 2000------------------------
Processed file ./updated/2000.csv with 691061 lines
Processed file ./data/new/2000new.csv with 691060 lines
Total ids in new 691059
Total ids in old 691060
Same or not? [False, False]
Ids in new that are not in old 22632
Processed file ./data/2000.csv with 691060 lines
original 691059
Same or not? [False, False] [False, False]
Ids in new that are not in original 40629
Ids in old that are not in original 51002
------------------------Year 2001------------------------
Processed file ./updated/2001.csv with 694942 lines
Processed file ./data/new/2001new.csv with 694941 lines
Total ids in new 694940
Total ids in old 694941
Same or not? [False, False]
Ids in new that are not in old 22443
Processed file ./data/2001.csv with 694941 lines
original 694940
Same or not? [False, False] [False, False]
Ids in new that are not in original 39037
Ids in old that are not in original 49783
------------------------Year 2002------------------------
Processed file ./updated/2002.csv with 697007 lines
Processed file ./data/new/2002new.csv with 697006 lines
Total ids in new 697005
Total ids in old 697006
Same or not? [False, False]
Ids in new that are not in old 22174
Processed file ./data/2002.csv with 697006 lines
original 697005
Same or not? [False, False] [False, False]
Ids in new that are not in original 25295
Ids in old that are not in original 36401
------------------------Year 2003------------------------
Processed file ./updated/2003.csv with 687347 lines
Processed file ./data/new/2003new.csv with 699904 lines
Total ids in new 699903
Total ids in old 687346
Same or not? [False, False]
Ids in new that are not in old 31170
Processed file ./data/2003.csv with 699904 lines
original 699903
Same or not? [False, False] [False, False]
Ids in new that are not in original 25073
Ids in old that are not in original 36508
------------------------Year 2004------------------------
Processed file ./updated/2004.csv with 690905 lines
Processed file ./data/new/2004new.csv with 703535 lines
Total ids in new 703534
Total ids in old 690904
Same or not? [False, False]
Ids in new that are not in old 31356
Processed file ./data/2004.csv with 703535 lines
original 703534
Same or not? [False, False] [False, False]
Ids in new that are not in original 25617
Ids in old that are not in original 37195
------------------------Year 2005------------------------
Processed file ./updated/2005.csv with 706752 lines
Processed file ./data/new/2005new.csv with 706754 lines
Total ids in new 706753
Total ids in old 706751
Same or not? [False, False]
Ids in new that are not in old 22129
Processed file ./data/2005.csv with 706754 lines
original 706753
Same or not? [False, False] [False, False]
Ids in new that are not in original 25400
Ids in old that are not in original 36960
------------------------Year 2006------------------------
Processed file ./updated/2006.csv with 709743 lines
Processed file ./data/new/2006new.csv with 709614 lines
Total ids in new 709613
Total ids in old 709742
Same or not? [False, False]
Ids in new that are not in old 13047
Processed file ./data/2006.csv with 709614 lines
original 709613
Same or not? [False, False] [False, False]
Ids in new that are not in original 12141
Ids in old that are not in original 14744
------------------------Year 2007------------------------
Processed file ./updated/2007.csv with 715564 lines
Processed file ./data/new/2007new.csv with 715435 lines
Total ids in new 715434
Total ids in old 715563
Same or not? [False, False]
Ids in new that are not in old 12978
Processed file ./data/2007.csv with 715435 lines
original 715434
Same or not? [False, False] [False, False]
Ids in new that are not in original 12070
Ids in old that are not in original 14722
------------------------Year 2008------------------------
Processed file ./updated/2008.csv with 717947 lines
Processed file ./data/new/2008new.csv with 717823 lines
Total ids in new 717822
Total ids in old 717946
Same or not? [False, False]
Ids in new that are not in old 12938
Processed file ./data/2008.csv with 717823 lines
original 717822
Same or not? [False, False] [False, False]
Ids in new that are not in original 12087
Ids in old that are not in original 14869
------------------------Year 2009------------------------
Processed file ./updated/2009.csv with 713230 lines
Processed file ./data/new/2009new.csv with 713116 lines
Total ids in new 713115
Total ids in old 713229
Same or not? [False, False]
Ids in new that are not in old 12954
Processed file ./data/2009.csv with 713116 lines
original 713115
Same or not? [False, False] [False, False]
Ids in new that are not in original 12019
Ids in old that are not in original 14739
------------------------Year 2010------------------------
Processed file ./updated/2010.csv with 712559 lines
Processed file ./data/new/2010new.csv with 604494 lines
Total ids in new 604493
Total ids in old 712558
Same or not? [False, False]
Ids in new that are not in old 12945
Processed file ./data/2010.csv with 604494 lines
original 604493
Same or not? [False, False] [False, False]
Ids in new that are not in original 8643
Ids in old that are not in original 28611
------------------------Year 2011------------------------
Processed file ./updated/2011.csv with 712313 lines
Processed file ./data/new/2011new.csv with 605104 lines
Total ids in new 605103
Total ids in old 712312
Same or not? [False, False]
Ids in new that are not in old 39
Processed file ./data/2011.csv with 605104 lines
original 605103
Same or not? [False, False] [False, False]
Ids in new that are not in original 15
Ids in old that are not in original 14190
------------------------Year 2012------------------------
Processed file ./updated/2012.csv with 716437 lines
Processed file ./data/new/2012new.csv with 607381 lines
Total ids in new 607380
Total ids in old 716436
Same or not? [False, False]
Ids in new that are not in old 38
Processed file ./data/2012.csv with 607381 lines
original 607380
Same or not? [False, False] [False, False]
Ids in new that are not in original 15
Ids in old that are not in original 13421
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment