Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Save eliassoares/604fc73005584b6dd22b59091e03f33c to your computer and use it in GitHub Desktop.
Save eliassoares/604fc73005584b6dd22b59091e03f33c to your computer and use it in GitHub Desktop.

Revisions

  1. @gjreda gjreda revised this gist Nov 19, 2014. 1 changed file with 4 additions and 1 deletion.
    5 changes: 4 additions & 1 deletion gistfile1.py
    Original file line number Diff line number Diff line change
    @@ -10,4 +10,7 @@

    %time trips = pd.read_csv('data/divvy/Divvy_Trips_2013.csv', converters={'starttime': to_datetime, 'stoptime': to_datetime})
    # CPU times: user 17.6 s, sys: 269 ms, total: 17.9 s
    # Wall time: 17.9 s
    # Wall time: 17.9 s

    # $ wc -l divvy/Divvy_Trips_2013.csv
    # 759789 divvy/Divvy_Trips_2013.csv
  2. @gjreda gjreda revised this gist Nov 19, 2014. 1 changed file with 1 addition and 0 deletions.
    1 change: 1 addition & 0 deletions gistfile1.py
    Original file line number Diff line number Diff line change
    @@ -1,5 +1,6 @@
    # When you're sure of the format, it's much quicker to explicitly convert your dates than use `parse_dates`
    # Makes sense; was just surprised by the time difference.
    import pandas as pd
    from datetime import datetime
    to_datetime = lambda d: datetime.strptime(d, '%m/%d/%Y %H:%M')

  3. @gjreda gjreda created this gist Nov 19, 2014.
    12 changes: 12 additions & 0 deletions gistfile1.py
    Original file line number Diff line number Diff line change
    @@ -0,0 +1,12 @@
    # When you're sure of the format, it's much quicker to explicitly convert your dates than use `parse_dates`
    # Makes sense; was just surprised by the time difference.
    from datetime import datetime
    to_datetime = lambda d: datetime.strptime(d, '%m/%d/%Y %H:%M')

    %time trips = pd.read_csv('data/divvy/Divvy_Trips_2013.csv', parse_dates=['starttime', 'stoptime'])
    # CPU times: user 1min 29s, sys: 331 ms, total: 1min 29s
    # Wall time: 1min 30s

    %time trips = pd.read_csv('data/divvy/Divvy_Trips_2013.csv', converters={'starttime': to_datetime, 'stoptime': to_datetime})
    # CPU times: user 17.6 s, sys: 269 ms, total: 17.9 s
    # Wall time: 17.9 s