Skip to content

Instantly share code, notes, and snippets.

@Cdaprod
Forked from isaacarnault/OUTPUT.md
Created February 17, 2024 18:42
Show Gist options
  • Select an option

  • Save Cdaprod/3ccc41e0bfaadb21b50a0774dd3dac10 to your computer and use it in GitHub Desktop.

Select an option

Save Cdaprod/3ccc41e0bfaadb21b50a0774dd3dac10 to your computer and use it in GitHub Desktop.

Revisions

  1. @isaacarnault isaacarnault revised this gist Dec 22, 2018. 2 changed files with 7 additions and 7 deletions.
    8 changes: 4 additions & 4 deletions scripting_in_Python.R
    Original file line number Diff line number Diff line change
    @@ -1,19 +1,19 @@
    # 1. Checking Python version
    #1 Checking Python version
    !python -V

    # 2. Import pandas to read the dataframe
    #2 Import pandas to read the dataframe
    import pandas as pd
    pd.set_option('display.max_columns', None)

    MyData = pd.read_csv("http://samplecsvs.s3.amazonaws.com/SalesJan2009.csv")
    #3 Show the first rows of the dataframe

    #3 Show the first rows of the dataframe
    MyData.head()

    #4 Get the dimensions of the dataframe
    MyData.shape

    #Full code
    # Full code
    !python -V

    import pandas as pd
    6 changes: 3 additions & 3 deletions scripting_in_R.R
    Original file line number Diff line number Diff line change
    @@ -1,7 +1,7 @@
    # 1. Checking R version
    #1 Checking R version
    R.Version()$version.string

    # 2. Download the dataframe from a remote server
    #2 Download the dataframe from a remote server
    download.file("http://samplecsvs.s3.amazonaws.com/SalesJan2009.csv",
    destfile="/resources/data/SalesJan2009.csv", quiet = TRUE)

    @@ -13,7 +13,7 @@ head(MyData, 5)
    ncol(MyData)
    nrow(MyData)

    #Full code
    # Full code
    R.Version()$version.string

    download.file("http://samplecsvs.s3.amazonaws.com/SalesJan2009.csv",
  2. @isaacarnault isaacarnault revised this gist Dec 22, 2018. 1 changed file with 1 addition and 16 deletions.
    17 changes: 1 addition & 16 deletions OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -2,22 +2,7 @@

    <details>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/fLNGND26/isaac-arnault-data-collection-P.png)](https://postimg.cc/B8mY59Gg)

    </p>
    </details>

    # Data collection using R

    <details>
    <summary>See output</summary>
    <p>

    <details>
    <summary>See output</summary>
    <p>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/fLNGND26/isaac-arnault-data-collection-P.png)](https://postimg.cc/B8mY59Gg)

  3. @isaacarnault isaacarnault revised this gist Dec 21, 2018. 1 changed file with 2 additions and 3 deletions.
    5 changes: 2 additions & 3 deletions OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -13,9 +13,8 @@

    <details>
    <summary>See output</summary>
    <p>
    # Data collection using Python

    <p>

    <details>
    <summary>See output</summary>
    <p>
  4. @isaacarnault isaacarnault revised this gist Dec 19, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -15,7 +15,7 @@ I used no versioning system for this gist. My gist gist's <b>repos status<b> is

    ## Licence

    All public gists https://gist.github.com/aiPhD<br>
    All public gists https://gist.github.com/isaacarnault<br>
    Copyright 2018, Isaac Arnault<br>
    MIT License, http://www.opensource.org/licenses/mit-license.php

  5. @isaacarnault isaacarnault revised this gist Dec 18, 2018. 1 changed file with 17 additions and 1 deletion.
    18 changes: 17 additions & 1 deletion OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -11,11 +11,27 @@

    # Data collection using R

    <details>
    <summary>See output</summary>
    <p>
    # Data collection using Python

    <details>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/fLNGND26/isaac-arnault-data-collection-P.png)](https://postimg.cc/B8mY59Gg)

    </p>
    </details>

    # Data collection using R

    <details>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-R.png](https://i.postimg.cc/WpCfMxDq/isaac-arnault-data-collection-R.png)](https://postimg.cc/V59RwGWz)
    [![isaac-arnault-data-collection-using-R.png](https://i.postimg.cc/nVyN0yJX/isaac-arnault-data-collection-using-R.png)](https://postimg.cc/23xH5X3Y)

    </p>
    </details>
  6. @isaacarnault isaacarnault revised this gist Dec 17, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -7,7 +7,7 @@ The following gist offers a focus on <b>Data Collection</b>, one of the stages*

    # Versioning

    I used no versioning system for this gist. Its status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.
    I used no versioning system for this gist. My gist gist's <b>repos status<b> is flagged as <b>concept<b> because it is intended to be a demo or POC (proof-of-concept).

    ## Author

  7. @isaacarnault isaacarnault revised this gist Dec 16, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion architecture.md
    Original file line number Diff line number Diff line change
    @@ -2,7 +2,7 @@
    <summary>Vertices of Data Science methodology</summary>
    <p>

    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/VkdvHWyp/isaac-arnault-data-science-methodology.png)](https://postimg.cc/rD2qDWwN)
    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/hPDLBtNq/isaac-arnault-data-science-methodology.png)](https://postimg.cc/QVz9ghX6)

    </p>
    </details>
  8. @isaacarnault isaacarnault revised this gist Dec 14, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion architecture.md
    Original file line number Diff line number Diff line change
    @@ -2,7 +2,7 @@
    <summary>Vertices of Data Science methodology</summary>
    <p>

    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/Zng48XHj/isaac-arnault-data-science-methodology.png)](https://postimg.cc/GTkWr7Zs)
    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/VkdvHWyp/isaac-arnault-data-science-methodology.png)](https://postimg.cc/rD2qDWwN)

    </p>
    </details>
  9. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -11,7 +11,7 @@ I used no versioning system for this gist. Its status is flagged as <b>active</b

    ## Author

    * **Isaac Arnault** - Suggesting two implementations in `Python` and `R`, from *Initial work* [Cognitive Class Lab - Module 2](https://cognitiveclass.ai/courses/data-science-methodology-2/) and one exercise.
    * **Isaac Arnault** - Suggesting two implementations in `Python` and `R`, from *Initial work* [Cognitive Class Lab - Module 2](https://cognitiveclass.ai/courses/data-science-methodology-2/) and providing one exercise.

    ## Licence

  10. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -7,7 +7,7 @@ The following gist offers a focus on <b>Data Collection</b>, one of the stages*

    # Versioning

    I used no versioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.
    I used no versioning system for this gist. Its status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.

    ## Author

  11. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -7,7 +7,7 @@ The following gist offers a focus on <b>Data Collection</b>, one of the stages*

    # Versioning

    I used no vesioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.
    I used no versioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.

    ## Author

  12. @isaacarnault isaacarnault revised this gist Dec 10, 2018. No changes.
  13. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 2 changed files with 3 additions and 3 deletions.
    4 changes: 2 additions & 2 deletions OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -1,10 +1,10 @@
    # Data collection using P
    # Data collection using Python

    <details>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/x8rSqL1F/isaac-arnault-data-collection-P.png)](https://postimg.cc/Q9qzy95p)
    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/fLNGND26/isaac-arnault-data-collection-P.png)](https://postimg.cc/B8mY59Gg)

    </p>
    </details>
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -27,6 +27,6 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Perform a data collection in `Python` and `R` using `Jupyter`.<br>
    ⇢ Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variables does the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.<br>
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>architecture.md</b>.
  14. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -26,7 +26,7 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    ## Exercise
    * Perform a data collection in `Python` and `R` using `Jupyter`.<br>
    ⇢ Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variables do the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.<br>
    * How many observations and variables does the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>architecture.md</b>.
  15. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -27,6 +27,6 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Perform a data collection in `Python` and `R` using `Jupyter`.<br>
    ⇢ Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variables do the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.<br>
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>architecture.md</b>.
  16. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -20,7 +20,7 @@ Copyright 2018, Isaac Arnault<br>
    MIT License, http://www.opensource.org/licenses/mit-license.php

    ## Sources
    * Architecture appended in <b>OUTPUT.md</b>, inspired by [Cognitiveclass.ai](https://cognitiveclass.ai/).<br>
    * Figure appended in <b>architecture.md</b>, inspired by [Cognitiveclass.ai](https://cognitiveclass.ai/).<br>
    * Dataframe used as sample coming from [Spatialkey.com](https://support.spatialkey.com/spatialkey-sample-csv-data/).

    ## Exercise
  17. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -7,7 +7,7 @@ The following gist offers a focus on <b>Data Collection</b>, one of the stages*

    # Versioning

    I used no vesioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/f4cdef7878e88ee2bed1254a2b5fbcb5) related to this repository is pending as <b>concept</b>.
    I used no vesioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/15873ff613af833f9693e1a595bdfcc6) related to this repository is pending as <b>concept</b>.

    ## Author

  18. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -24,8 +24,8 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Dataframe used as sample coming from [Spatialkey.com](https://support.spatialkey.com/spatialkey-sample-csv-data/).

    ## Exercise
    * Perform data collection in `Python` and `R` using `Jupyter`.<br>
    * Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * Perform a data collection in `Python` and `R` using `Jupyter`.<br>
    Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variables do the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
  19. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -24,9 +24,9 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Dataframe used as sample coming from [Spatialkey.com](https://support.spatialkey.com/spatialkey-sample-csv-data/).

    ## Exercise
    * Perform data collection in `Python` and `R` using `Jupyter`<br>
    * Perform data collection in `Python` and `R` using `Jupyter`.<br>
    * Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variable do the dataframe contain? Base your assessment on your scripting outputs.
    * How many observations and variables do the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>architecture.md</b>.
  20. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -1,4 +1,4 @@
    # Data collection and operations using Python and R
    # Data collection and statistics using Python and R

    [![Project Status: Concept – Minimal or no implementation has been done yet, or the repository is only intended to be a limited example, demo, or proof-of-concept.](https://www.repostatus.org/badges/latest/concept.svg)](https://www.repostatus.org/#concept)

  21. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion README.md
    Original file line number Diff line number Diff line change
    @@ -11,7 +11,7 @@ I used no vesioning system for this gist. My repository's status is flagged as <

    ## Author

    * **Isaac Arnault** - Suggesting two implementations in `Python` and `R`, from *Initial work* [Cognitive Class Lab - Module 2](https://cognitiveclass.ai/courses/data-science-methodology-2/)
    * **Isaac Arnault** - Suggesting two implementations in `Python` and `R`, from *Initial work* [Cognitive Class Lab - Module 2](https://cognitiveclass.ai/courses/data-science-methodology-2/) and one exercise.

    ## Licence

  22. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -1,9 +1,9 @@
    # Data collection using Python and R - Using one dataframe
    # Data collection and operations using Python and R

    [![Project Status: Concept – Minimal or no implementation has been done yet, or the repository is only intended to be a limited example, demo, or proof-of-concept.](https://www.repostatus.org/badges/latest/concept.svg)](https://www.repostatus.org/#concept)

    ## Scripting in Python and R
    The following gist offers a focus on <b>Data Collection</b>, one of the stages* of the Data Science methodology.
    The following gist offers a focus on <b>Data Collection</b>, one of the stages* of the Data Science methodology. We will also perform basic math operations on a single dataframe to see how they render using Python or R.

    # Versioning

  23. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 3 additions and 2 deletions.
    5 changes: 3 additions & 2 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -26,6 +26,7 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    ## Exercise
    * Perform data collection in `Python` and `R` using `Jupyter`<br>
    * Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variable do the dataframe contain. Base your assessment on your scripting outputs.
    * How many observations and variable do the dataframe contain? Base your assessment on your scripting outputs.
    * Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>figures.md</b>.
    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>architecture.md</b>.
  24. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 2 changed files with 16 additions and 3 deletions.
    2 changes: 1 addition & 1 deletion architecture.md
    Original file line number Diff line number Diff line change
    @@ -5,4 +5,4 @@
    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/Zng48XHj/isaac-arnault-data-science-methodology.png)](https://postimg.cc/GTkWr7Zs)

    </p>
    </details>
    </details>
    17 changes: 15 additions & 2 deletions exercise_solutions.md
    Original file line number Diff line number Diff line change
    @@ -22,10 +22,23 @@ Mean = 10131487.5 # Using R in Jupyter, otherwise Mean = 10131488 in RStudio
    </details>

    <details>
    <summary>Vertices of Data Science methodology</summary>
    <summary>Complete solution using Python and Pandas</summary>
    <p>

    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/Zng48XHj/isaac-arnault-data-science-methodology.png)](https://postimg.cc/GTkWr7Zs)
    [![isaac-arnault-using-pandas-P.png](https://i.postimg.cc/TwqPbnk1/isaac-arnault-using-pandas-P.png)](https://postimg.cc/DJ0Fk433)

    [![isaac-arnault-using-pandas-P-2.png](https://i.postimg.cc/tg6JXKWh/isaac-arnault-using-pandas-P-2.png)](https://postimg.cc/8F1N4ny5)

    </p>
    </details>

    <details>
    <summary>Complete solution using R</summary>
    <p>

    [![isaac-arnault-using-pandas-R.png](https://i.postimg.cc/6qWpfTS0/isaac-arnault-using-pandas-R.png)](https://postimg.cc/8JYG1kTJ)

    [![isaac-arnault-using-pandas-R2.png](https://i.postimg.cc/1tZt4vrb/isaac-arnault-using-pandas-R2.png)](https://postimg.cc/mc8R5NqS)

    </p>
    </details>
  25. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 23 additions and 1 deletion.
    24 changes: 23 additions & 1 deletion exercise_solutions.md
    Original file line number Diff line number Diff line change
    @@ -1,9 +1,31 @@
    <details>
    <summary>See answer</summary>
    <summary>Question answer</summary>
    <p>

    ```Python
    There are 10 variables and 1461 observations in the dataframe.
    ```
    </p>
    </details>

    <details>
    <summary>Calculations using Python and R</summary>
    <p>

    ```Python
    Sum = 14791971750
    Min = 6000
    Max = 300000000
    Mean = 10131487.5 # Using R in Jupyter, otherwise Mean = 10131488 in RStudio
    ```
    </p>
    </details>

    <details>
    <summary>Vertices of Data Science methodology</summary>
    <p>

    [![isaac-arnault-data-science-methodology.png](https://i.postimg.cc/Zng48XHj/isaac-arnault-data-science-methodology.png)](https://postimg.cc/GTkWr7Zs)

    </p>
    </details>
  26. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 2 additions and 2 deletions.
    4 changes: 2 additions & 2 deletions OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -1,7 +1,7 @@
    # Data collection using P

    <details>
    <summary>See notebook</summary>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/x8rSqL1F/isaac-arnault-data-collection-P.png)](https://postimg.cc/Q9qzy95p)
    @@ -12,7 +12,7 @@
    # Data collection using R

    <details>
    <summary>See notebook</summary>
    <summary>See output</summary>
    <p>

    [![isaac-arnault-data-collection-R.png](https://i.postimg.cc/WpCfMxDq/isaac-arnault-data-collection-R.png)](https://postimg.cc/V59RwGWz)
  27. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 3 deletions.
    4 changes: 1 addition & 3 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -23,11 +23,9 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Architecture appended in <b>OUTPUT.md</b>, inspired by [Cognitiveclass.ai](https://cognitiveclass.ai/).<br>
    * Dataframe used as sample coming from [Spatialkey.com](https://support.spatialkey.com/spatialkey-sample-csv-data/).

    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>figures.md</b>.

    ## Exercise
    * Perform data collection in `Python` and `R` using `Jupyter`<br>
    * Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variable do the dataframe contain. Base your assessment on your scripting outputs.

    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>figures.md</b>.
  28. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 4 changed files with 17 additions and 1 deletion.
    7 changes: 7 additions & 0 deletions README.md
    Original file line number Diff line number Diff line change
    @@ -23,4 +23,11 @@ MIT License, http://www.opensource.org/licenses/mit-license.php
    * Architecture appended in <b>OUTPUT.md</b>, inspired by [Cognitiveclass.ai](https://cognitiveclass.ai/).<br>
    * Dataframe used as sample coming from [Spatialkey.com](https://support.spatialkey.com/spatialkey-sample-csv-data/).

    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>figures.md</b>.

    ## Exercise
    * Perform data collection in `Python` and `R` using `Jupyter`<br>
    * Use the following dataframe from [Spatialkey.com](http://samplecsvs.s3.amazonaws.com/TechCrunchcontinentalUSA.csv).
    * How many observations and variable do the dataframe contain. Base your assessment on your scripting outputs.

    (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See <b>figures.md</b>.
    9 changes: 9 additions & 0 deletions exercise_solutions.md
    Original file line number Diff line number Diff line change
    @@ -0,0 +1,9 @@
    <details>
    <summary>See answer</summary>
    <p>

    ```Python
    There are 10 variables and 1461 observations in the dataframe.
    ```
    </p>
    </details>
    2 changes: 1 addition & 1 deletion scripting_in_Python.r → scripting_in_Python.R
    Original file line number Diff line number Diff line change
    @@ -24,4 +24,4 @@ MyData = pd.read_csv("http://samplecsvs.s3.amazonaws.com/SalesJan2009.csv")
    #3 Show the first rows of the dataframe
    MyData.head()

    MyData.shape
    MyData.shape
    File renamed without changes.
  29. @isaacarnault isaacarnault renamed this gist Dec 10, 2018. 1 changed file with 4 additions and 0 deletions.
    4 changes: 4 additions & 0 deletions README-Template.md → README.md
    Original file line number Diff line number Diff line change
    @@ -5,6 +5,10 @@
    ## Scripting in Python and R
    The following gist offers a focus on <b>Data Collection</b>, one of the stages* of the Data Science methodology.

    # Versioning

    I used no vesioning system for this gist. My repository's status is flagged as <b>active</b> because it has reached a stable, usable state. Original [gist](https://gist.github.com/aiPhD/f4cdef7878e88ee2bed1254a2b5fbcb5) related to this repository is pending as <b>concept</b>.

    ## Author

    * **Isaac Arnault** - Suggesting two implementations in `Python` and `R`, from *Initial work* [Cognitive Class Lab - Module 2](https://cognitiveclass.ai/courses/data-science-methodology-2/)
  30. @isaacarnault isaacarnault revised this gist Dec 10, 2018. 1 changed file with 1 addition and 1 deletion.
    2 changes: 1 addition & 1 deletion OUTPUT.md
    Original file line number Diff line number Diff line change
    @@ -4,7 +4,7 @@
    <summary>See notebook</summary>
    <p>

    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/fTdq55X2/isaac-arnault-data-collection-P.png)](https://postimg.cc/GHcjmFXG)
    [![isaac-arnault-data-collection-P.png](https://i.postimg.cc/x8rSqL1F/isaac-arnault-data-collection-P.png)](https://postimg.cc/Q9qzy95p)

    </p>
    </details>