Skip to content
#

data-wrangling

Here are 550 public repositories matching this topic...

thadguidry
thadguidry commented Jul 13, 2021

I would like to import a .zst or .zstd file but currently that file type is not recognized to be imported by OpenRefine.

Proposed solution

Perhaps using Apache Commons Compress (we already have usage in ImportingUtilities.java)
https://commons.apache.org/proper/commons-compress/examples.html#Zstandard

Allow importing the .zstor .zstd file from my local computer, as well as fr

gsvidaurre
gsvidaurre commented Aug 3, 2021

These materials are really excellent, and I have a small suggestion that you should feel free to take or leave as you see fit. In the section "Knowing your way around RStudio" it might be beneficial to highlight the different options that users can change about the RStudio appearance in Tools > Global Options.

Some of these options available in "Global Options" are really helpful and if users k

umnik20
umnik20 commented May 4, 2020

Dear Community,

There is a typo in the section titled "The StringsAsFactors argument" after the second block of code that demonstrates the use of the str() function. Right after the code boxes is written "We can see that the $Color and $State columns are factors and $Speed is a numeric column", but the box shows that the $Color column is a vector of strings.

Regards,

Rodolfo

davis68
davis68 commented Sep 17, 2020
  • I felt like nunique was arbitrarily (re)introduced when it was necessary. It wouldn't be top-of-mind for students solving problems.
  • The lesson answers need to be adjacent to the exercises.
  • I like the pre-introduction of masks and then circling back around to explain them.
  • I feel like Part 4 needs to be broken up and integrated across other lessons: it felt thin on its own.
  • Horizo
alee
alee commented Aug 17, 2021

Currently the episode 15 reflections text expects to happen after we go over functions but due to lesson episode reordering this is no longer the case. We either need to come up with other reflections or move the break after functions (which might be too long)

https://swcarpentry.github.io/python-novice-gapminder/15-coffee/index.html

https://swcarpentry.github.io/python-novice-gapminder/16-w

lachlandeer
lachlandeer commented Jul 30, 2018

In episode _episodes_rmd/12-time-series-raster.Rmd

There is a big chunk of code that can probably be made to look nicer via dplyr:

# Plot RGB data for Julian day 133
 RGB_133 <- stack("data/NEON-DS-Landsat-NDVI/HARV/2011/RGB/133_HARV_landRGB.tif")
 RGB_133_df <- raster::as.data.frame(RGB_133, xy = TRUE)
 quantiles = c(0.02, 0.98)
 r <- quantile(RGB_133_df$X133_HARV_landRGB.1, q
cmrfoley
cmrfoley commented Jun 12, 2018

The discussion of data types and data structures in "Vectors and data types" could be clarified. Perhaps even defining these terms before using them would help. Also note that the first sentence of the section reads "A vector is the most common and basic data type in R, and is pretty much the workhorse of R." perhaps this should be changed to "basic data structure"

Improve this page

Add a description, image, and links to the data-wrangling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-wrangling topic, visit your repo's landing page and select "manage topics."

Learn more