Tags: alex-s-gardner/DataFrames.jl

v1.5.0

[Diff since v1.4.4](JuliaData/DataFrames.jl@v1.4.4...v1.5.0)

**Closed issues:**
- New contents about handling missing values in DataFrame (JuliaData#1662)
- Functions taking collections of column names always require them to be in AbstractVectors (JuliaData#1769)
- Stack/Melt over multiple sets of variables (JuliaData#1839)
- Allow unstack to take multiple columns to unstack on (JuliaData#2148)
- Feature request: unstack multiple :values columns (JuliaData#2215)
- Add `all` keyword argument to `nonunique` (JuliaData#2238)
- special case `percentage` in `combine` (JuliaData#2272)
- Add a `pushfirst!` method (JuliaData#2275)
- add `filter` example to docs on taking subsets (JuliaData#2318)
- Some code blocks missing syntax highlighting in docs (JuliaData#2319)
- Stacking multiple groups of columns (JuliaData#2414)
- Add more keyword arguments to `stack` and `unstack` (JuliaData#2422)
- Add reverse and reverse! functions similar to sort and sort! (JuliaData#2438)
- Allow keeping first or last observation with unique function (JuliaData#2443)
- Add `insert!` (JuliaData#2446)
- Improve inline documentation of select to include examples of multiple columns not to be included (JuliaData#2513)
- Transposing DataFrame (JuliaData#2743)
- add a keyword to allow specifying target row order in joins (JuliaData#2753)
- Improve flatten (slightly breaking) (JuliaData#2767)
- Add manual part for indexing and selection (JuliaData#2887)
- a new method of the flatten function in DataFrames (JuliaData#2890)
- Generalization of the value parameter in the unstack function (JuliaData#3066)
- resolve circular reference issue when printing (JuliaData#3148)
- Support `allunique` with column selectors? (JuliaData#3205)
- Add support for Tables.AbstractRow to functions that take row (JuliaData#3244)
- Stack Overflow during type inference with large dataframes (JuliaData#3246)
- `innerjoin` fast path where join column is allequal? (JuliaData#3247)
- Invalidations when loading CSV (JuliaData#3248)
- Improve groupby sort (JuliaData#3251)
- improve performance of dropmissing (JuliaData#3254)
- Let DataFrame behave more like GroupedDataFrame with one zero-key group (JuliaData#3257)
- Lifecycle annotations (JuliaData#3259)
- `String` display quotation missing (JuliaData#3261)
- Bool columns are printed as 0/1 in HTML, but not in plain (JuliaData#3265)
- sum doesn't work with Missing column (JuliaData#3267)
- Views of DataFrame design issue (JuliaData#3272)
- Multi-threading hangs combine on Julia nightly (JuliaData#3275)
- Check CompatHelper setup (JuliaData#3278)
- Add `get` function for AbstractDataFrame (JuliaData#3281)
- Rename Iterators.partition (JuliaData#3284)

**Merged pull requests:**
- add Iterators.partition (JuliaData#3212) (@bkamins)
- add an option to intersect arguments passed to Cols (JuliaData#3224) (@bkamins)
- Add allunique and improve nonunique and describe (JuliaData#3232) (@bkamins)
- Add an option in joins to specify row order (JuliaData#3233) (@bkamins)
- Improve examples in the manual in basics.md (JuliaData#3236) (@bkamins)
- Add hints to use macro packages for new users (JuliaData#3238) (@bkamins)
- improve error message when used selector is incorrect (JuliaData#3242) (@bkamins)
- add support for Tables.AbstractRow in push!, pushfirst!, and insert! (JuliaData#3245) (@bkamins)
- fix deleteat! and subset! performance (JuliaData#3249) (@bkamins)
- Fix typo in documentation (JuliaData#3250) (@bkamins)
- Mention ReadStatTables.jl in documentation (JuliaData#3252) (@junyuan-chen)
- Add sorting options to groupby (JuliaData#3253) (@bkamins)
- Improve performance of dropmissing (JuliaData#3256) (@svilupp)
- add keep to nonunique, unique, and unique! (JuliaData#3260) (@bkamins)
- document breaking change policy (JuliaData#3262) (@bkamins)
- improve error message in operation specification syntax (JuliaData#3263) (@bkamins)
- Fix bug in subset[!] when handling no conditions case (JuliaData#3264) (@bkamins)
- Fix error in fast aggregation of missing only columns for sum and mean (JuliaData#3268) (@bkamins)
- add information about TableMetadataTools.jl to docs (JuliaData#3269) (@bkamins)
- Update TagBot.yml (JuliaData#3271) (@bkamins)
- correctly index into a SubDataFrame with no columns (JuliaData#3273) (@bkamins)
- Reduce size of multi-threading enablement to 100_000 (JuliaData#3274) (@bkamins)
- Improve allcombinations docstring + minor cleanups after JuliaData#3256 (JuliaData#3276) (@bkamins)
- Allow to pass multiple predicates in `Cols` and mix them with other selectors (JuliaData#3279) (@bkamins)
- update CompatHelper.jl setup (JuliaData#3280) (@bkamins)
- add haskey and get support for DataFrameColumns (JuliaData#3282) (@bkamins)
- Add `scalar` keyword argument to `flatten` (JuliaData#3283) (@bkamins)
- improve precompilation coverage (JuliaData#3285) (@bkamins)

v1.4.4

[Diff since v1.4.3](JuliaData/DataFrames.jl@v1.4.3...v1.4.4)

**Closed issues:**
- Segmentation fault Julia 1.8.2, DataFrames v1.4.3 (JuliaData#3227)
- sizeof() not working correctly with Dataframes (JuliaData#3229)
- subset / subset! `AbstractVector` restriction inconvenient (JuliaData#3230)

**Merged pull requests:**
- Explain column-independent operations (JuliaData#3225) (@bkamins)
- Fix unstack docstring (JuliaData#3226) (@bkamins)
- fix select bug with copycols=false on SubDataFrame (JuliaData#3231) (@bkamins)
- fix markdown tests (JuliaData#3234) (@bkamins)

v1.4.3

[Diff since v1.4.2](JuliaData/DataFrames.jl@v1.4.2...v1.4.3)

**Closed issues:**
- docs for `groupindices` has wrong example (JuliaData#3210)
- (Possible) Bug with `shuffle` when shuffling `DataFrame` rows (JuliaData#3211)
- Improve combine documentation (JuliaData#3214)
- ERROR: AssertionError: length(res) > 0 (JuliaData#3217)
- Column metadata anchored to wrong column after insertion of new columns (JuliaData#3218)

**Merged pull requests:**
- Make sure we use MIME when calling repr in GroupedDataFrame printing (JuliaData#3213) (@bkamins)
- add default style to metadata! and colmetadata! (JuliaData#3216) (@bkamins)
- fix insertcols! bug (not shifting column metadata) (JuliaData#3220) (@bkamins)
- fix HTML printing tests after PrettyTables.jl 2.2 release (JuliaData#3221) (@bkamins)
- make aggregation of empty GroupedDataFrame correct with AsTable (JuliaData#3222) (@bkamins)

v1.4.2

[Diff since v1.4.1](JuliaData/DataFrames.jl@v1.4.1...v1.4.2)

**Closed issues:**
- Make docstrings method specific (JuliaData#2015)
- Additional functions supported for DataFrames.jl (JuliaData#2088)
- OffsetArray Compatibility (JuliaData#2123)
- Return data frame unaltered when Not only includes columns that are not in data frame (JuliaData#2197)
- Kwarg to choose missing values for unstack (JuliaData#2205)
- Allow DF() as a selector in select and combine (JuliaData#2220)
- no method matching InvertedIndex(::String, ::String) (JuliaData#2227)
- add view::Bool kwarg to first and last (JuliaData#2845)
- Inconsistency in `push!`ing an empty row into a DataFrame (JuliaData#2953)
- Flatten errors on empty dataframe (JuliaData#3197)
- 10 seconds to `show(df)` of size (120764, 22) (JuliaData#3202)
- Ignoring ENV["LINES"] in 1.4.x (JuliaData#3203)
- JET.JL problem with v1.4.1 (JuliaData#3204)
- Speed of filter (JuliaData#3208)
- Allow `end` to select last column. (JuliaData#3209)

**Merged pull requests:**
- Mention DataFrameMacros.jl in the docs (JuliaData#3195) (@jkrumbiegel)
- make sure flatten works correctly on a data frame with zero rows (JuliaData#3198) (@bkamins)
- improve manual entry of assignment to a data frame (JuliaData#3201) (@bkamins)

v1.4.1

[Diff since v1.4.0](JuliaData/DataFrames.jl@v1.4.0...v1.4.1)

**Closed issues:**
- Filtering of `eachrow(df)` not working in 1.4.0 (JuliaData#3191)

**Merged pull requests:**
- make sure getindex on DataFrameRows does not alias passed selector (JuliaData#3192) (@bkamins)
- Add missing triple quotes around docstrings (JuliaData#3194) (@bkamins)

v1.4.0

[Diff since v1.3.6](JuliaData/DataFrames.jl@v1.3.6...v1.4.0)

**Closed issues:**
- Metadata for columns and/or DataFrames (JuliaData#35)
- What metadata should be (JuliaData#2276)
- Add metadata (JuliaData#2961)
- Add precompilation for PooledArray for all allowed ref types (JuliaData#3013)
- update precompilation for 1.4 release (JuliaData#3080)
- Require Julia 1.6 (JuliaData#3136)
- Metadata: follow-up notes (JuliaData#3168)
- Add references to `names` documentation (JuliaData#3171)
- sync Tables.subset (JuliaData#3180)
- change valuestransform in unstack (JuliaData#3184)
- better handling of corner cases of GroupedDataFrame printing (JuliaData#3186)
- Version incompatibility with PrettyTables.jl (JuliaData#3188)

**Merged pull requests:**
- Metadata on data frame and column level (JuliaData#3055) (@bkamins)
- Use PrettyTables.jl as HTML backend (JuliaData#3096) (@ronisbr)
- Improved REPL printing for GroupedDataFrames (JuliaData#3107) (@Jollywatt)
- 1-arg permutedims(df) (JuliaData#3115) (@anandijain)
- Require Julia 1.6 (JuliaData#3145) (@bkamins)
- synch NEWS.md between 1.4 and 1.3 branches (JuliaData#3164) (@bkamins)
- add ShiftedArrays 2.x support (JuliaData#3165) (@bkamins)
- improve error message when column is not found (JuliaData#3166) (@bkamins)
- Improve metadata documentation (JuliaData#3169) (@bkamins)
- Reduce memory use in threading correctness tests (JuliaData#3172) (@yakir12)
- Fix typos in metadata docs (JuliaData#3174) (@nalimilan)
- fix metadata handling in permutedims (JuliaData#3176) (@bkamins)
- Add better error message on error when pushing rows to a data frame (JuliaData#3177) (@bkamins)
- improve names docstring (JuliaData#3178) (@bkamins)
- Avoid method dispatch ambiguities in DataFrames.jl (JuliaData#3179) (@bkamins)
- switch from view to viewhint in Tables.subset (JuliaData#3181) (@bkamins)
- precompilation for 1.4 release (JuliaData#3182) (@bkamins)
- enable multithreading tests of joins only on 64 bit machines (JuliaData#3183) (@bkamins)
- rename valuestransform to combine in unstack (JuliaData#3185) (@bkamins)
- improve printing of GroupedDataFrame in corner cases (JuliaData#3187) (@bkamins)
- Sync metadata implementation with DataAPI.jl 1.12.0 (JuliaData#3189) (@bkamins)
- Fix deprecation warning when sorting data frame with no columns (JuliaData#3190) (@bkamins)

v1.3.6

[Diff since v1.3.5](JuliaData/DataFrames.jl@v1.3.5...v1.3.6)

**Closed issues:**
- Use PrettyTables.jl as HTML backend (JuliaData#2835)
- Metadata display (JuliaData#3076)
- add length (or nrow) to describe (JuliaData#3095)
- Provide `nunique` for Integers in `describe` (JuliaData#3098)
- missing AsTable (JuliaData#3123)
- add Tables.subset and move ByRow to Tables (JuliaData#3127)
- Two Type Warnings (JuliaData#3134)
- typos (JuliaData#3141)
- Ancient version of DataFrames (JuliaData#3152)
- CSV.read returned DataFrame is different? (JuliaData#3154)

**Merged pull requests:**
- Remove unused where clauses (JuliaData#3137) (@bkamins)
- markdown typos (JuliaData#3142) (@musvaage)
- fix typos in docs and comments (JuliaData#3143) (@musvaage)
- add :nnonunique and :nuniqueall to describe (JuliaData#3146) (@bkamins)
- Fix type assertion in _filter_helper (JuliaData#3155) (@bkamins)
- Fix broken links in docs (JuliaData#3156) (@bkamins)
- further improve OutOfMemory problem handling in tests (JuliaData#3157) (@bkamins)
- Synchronize ByRow and Tables.subset with Tables.jl 1.8 (JuliaData#3158) (@bkamins)

v1.3.5

[Diff since v1.3.4](JuliaData/DataFrames.jl@v1.3.4...v1.3.5)

**Closed issues:**
- add keepat! and insert! and pushfirst! (JuliaData#2936)
- Is possible to restrict number of threads? (JuliaData#2988)
- Investigate Vector on DataFrameRow failing (JuliaData#3017)
- UndefRefError: access to undefined reference (JuliaData#3053)
- Should we make AsTable constructor more restrictive (JuliaData#3062)
- Bug in unstack with valuestransform (JuliaData#3063)
- Performance issue of `transform!` on a `SubDataFrame` (JuliaData#3069)
- Fix handling of cols kwarg in vcat in case no data frames are passed (JuliaData#3074)
- Report return type of subset function in error message (JuliaData#3077)
- unique - ArgumentError: collection must be non-empty (JuliaData#3085)
- documentation enhancement request (JuliaData#3086)
- column not found in the data frame (JuliaData#3088)
- DataFrame from Matrix with Error when not specifying `:auto` (JuliaData#3090)
- outerjoins in documentation (JuliaData#3091)
- make subset keep group order when `ungroup=false` (JuliaData#3093)
- Constructor from Vector{Tuple} (JuliaData#3103)
- [Problem] about filter and dropmissing usage (JuliaData#3105)
- VSCode linter does not recognize names method. (JuliaData#3109)
- insertcols! fails (JuliaData#3111)
- innerjoin suffixes (JuliaData#3112)
- using map over `GroupedDataFrame`s is reserved (JuliaData#3113)
- filter is assigning value by mistake (JuliaData#3117)
- Creating SubDataFrame from SubDataFrame is extremely slow (JuliaData#3118)
- Sync stack with Julia 1.9 (JuliaData#3119)
- Drop master branch? (JuliaData#3121)
- Count nonmissing in `describe` (JuliaData#3126)
- Programmatically generate DataFrame vector from x DataFrames with varying x (JuliaData#3128)
- Tag a release? Downgrades Compat.jl (JuliaData#3132)

**Merged pull requests:**
- Add a keyword argument to disable multithreading (JuliaData#3030) (@nalimilan)
- update NEWS.md (JuliaData#3046) (@bkamins)
- add resize!, keepat!, pop!, popfirst!, popat! (JuliaData#3047) (@bkamins)
- Update "Julia Data Ecosystem" list (JuliaData#3048) (@Krastanov)
- Update working_with_dataframes.md (JuliaData#3050) (@reumle)
- update allcombinations docstring (JuliaData#3056) (@bkamins)
- make AsTable more careful when accepting argument (JuliaData#3064) (@bkamins)
- fix unstack for cases when column names overlap (JuliaData#3065) (@bkamins)
- add extra safety step in filter with AsTable (JuliaData#3067) (@bkamins)
- add prepend! (JuliaData#3068) (@bkamins)
- Performance of transform! on SubDataFrame (JuliaData#3070) (@bkamins)
- patch deleteat! (JuliaData#3071) (@bkamins)
- Add insert! and pushfirst! (JuliaData#3072) (@bkamins)
- Update remaining docs to reflect `threads` argument (JuliaData#3078) (@nalimilan)
- Improve subset error message (JuliaData#3079) (@bkamins)
- Fix vcat in case no data frames are passed (JuliaData#3081) (@bkamins)
- Improve documentation and speedup isempty (JuliaData#3083) (@bkamins)
- Add more examples of operation specification language (JuliaData#3087) (@bkamins)
- Improve join error message (JuliaData#3089) (@bkamins)
- fix wrong docs of outerjoin (JuliaData#3092) (@bkamins)
- fix subset with ungroup (JuliaData#3094) (@bkamins)
- Fix docstring (JuliaData#3099) (@bkamins)
- Fix threading docstring in unstack (JuliaData#3104) (@bkamins)
- add JuliaCon 2022 tutorial (JuliaData#3106) (@bkamins)
- Extend `stack` from Base/Compat (JuliaData#3120) (@ararslan)
- Clarify that `sort!` permutes rows in-place (JuliaData#3124) (@nalimilan)
- Fix typo (JuliaData#3129) (@jariji)

v1.3.4

[Diff since v1.3.3](JuliaData/DataFrames.jl@v1.3.3...v1.3.4)

**Closed issues:**
- add expandgrid (JuliaData#3027)
- `stack` not catching invalid value of keyword `variable_eltype` (JuliaData#3042)
- Appending `DataFrame`s after `CSV.read` fails for different length `String` columns (JuliaData#3044)
- make `clipboard(df)` work (JuliaData#3045)

**Merged pull requests:**
- add allcombinations (JuliaData#3031) (@bkamins)
- allow scalars in subset and subset! as conditions (JuliaData#3032) (@bkamins)
- Fix handling of variable_eltype in stack (JuliaData#3043) (@bkamins)

v1.3.3

[Diff since v1.3.2](JuliaData/DataFrames.jl@v1.3.2...v1.3.3)

**Closed issues:**
- Add shuffle, shuffle! functions (JuliaData#2048)
- Add `groupindices` as special source argument in minilanguage (JuliaData#2683)
- Update the broadcasted getproperty when Julia 1.7 is out (JuliaData#2804)
- Better error for disallowmissing function (JuliaData#2945)
- Could it be useful to add the ungroup keyword to the filter function? (JuliaData#2954)
- Error message for "Number of returned columns does not match" (JuliaData#2959)
- Function to insert columns (JuliaData#2972)
- Allow functions in DataFrames.jl to pick how many threads they use (JuliaData#2992)
- `first(gdf::GroupedDataFrame, n::Int)` should give a `GroupedDataFrame`? (JuliaData#2993)
- unstack fails without an id column (JuliaData#2994)
- Some error on Julia 1.7.1 (JuliaData#2996)
- groupby docs error? (JuliaData#2997)
- permutedims with CategoricalArray (JuliaData#3003)
- `d[:a, ]` changed the original data.frame (JuliaData#3014)
- Add keyword argument `source` in `mapreduce` to match `reduce` (JuliaData#3016)
- Fix describe documentation (JuliaData#3018)
- Flag to disable threading for debug purposes (JuliaData#3019)
- Make indexing of eachrow and eachcol return the object of the same type on a view of the parent (JuliaData#3023)
- subset(df) with no conditions should return unaltered DataFrame (JuliaData#3024)
- Keyword arg `cols` and `source` for `mapreduce` (JuliaData#3028)
- `outerjoin`: keyword argument `matchmissing` not correctly passed (JuliaData#3039)

**Merged pull requests:**
- allow no rowkey in unstack (JuliaData#2995) (@bkamins)
- allow function in allowduplicates in unstack (JuliaData#2998) (@bkamins)
- Use `julia-actions/cache`; also for the `docs` job (JuliaData#2999) (@rikhuijzer)
- Fix typo in `groupby` docstring (JuliaData#3000) (@nalimilan)
- Implementation of eachindex, proprow, and groupindices (JuliaData#3001) (@bkamins)
- Handle Base.CanonicalIndexError introduced in Julia 1.8 (JuliaData#3002) (@bkamins)
- make permutedims more flexible (JuliaData#3004) (@bkamins)
- add `first`, `last` methods with `n` for gdf (JuliaData#3006) (@ericphanson)
- doc: remove superfluous word (JuliaData#3007) (@Mo-Gul)
- Improved error when column numbers do not match in transformations (JuliaData#3009) (@bkamins)
- add reverse!, shuffle, shuffle!, permute!, and invpermute! (JuliaData#3010) (@bkamins)
- Add fillcombinations function (JuliaData#3012) (@bkamins)
- Fix docstrings of fast row-wise transformation (JuliaData#3015) (@bkamins)
- add insertcols (JuliaData#3020) (@bkamins)
- add ungroup keyword argument to filter (JuliaData#3021) (@bkamins)
- make broadcasting assignment consistent with ! (JuliaData#3022) (@bkamins)
- handle empty args in subset (JuliaData#3025) (@bkamins)
- avoid categorical promotion (JuliaData#3026) (@bkamins)
- Update LICENSE.md (JuliaData#3029) (@bkamins)
- Use cycle notation to speed up `permute!` (JuliaData#3035) (@LilithHafner)
- Make indexing of eachrow return the object of the same type on a view of the parent (JuliaData#3037) (@bkamins)
- Fix keyword argument syntax in `DataFrame` docstring (JuliaData#3038) (@nalimilan)
- make sure we correctly pass matchmissing in joins (JuliaData#3040) (@bkamins)