Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Branch 43 downgraded tonic #5

Open
wants to merge 1,516 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
1516 commits
Select commit Hold shift + click to select a range
583bdc2
feat: add support for Substrait ExtendedExpression (#12728)
westonpace Oct 7, 2024
134939a
Transformed::new_transformed: Fix documentation formatting (#12787)
progval Oct 7, 2024
ef227f4
fix: Correct results for grouping sets when columns contain nulls (#1…
eejbyfeldt Oct 7, 2024
5360d20
Migrate documentation for all string functions from scalar_functions…
Omega359 Oct 7, 2024
577e4bb
Account for constant equivalence properties in union, tests (#12562)
alamb Oct 7, 2024
d8405ba
Minor: clarify comment about empty dependencies (#12786)
alamb Oct 8, 2024
47664df
Introduce Signature::String and return error if input of `strpos` is…
jayzhan211 Oct 8, 2024
b3d2d49
Minor: improve docs on MovingMin/MovingMax (#12790)
alamb Oct 8, 2024
e00af2c
Add slt tests (#12721)
alamb Oct 8, 2024
478c6be
Update Catalog README (#12797)
jonathanc-n Oct 8, 2024
7d36059
Remove redundant aggregate/window/scalar function documentation (#12745)
alamb Oct 8, 2024
c412c74
Fix bug in TopK aggregates (#12766)
avantgardnerio Oct 8, 2024
6d61503
clean up the todo comments in unnest.slt (#12795)
goldmedal Oct 8, 2024
a769025
Refactor `DependencyMap` and `Dependencies` into structs (#12761)
alamb Oct 8, 2024
48d395a
Remove unnecessary `DFSchema::check_ambiguous_name` (#12805)
jonahgao Oct 8, 2024
9df957b
API from `ParquetExec` to `ParquetExecBuilder` (#12799)
alamb Oct 8, 2024
29e9af2
Minor: add documentation note about `NullState` (#12791)
alamb Oct 9, 2024
d83b3b2
Chore: Move `aggregate statistics` optimizer test from core to optimi…
jayzhan211 Oct 9, 2024
fa95b75
Clarify documentation on ArrowBytesMap and ArrowBytesViewMap (#12789)
alamb Oct 9, 2024
e4934fd
Bump cookie and express in /datafusion/wasmtest/datafusion-wasm-app (…
dependabot[bot] Oct 9, 2024
3d347c9
Remove unused dependencies and features (#12808)
jonahgao Oct 9, 2024
3353c06
Add Aggregation fuzzer framework (#12667)
Rachelint Oct 9, 2024
30de35e
Add `PartitionEvaluatorArgs` to `WindowUDFImpl::partition_evaluator` …
jcsherin Oct 9, 2024
8cf030a
Retry apt-get and rustup on CI (#12714)
findepi Oct 9, 2024
c92d303
Support creating tables via SQL with `FixedSizeList` column (e.g. `a …
jandremarais Oct 9, 2024
ffe73fe
Make HashJoinExec::join_schema public (#12807)
progval Oct 9, 2024
0c7eb0d
Fix convert_to_state bug in `GroupsAccumulatorAdapter` (#12834)
alamb Oct 9, 2024
e0b807b
Improve description of function migration (#12743)
alamb Oct 9, 2024
58c7085
Fix: approx_percentile_cont_with_weight Panic (#12823)
jonathanc-n Oct 10, 2024
8945e7a
Fix clippy error on wasmtest (#12844)
jonahgao Oct 10, 2024
43d0bcf
Fix panic on wrong number of arguments to substr (#12837)
eejbyfeldt Oct 10, 2024
101e455
Fix Bug in Display for ScalarValue::Struct (#12856)
avantgardnerio Oct 10, 2024
939ef9e
Convert `rank` / `dense_rank` and `percent_rank` builtin functions to…
jatin510 Oct 10, 2024
4534a28
Support DictionaryString for Regex matching operators (#12768)
blaginin Oct 10, 2024
7093bbd
feat(substrait): add intersect support to consumer (#12830)
tokoko Oct 10, 2024
f7591fb
Minor: Small comment changes in sql folder (#12838)
jonathanc-n Oct 10, 2024
d5d9d30
Crypto Function Migration (#12840)
jonathanc-n Oct 10, 2024
58c32cb
Add DuckDB struct test and row as alias (#12841)
jayzhan211 Oct 10, 2024
3b6aac2
Support struct coercion in `type_union_resolution` (#12839)
jayzhan211 Oct 11, 2024
3bc7714
memory pool example (#12849)
2010YOUY01 Oct 11, 2024
eddade7
Migrate documentation for all core functions from scalar_functions.md…
Omega359 Oct 11, 2024
a8d3fae
Migrate documentation for Aggregate Functions to code (#12861)
jonathanc-n Oct 11, 2024
576da1d
Wordsmith project description (#12778)
matthewmturner Oct 11, 2024
6267ede
Added check for aggregate functions in optimizer rules (#12860)
jonathanc-n Oct 12, 2024
e7ac843
Bug-fix: MemoryExec sort expressions do NOT refer to the projected sc…
berkaysynnada Oct 12, 2024
1582e8d
Optimize `iszero` function (3-5x faster) (#12881)
simonvandel Oct 13, 2024
ebfc155
Macro for creating record batch from literal slice (#12846)
timsaucer Oct 13, 2024
646f40a
Implement special min/max accumulator for Strings and Binary (10% fas…
alamb Oct 13, 2024
1b10c9f
Make PruningPredicate's rewrite public (#12850)
adriangb Oct 13, 2024
636f433
Minor: add flags for temporary ddl (#12561)
hailelagi Oct 14, 2024
181d38c
Adding test for verifying octet_length now works with string view (#1…
Omega359 Oct 14, 2024
849bbe7
Remove Expr clones in `select_to_plan` (#12887)
jonahgao Oct 14, 2024
b932cdb
Minor: added to docs in expr folder (#12882)
jonathanc-n Oct 14, 2024
21cb357
Print undocumented functions to console while generating docs (#12874)
alamb Oct 14, 2024
16589b5
Fix: handle NULL offset of NTH_VALUE window function (#12851)
HuSen8891 Oct 14, 2024
746380b
Optimize `signum` function (3-25x faster) (#12890)
simonvandel Oct 14, 2024
4ceb950
re-export PartitionEvaluatorArgs from datafusion_expr::function (#12878)
Michael-J-Ward Oct 14, 2024
6e4bf05
Unparse Sort with pushdown limit to SQL string (#12873)
goldmedal Oct 14, 2024
6c0670d
Add spilling related metrics for aggregation (#12888)
2010YOUY01 Oct 14, 2024
5391c98
Move equivalence fuzz testing to fuzz test binary (#12767)
alamb Oct 14, 2024
f2564b7
Remove unused `math_expressions.rs` (#12917)
jonahgao Oct 14, 2024
fce331a
Migrate Regex Functions from static docs (#12886)
jonathanc-n Oct 14, 2024
377a4c5
Improve AggregationFuzzer error reporting (#12832)
alamb Oct 15, 2024
d9450da
Import Arc consistently (#12899)
findepi Oct 15, 2024
1936774
Optimize `isnan` (2-5x faster) (#12889)
simonvandel Oct 15, 2024
beb1034
move StringArrayType, StringViewArrayBuilder, etc outside of string m…
Omega359 Oct 15, 2024
a9b202c
Migrate documentation for all math functions from scalar_functions.md…
juroberttyb Oct 15, 2024
4a0b768
Remove redundant unsafe in test (#12914)
findepi Oct 15, 2024
5a0ea0b
fix(substrait): remove optimize calls from substrait consumer (#12800)
tokoko Oct 15, 2024
399e840
Ensure that math functions fulfil the ColumnarValue contract (#12922)
joroKr21 Oct 16, 2024
747001a
Combine the logic of rank, dense_rank and percent_rank udwf to reduc…
jatin510 Oct 16, 2024
e5cdc17
feat: support push down limit when full join (#12963)
JasonLi-cn Oct 16, 2024
c311cf5
Implement `GroupColumn` support for `StringView` / `ByteView` (faster…
Rachelint Oct 16, 2024
8d46fc1
Implement native support StringView for REGEXP_LIKE (#12897)
tlm365 Oct 16, 2024
8802f63
feat: Implement grouping function using grouping id (#12704)
eejbyfeldt Oct 16, 2024
87c2081
fix(substrait): consuming AggregateRel as last node (#12875)
tokoko Oct 16, 2024
f8add9b
Refactor imports to use `util` module (#12885)
loloxwg Oct 16, 2024
0e08e06
Fix zero type in `expr % 1` simplification (#12913)
eejbyfeldt Oct 16, 2024
90720c0
Optimize performance of `math::cot` (~2x faster) (#12910)
tlm365 Oct 16, 2024
435f959
Update TO_DATE, TO_TIMESTAMP scalar functions to support LargeUtf8, U…
Omega359 Oct 16, 2024
9663141
Expand wildcard expressions in distinct on (#12941)
epsio-banay Oct 16, 2024
db85d07
chores: remove redundant clone (#12964)
JasonLi-cn Oct 16, 2024
44127ec
Fix: handle NULL input in lead/lag window function (#12811)
HuSen8891 Oct 16, 2024
875aaa6
Fix logical vs physical schema mismatch for aliased `now()` (#12951)
wiedld Oct 16, 2024
caeabc1
Optimize performance of math::trunc (#12909)
tlm365 Oct 16, 2024
91899d4
Migrate Array function Documentation to code (#12948)
jonathanc-n Oct 16, 2024
5d6254c
Minor: fix Aggregation Docs from review (#12880)
jonathanc-n Oct 16, 2024
6d5ecb6
Add slt test for `DISTINCT ON` with wildcard (#12968)
alamb Oct 16, 2024
7c0e150
Minor: expr-doc small fixes (#12960)
jonathanc-n Oct 16, 2024
589c830
Fix 'Too many open files' on fuzz test. (#12961)
dhegberg Oct 16, 2024
d8e4e92
docs: Add documentation about conventional commits (#12971)
andygrove Oct 16, 2024
3d1d28d
fix: Add Int32 type override for Dialects (#12916)
peasee Oct 17, 2024
0e2023d
fix: using simple string match replace regex match for contains udf (…
zhuliquan Oct 17, 2024
56946b4
Increase minimum supported Rust version (MSRV) to 1.79 (#12962)
findepi Oct 17, 2024
e63abe7
feat(substrait): add set operations to consumer, update substrait to …
tokoko Oct 17, 2024
1ba1e53
Unparse `SubqueryAlias` without projections to SQL (#12896)
goldmedal Oct 17, 2024
b098893
Fix 2 bugs related to push down partition filters (#12902)
eejbyfeldt Oct 17, 2024
54bd26e
Move TableConstraint to Constraints conversion (#12953)
findepi Oct 17, 2024
ccfe020
Added current_timestamp alias (#12958)
jonathanc-n Oct 17, 2024
ad273ca
Improve unparsing for `ORDER BY`, `UNION`, Windows functions with Agg…
sgrebnov Oct 17, 2024
0ed369e
Handle one-element array return value in ScalarFunctionExpr (#12965)
joroKr21 Oct 17, 2024
f718fe2
Migrate datetime documentation to code (#12966)
jatin510 Oct 17, 2024
700b07f
Fix CI / regenerate functions (#12991)
alamb Oct 18, 2024
efe5708
Convert `BuiltInWindowFunction::{Lead, Lag}` to a user defined window…
jcsherin Oct 18, 2024
24148bd
Add links to new_constraint_from_table_constraints doc (#12995)
findepi Oct 18, 2024
87e931c
Split output batches of joins that do not respect batch size (#12969)
alihan-synnada Oct 18, 2024
e9435a9
Fix:fix HashJoin projection swap (#12967)
my-vegetable-has-exploded Oct 18, 2024
97f7491
refactor(substrait): refactor ReadRel consumer (#12983)
tokoko Oct 18, 2024
42f9060
feat(substrait): add wildcard handling to producer (#12987)
tokoko Oct 18, 2024
3405234
Move SMJ join filtered part out of join_output stage. LeftOuter, Left…
comphead Oct 18, 2024
73ba4c4
feat: Add regexp_count function (#12970)
Omega359 Oct 18, 2024
8c9b915
Minor: Fixed regexpr_match docs (#13008)
jonathanc-n Oct 18, 2024
10af8a7
Improve performance for physical plan creation with many columns (#12…
askalt Oct 18, 2024
34bd823
Remove logical cross join in planning (#12985)
Dandandan Oct 18, 2024
12568bf
fix spelling (#13014)
jonathanc-n Oct 19, 2024
7a34147
replace take_array with arrow util (#13013)
akurmustafa Oct 19, 2024
c7e5d8d
Improve recursive `unnest` options API (#12836)
duongcongtoai Oct 20, 2024
373fe23
Update version to 42.1.0, add CHANGELOG (#12986) (#12989)
alamb Oct 20, 2024
8d4614d
Don't preserve functional dependency when generating UNION logical pl…
Sevenannn Oct 20, 2024
972e3ab
feat: Decorrelate more predicate subqueries (#12945)
eejbyfeldt Oct 20, 2024
e9584bc
Added expresion to "with_standard_argument" (#12926)
jonathanc-n Oct 20, 2024
b42d9b8
fix: Dialect requires derived table alias (#12994)
peasee Oct 21, 2024
69a4648
[Minor]: Add data based sort expression test (#12992)
akurmustafa Oct 21, 2024
edeca39
Removed last usages of scalar_inputs, scalar_input_types and inputs2 …
buraksenn Oct 21, 2024
2de6e29
Minor: Update release instructions (#13024)
alamb Oct 21, 2024
701cb00
fix: join swap for projected semi/anti joins (#13022)
korowa Oct 21, 2024
45a316c
Extract CSE logic to `datafusion_common` (#13002)
peter-toth Oct 21, 2024
2535d88
enhance unparsing plan with pushdown to avoid unnamed subquery (#13006)
goldmedal Oct 21, 2024
4fca0d5
fix: Verify supported type for Unary::Plus in sql planner (#13019)
eejbyfeldt Oct 21, 2024
34fbe8e
Fix count on all null `VALUES` clause (#13029)
findepi Oct 21, 2024
b978cf8
Support filter in cross join elimination (#13025)
Dandandan Oct 22, 2024
465d660
Do no alias in TableScan filters (#13048)
eejbyfeldt Oct 22, 2024
755ba91
[minor]: remove same util functions from the code base. (#13026)
akurmustafa Oct 22, 2024
c22abb4
Improve `AggregateFuzz` testing: generate random queries (#12847)
alamb Oct 22, 2024
ef1365a
Fix functions with Volatility::Volatile and parameters (#13001)
agscpp Oct 22, 2024
227908f
Migrate documentation for `regr*` aggregate functions to code (#12871)
alamb Oct 22, 2024
91d2886
fix(substrait): disallow union with a single input (#13023)
tokoko Oct 22, 2024
818ce3f
refactor: Incorporate RewriteDisjunctivePredicate rule into SimplifyE…
eejbyfeldt Oct 22, 2024
afecd7b
Move filtered SMJ right join out of `join_partial` phase (#13053)
comphead Oct 22, 2024
cf60da9
Remove functions and types deprecated since 37 (#13056)
findepi Oct 22, 2024
d3920f3
Minor: Cleaned physical-plan Comments (#13055)
jonathanc-n Oct 22, 2024
cfe05b8
improve the condition checking for unparsing table_scan (#13062)
goldmedal Oct 23, 2024
d2a5e27
minor: simplify associated item bound of `hash_array_primitive` (#13070)
jonahgao Oct 23, 2024
3aa9714
Run optimzer rules on subqueries by default (#13066)
eejbyfeldt Oct 23, 2024
521966a
extended log.rs tests for unary/binary and f32/f64 casting (#13034)
buraksenn Oct 23, 2024
a4e6b07
feat: Convert CumeDist to UDWF (#13051)
jonathanc-n Oct 23, 2024
211e76e
Fix check_not_null_constraints null detection (#13033)
findepi Oct 23, 2024
7a40344
Update list of TPC-DS queries (#13075)
Dandandan Oct 23, 2024
3e940a9
Fix logical vs physical schema mismatch for UNION where some inputs a…
wiedld Oct 23, 2024
de526a9
Improve CSE stats (#13080)
peter-toth Oct 24, 2024
18b2aaa
Infer data type from schema for `Values` and add struct coercion to `…
jayzhan211 Oct 24, 2024
3f3a0cf
feat: support arbitrary expressions in `LIMIT` plan (#13028)
jonahgao Oct 24, 2024
8adbc23
[minor]: use arrow take_batch instead of get_record_batch_indices (#1…
akurmustafa Oct 24, 2024
f2da32b
deprecated (#13076)
buraksenn Oct 24, 2024
ac827ab
feat: Migrate Map Functions (#13047)
jonathanc-n Oct 24, 2024
307c1ea
Minor: Add documentation for `cot` (#13069)
alamb Oct 24, 2024
631408b
Documentation: Add API deprecation policy (#13083)
comphead Oct 24, 2024
1b14655
changed doc instance (#13097)
jonathanc-n Oct 24, 2024
31701b8
chore: Added a number of physical planning join benchmarks (#13085)
mnorfolk03 Oct 24, 2024
9e636d8
[docs]: migrate lead/lag window function docs to new docs (#13095)
buraksenn Oct 24, 2024
2322933
minor: Add deprecated policy to the contributor guide contents and fi…
comphead Oct 24, 2024
6a3c0b0
feat: improve type inference for WindowFrame (#13059)
notfilippo Oct 25, 2024
13a4225
Introduce `binary_as_string` parquet option, upgrade to arrow/parquet…
goldmedal Oct 25, 2024
02b9693
Convert `ntile` builtIn function to UDWF (#13040)
jatin510 Oct 25, 2024
7f32dce
Fix more instances of schema missing metadata (#13068)
itsjunetime Oct 25, 2024
06594c7
Bug-fix / Limit with_new_exprs() (#13109)
berkaysynnada Oct 25, 2024
813220d
Move subquery check from analyzer to PullUpCorrelatedExpr (#13091)
eejbyfeldt Oct 25, 2024
bdcf822
Include IMDB in benchmark README (#13107)
2010YOUY01 Oct 25, 2024
9623690
removed --prefer_hash_join option that causes an error when running t…
neyama Oct 25, 2024
21cfd6c
Make CI test error if a function is not documented (#12938)
alamb Oct 25, 2024
7b2284c
Allow using `cargo nextest` for running tests (#13045)
alamb Oct 25, 2024
73cfa6c
feat: Add `Date32`/`Date64` in aggregate fuzz testing (#13041)
LeslieKid Oct 26, 2024
22a242c
docs: Added Special Functions Page (#13102)
jonathanc-n Oct 26, 2024
d2511b2
fix: planning of prepare statement with limit clause (#13088)
jonahgao Oct 26, 2024
7df3e5c
Add benchmark for memory-limited aggregation (#13090)
2010YOUY01 Oct 26, 2024
412ca4e
Add clickbench parquet based queries to sql_planner benchmark (#13103)
Omega359 Oct 26, 2024
62b063c
Improve documentation and examples for `SchemaAdapterFactory`, make `…
alamb Oct 26, 2024
146f16a
Move filtered SMJ Left Anti filtered join out of `join_partial` phase…
comphead Oct 26, 2024
5db2740
Improve TableScan with filters pushdown unparsing (multiple filters s…
sgrebnov Oct 27, 2024
e22d231
Raise a plan error on union if column count is not the same between p…
Omega359 Oct 28, 2024
a0588cc
[docs]: added `alternative_syntax` function for docs (#13140)
jonathanc-n Oct 28, 2024
132b232
Minor: Delete old cume_dist and percent_rank docs (#13137)
jonathanc-n Oct 28, 2024
1fd6116
Add basic support for `unnest` unparsing (#13129)
sgrebnov Oct 28, 2024
0b45b9a
Improve TableScan with filters pushdown unparsing (joins) (#13132)
sgrebnov Oct 29, 2024
467a804
Report offending plan node when In/Exist subquery misused (#13155)
findepi Oct 29, 2024
80ad713
Remove unused assert_analyzed_plan_ne test helper (#13121)
findepi Oct 29, 2024
feeb32a
Add alternative syntax for extract, trim and substring. (#13143)
Omega359 Oct 29, 2024
67b0f25
Fix Utf8View as Join Key (#13115)
demetribu Oct 29, 2024
d00a089
Add Support for `modulus` operation in substrait (#13108)
LatrecheYasser Oct 29, 2024
4e38abd
unify cast_to function of ScalarValue (#13122)
JasonLi-cn Oct 29, 2024
ac79ef3
Add unused_qualifications with deny level to linter. Fix unused_quali…
dhegberg Oct 29, 2024
89e71ef
[Optimization] Infer predicate under all JoinTypes (#13081)
JasonLi-cn Oct 29, 2024
d62f262
feat(substrait): support order_by in aggregate functions (#13114)
bvolpato Oct 29, 2024
c03e260
Support `negate` expression in substrait (#13112)
LatrecheYasser Oct 29, 2024
1c2a2fd
Fix an issue with to_char signature not working correctly with timezo…
Omega359 Oct 29, 2024
b30d12a
chore: re-export functions_window_common::ExpressionArgs (#13149)
Michael-J-Ward Oct 29, 2024
d764c4a
minor: Fix build on main (#13159)
eejbyfeldt Oct 29, 2024
444a673
feat: Support Substrait's IntervalCompound type/literal instead of in…
Blizzara Oct 29, 2024
223bb02
docs: switch completely to generated docs for scalar and aggregate fu…
Omega359 Oct 29, 2024
9df766f
fix: add missing `NotExpr::evaluate_bounds` (#13082)
crepererum Oct 30, 2024
8c48c39
Update test case for issue #5771 showing it is resolved (#13180)
eejbyfeldt Oct 30, 2024
903a098
Test LIKE with dynamic pattern (#13141)
findepi Oct 30, 2024
d36c229
Minor: improve testing docs, mention `cargo nextest` (#13160)
alamb Oct 30, 2024
eb558de
minor: Update HOWTO to help with updating new docs (#13172)
jonathanc-n Oct 30, 2024
63e8e6a
Add config option `skip_physical_aggregate_schema_check ` (#13176)
alamb Oct 30, 2024
2d7892b
Enable reading `StringViewArray` by default from Parquet (8% improvem…
alamb Oct 30, 2024
7d34ccc
Increase fuzz testing of streaming group by / low cardinality columns…
alamb Oct 30, 2024
68bf7ad
fix: Order by mentioning missing column multiple times (#13158)
eejbyfeldt Oct 30, 2024
8c6bb39
fix: import JoinTestType without triggering unused_qualifications lin…
smarticen Oct 30, 2024
538e0d1
FFI initial implementation (#12920)
timsaucer Oct 30, 2024
4975829
Report file location and offset when CSV schema mismatch (#13185)
findepi Oct 30, 2024
f23360f
Round robin polling between tied winners in sort preserving merge (#1…
jayzhan211 Oct 30, 2024
2f745b7
Prepare for 42.2.0 release (#13191) (#13193)
alamb Oct 31, 2024
cf76421
Fix rendering of dictionary empty string values in SLT tests (#13198)
findepi Oct 31, 2024
68c042d
fix: default UDWFImpl::expressions returns all expressions (#13169)
Michael-J-Ward Oct 31, 2024
7ae1ccb
Improve push down filter of join (#13184)
JasonLi-cn Oct 31, 2024
2047d7f
feat: Implement LeftMark join to fix subquery correctness issue (#13134)
eejbyfeldt Oct 31, 2024
f2bebcd
Minor: Reduce indirection for finding changlog (#13199)
alamb Oct 31, 2024
d2a15b3
feat: support logical plan for `EXECUTE` statement (#13194)
jonahgao Nov 1, 2024
6c5823e
Support `DictionaryArray` in `OVER` clause (#13153)
adriangb Nov 1, 2024
5e53b63
Allow testing records with sibling whitespace in SLT tests and add mo…
findepi Nov 1, 2024
87f0838
Use single file write when an extension is present in the path. (#13079)
dhegberg Nov 1, 2024
a34e237
Minor: make `Expr::volatile` infallible (#13206)
alamb Nov 1, 2024
592b924
feat(substrait): handle emit_kind when consuming Substrait plans (#13…
vbarua Nov 1, 2024
a2e5330
fix: date_bin() on timstamps before 1970 (#13204)
mhilton Nov 1, 2024
b7f4db4
Deprecate ScalarUDF::invoke and invoke_no_args for invoke_batch (#13179)
findepi Nov 1, 2024
6b76a35
consider volatile function in simply_expression (#13128)
Lordworms Nov 1, 2024
9ff0800
Minor: fix merge conflict (#13219)
alamb Nov 1, 2024
752561a
Convert LexOrdering `type` to `struct`. (#13146)
ngli-me Nov 1, 2024
24d953e
fix: array_resize null fix (#13209)
jonathanc-n Nov 2, 2024
344f089
Revert "Improve push down filter of join (#13184)" (#13229)
eejbyfeldt Nov 2, 2024
89e96b4
Derive `Clone` for more ExecutionPlans (#13203)
alamb Nov 2, 2024
b40a298
feat(logical-types): add NativeType and LogicalType (#12853)
notfilippo Nov 3, 2024
a9d4d52
feat(substrait): AggregateRel grouping_expression support (#13173)
akoshchiy Nov 3, 2024
85f92ef
Apply projection to `Statistics` in `FilterExec` (#13187)
alamb Nov 3, 2024
d19865e
[minor] overload from_unixtime func to have optional timezone paramet…
buraksenn Nov 4, 2024
2482ff4
Minor: make LeftJoinData into a struct in CrossJoinExec (#13227)
alamb Nov 4, 2024
274b222
Deprecate invoke and invoke_no_args in favor of invoke_batch (#13174)
findepi Nov 4, 2024
49b9351
Support timestamp(n) type (#13231)
findepi Nov 4, 2024
659d5a3
Remove elements deprecated since v 38. (#13245)
findepi Nov 4, 2024
0458d30
fix: CSV Infer Schema now properly supports escaped characters. (#13214)
mnorfolk03 Nov 4, 2024
9707a8a
bump version and generate changelog
andygrove Nov 5, 2024
88f58bf
bump version and generate changelog
andygrove Nov 5, 2024
2d5364e
Downgrade tonic
matthewmturner Dec 23, 2024
2c35f17
[bug]: Fix wrong order by removal from plan (#13497)
akurmustafa Nov 24, 2024
5383d30
enforce_distribution: fix for limits getting lost
Max-Meldrum Dec 30, 2024
13f6aca
set default-features=false for datafusion in proto crate
Max-Meldrum Jan 7, 2025
4e29649
set schema_force_view_types to false in ParquetOptions
Max-Meldrum Jan 9, 2025
ca70f73
Revert "enforce_distribution: fix for limits getting lost"
suremarc Jan 14, 2025
37b8049
update sqllogictests after disabling view types
suremarc Jan 14, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
1 change: 1 addition & 0 deletions .gitattributes
Original file line number Diff line number Diff line change
@@ -1,3 +1,4 @@
.github/ export-ignore
datafusion/core/tests/data/newlines_in_values.csv text eol=lf
datafusion/proto/src/generated/prost.rs linguist-generated
datafusion/proto/src/generated/pbjson.rs linguist-generated
12 changes: 7 additions & 5 deletions .github/actions/setup-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -28,16 +28,18 @@ runs:
- name: Install Build Dependencies
shell: bash
run: |
apt-get update
apt-get install -y protobuf-compiler
RETRY="ci/scripts/retry"
"${RETRY}" apt-get update
"${RETRY}" apt-get install -y protobuf-compiler
- name: Setup Rust toolchain
shell: bash
# rustfmt is needed for the substrait build script
run: |
RETRY="ci/scripts/retry"
echo "Installing ${{ inputs.rust-version }}"
rustup toolchain install ${{ inputs.rust-version }}
rustup default ${{ inputs.rust-version }}
rustup component add rustfmt
"${RETRY}" rustup toolchain install ${{ inputs.rust-version }}
"${RETRY}" rustup default ${{ inputs.rust-version }}
"${RETRY}" rustup component add rustfmt
- name: Configure rust runtime env
uses: ./.github/actions/setup-rust-runtime
- name: Fixup git permissions
Expand Down
29 changes: 25 additions & 4 deletions .github/workflows/dev_pr/labeler.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,11 +17,11 @@

development-process:
- changed-files:
- any-glob-to-any-file: ['dev/**.*', '.github/**.*', 'ci/**.*', '.asf.yaml']
- any-glob-to-any-file: ['dev/**/*', '.github/**/*', 'ci/**/*', '.asf.yaml']

documentation:
- changed-files:
- any-glob-to-any-file: ['docs/**.*', 'README.md', './**/README.md', 'DEVELOPERS.md', 'datafusion/docs/**.*']
- any-glob-to-any-file: ['docs/**/*', 'README.md', './**/README.md', 'DEVELOPERS.md', 'datafusion/docs/**/*']

sql:
- changed-files:
Expand All @@ -33,16 +33,37 @@ logical-expr:

physical-expr:
- changed-files:
- any-glob-to-any-file: ['datafusion/physical-expr/**/*']
- any-glob-to-any-file: ['datafusion/physical-expr/**/*', 'datafusion/physical-expr-common/**/*', 'datafusion/physical-expr-aggregate/**/*', 'datafusion/physical-plan/**/*']

catalog:
- changed-files:
- any-glob-to-any-file: ['datafusion/catalog/**/*']

common:
- changed-files:
- any-glob-to-any-file: ['datafusion/common/**/*', 'datafusion/common-runtime/**/*']

execution:
- changed-files:
- any-glob-to-any-file: ['datafusion/execution/**/*']

functions:
- changed-files:
- any-glob-to-any-file: ['datafusion/functions/**/*', 'datafusion/functions-aggregate/**/*', 'datafusion/functions-aggregate-common', 'datafusion/functions-nested']


optimizer:
- changed-files:
- any-glob-to-any-file: ['datafusion/optimizer/**/*']
- any-glob-to-any-file: ['datafusion/optimizer/**/*', 'datafusion/physical-optimizer/**/*']

core:
- changed-files:
- any-glob-to-any-file: ['datafusion/core/**/*']

proto:
- changed-files:
- any-glob-to-any-file: ['datafusion/proto/**/*', 'datafusion/proto-common/**/*']

substrait:
- changed-files:
- any-glob-to-any-file: ['datafusion/substrait/**/*']
Expand Down
55 changes: 55 additions & 0 deletions .github/workflows/large_files.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,55 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

name: Large files PR check

concurrency:
group: ${{ github.repository }}-${{ github.head_ref || github.sha }}-${{ github.workflow }}
cancel-in-progress: true

on:
pull_request:

jobs:
check-files:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
with:
fetch-depth: 0
- name: Check size of new Git objects
env:
# 1 MB ought to be enough for anybody.
# TODO in case we may want to consciously commit a bigger file to the repo without using Git LFS we may disable the check e.g. with a label
MAX_FILE_SIZE_BYTES: 1048576
shell: bash
run: |
git rev-list --objects ${{ github.event.pull_request.base.sha }}..${{ github.event.pull_request.head.sha }} \
> pull-request-objects.txt
exit_code=0
while read -r id path; do
# Skip objects which are not files (commits, trees)
if [ ! -z "${path}" ]; then
size="$(git cat-file -s "${id}")"
if [ "${size}" -gt "${MAX_FILE_SIZE_BYTES}" ]; then
exit_code=1
echo "Object ${id} [${path}] has size ${size}, exceeding ${MAX_FILE_SIZE_BYTES} limit." >&2
echo "::error file=${path}::File ${path} has size ${size}, exceeding ${MAX_FILE_SIZE_BYTES} limit."
fi
fi
done < pull-request-objects.txt
exit "${exit_code}"
101 changes: 0 additions & 101 deletions .github/workflows/pr_benchmarks.yml

This file was deleted.

53 changes: 0 additions & 53 deletions .github/workflows/pr_comment.yml

This file was deleted.

52 changes: 33 additions & 19 deletions .github/workflows/rust.yml
Original file line number Diff line number Diff line change
Expand Up @@ -62,8 +62,7 @@ jobs:
~/.cargo/git/db/
./target/
./datafusion-cli/target/
# this key equals the ones on `linux-build-lib` for re-use
key: cargo-cache-benchmark-${{ hashFiles('datafusion/**/Cargo.toml', 'benchmarks/Cargo.toml', 'datafusion-cli/Cargo.toml') }}
key: cargo-cache-${{ hashFiles('**/Cargo.toml', '**/Cargo.lock') }}

- name: Check datafusion without default features
# Some of the test binaries require the parquet feature still
Expand All @@ -90,8 +89,8 @@ jobs:

# Ensure that the datafusion crate can be built with only a subset of the function
# packages enabled.
- name: Check datafusion (array_expressions)
run: cargo check --no-default-features --features=array_expressions -p datafusion
- name: Check datafusion (nested_expressions)
run: cargo check --no-default-features --features=nested_expressions -p datafusion

- name: Check datafusion (crypto)
run: cargo check --no-default-features --features=crypto_expressions -p datafusion
Expand Down Expand Up @@ -234,11 +233,7 @@ jobs:
with:
rust-version: stable
- name: Run cargo doc
run: |
export RUSTDOCFLAGS="-D warnings -A rustdoc::private-intra-doc-links"
cargo doc --document-private-items --no-deps --workspace
cd datafusion-cli
cargo doc --document-private-items --no-deps
run: ci/scripts/rust_docs.sh

linux-wasm-pack:
name: build with wasm-pack
Expand Down Expand Up @@ -526,7 +521,7 @@ jobs:
run: taplo format --check

config-docs-check:
name: check configs.md is up-to-date
name: check configs.md and ***_functions.md is up-to-date
needs: [ linux-build-lib ]
runs-on: ubuntu-latest
container:
Expand All @@ -547,6 +542,11 @@ jobs:
# If you encounter an error, run './dev/update_config_docs.sh' and commit
./dev/update_config_docs.sh
git diff --exit-code
- name: Check if any of the ***_functions.md has been modified
run: |
# If you encounter an error, run './dev/update_function_docs.sh' and commit
./dev/update_function_docs.sh
git diff --exit-code

# Verify MSRV for the crates which are directly used by other projects:
# - datafusion
Expand All @@ -567,18 +567,32 @@ jobs:
- name: Check datafusion
working-directory: datafusion/core
run: |
# If you encounter an error with any of the commands below
# it means some crate in your dependency tree has a higher
# MSRV (Min Supported Rust Version) than the one specified
# in the `rust-version` key of `Cargo.toml`. Check your
# dependencies or update the version in `Cargo.toml`
cargo msrv verify
# If you encounter an error with any of the commands below it means
# your code or some crate in the dependency tree has a higher MSRV
# (Min Supported Rust Version) than the one specified in the
# `rust-version` key of `Cargo.toml`.
#
# To reproduce:
# 1. Install the version of Rust that is failing. Example:
# rustup install 1.79.0
# 2. Run the command that failed with that version. Example:
# cargo +1.79.0 check -p datafusion
#
# To resolve, either:
# 1. Change your code to use older Rust features,
# 2. Revert dependency update
# 3. Update the MSRV version in `Cargo.toml`
#
# Please see the DataFusion Rust Version Compatibility Policy before
# updating Cargo.toml. You may have to update the code instead.
# https://github.com/apache/datafusion/blob/main/README.md#rust-version-compatibility-policy
cargo msrv --output-format json --log-target stdout verify
- name: Check datafusion-substrait
working-directory: datafusion/substrait
run: cargo msrv verify
run: cargo msrv --output-format json --log-target stdout verify
- name: Check datafusion-proto
working-directory: datafusion/proto
run: cargo msrv verify
run: cargo msrv --output-format json --log-target stdout verify
- name: Check datafusion-cli
working-directory: datafusion-cli
run: cargo msrv verify
run: cargo msrv --output-format json --log-target stdout verify
Loading