- Fix
cogroup-by
for numbers of pcoll > 10 => Issue #83 - Fix
:many-files
option inread-text-file
=> Issue #75
- Add FileIO Read to read a PCollection of files
- Add ElasticSearchIO
- Upgrade to Apache Beam 2.15.0
- Upgrade dependencies
- Upgrade to Apache Beam 2.9.0
- Upgrade dependencies
- Add json-schema in bq write
- Combine-fn accepts now map as arguments
- Remove deprecated write-by
- Upgrade to Apache Beam 2.8.0
- Support dynamic file writes
- Support
:deflate
and:zip
compressions bq/->time-partitioning
now accepts:field
and:require-partition-filter
- Improve docs
- Upgrade to Dataflow 2.3.0
- Support time-partitioned Big Query tables (thanks to @neuromantik33)
- Fix broken input from BigQuery
- Fix missing safe-exec from filename policy
- Fix broken serializing after group-by in some cases
- Fix rare (hopefully) bug introduced by being lazy. with the vals after a group-by. Now we are greedy.
- Compatibility with Dataflow 2.2.0
- Compatibility with Clojure 1.9 and specs (some code had to be ported to Java shims because
proxy
is notSerializable
anymore in Clojure 1.9)
- Removed superfluous logging,
safe-exec
call and type hints
- Fix options on PipelineWithOptions to have it work again on Dataflow (oops, sorry!)
- Fixed
defoptions
again and made it useful for interop with Beam - Added
:checkpoint
option, will be made more useful later - Switched to new PubSub API
- Add support for custom filename-policy
- Fixed a possible bug causing StackOverflows on
combine
- Added
wait-pipeline-result
- Fix a bug with incorrect call to
seq
in combine - Add a missing safe-exec in
write-bq-table-raw
- Update to v2.x Dataflow version (Apache Beam)
- Add new streaming features (custom timestamp in PubSub #30 (thanks to @RoIT), BigQuery streaming writes)
- Add compressed TextIO output #31 (thanks to @RoIT)
- Add FileNamePolicy compatibility for TextIO
- Handle recursion in RECORD schema generation for Big Query #9 (thanks to @torbjornvatn)
- Add windows and trigger support #13 (thanks to @RoIT)
- Bump Dataflow SDK to 1.9.0
- Add support for standard SQL syntax for Big Query #11 (thanks to @torbjornvatn)
- Read data from pubsub topics #10 (thanks to @torbjornvatn)
- Update Datastore API to v1, add wrappers for creating Entities from and converting to Clojure maps. Some breakage in the datasplash.datastore namespace ensues.
- Add pubsub namespace #6 (Thanks to @MartinSahlen)
- Put slf4j deps in :dev profiles #8 (Thanks to @torbjornvatn)