Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dockerfile #1252

Open
wants to merge 2,911 commits into
base: branch-2.2
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
2911 commits
Select commit Hold shift + click to select a range
fb148bd
Update notebook
argenisleon Dec 7, 2021
e1205af
Merge remote-tracking branch 'origin/develop-21.11' into develop-21.11
argenisleon Dec 7, 2021
fbaea6e
Fix replace bug
luis11011 Dec 9, 2021
19da9a4
Bump version
luis11011 Jan 5, 2022
4b9f174
Update packages
luis11011 Jan 5, 2022
2e13e4e
Fix dataframe negation bug
luis11011 Jan 5, 2022
8931a20
Fix constants import bug
luis11011 Jan 5, 2022
e3530a8
Update Dask packages
luis11011 Jan 5, 2022
10b603d
Update package on databricks requirements
luis11011 Jan 5, 2022
8ac5336
Fix assign bug on partitioned dataframes
luis11011 Jan 5, 2022
5a3798a
Fix replace bug
luis11011 Jan 6, 2022
488af59
Fix percentile bug
luis11011 Jan 6, 2022
5e9c45e
Bump version
luis11011 Jan 6, 2022
4213e80
Add omw-1.4 package on tests
luis11011 Jan 6, 2022
8193835
Bump version
luis11011 Jan 6, 2022
b7c2cb4
Use "bool" instead of "boolean" on type names
luis11011 Jan 10, 2022
e5b5263
Fix apply bugs on mask functions
luis11011 Jan 10, 2022
f2a92ad
Bump version
luis11011 Jan 10, 2022
b49bc51
Use full replace by default when passing a list of tuples
luis11011 Jan 10, 2022
b3edfd8
Refactor mid function
luis11011 Jan 10, 2022
0728884
Implement substring function
luis11011 Jan 10, 2022
8997d0c
Bump version
luis11011 Jan 10, 2022
3d1befc
Fix internal bool type bug
luis11011 Jan 11, 2022
04bf386
Add col_name argument to mask.any and mask.all functions
luis11011 Jan 11, 2022
e390e08
Update mask tests
luis11011 Jan 11, 2022
316bf36
Fix bug on dataframe comparison
luis11011 Jan 11, 2022
10217e6
Bump version
luis11011 Jan 11, 2022
56c4fd8
Fix argument support on load.file
luis11011 Jan 12, 2022
31cadfe
Refactor in API functions
luis11011 Jan 20, 2022
839d2b2
Include session getter and setter on API functions
luis11011 Jan 20, 2022
ab28611
Reorganize imports
argenisleon Jan 23, 2022
6204d8c
Fix returns
argenisleon Jan 23, 2022
813c523
Improve formatting
argenisleon Jan 23, 2022
a4fcf13
Fix numpy version
argenisleon Jan 23, 2022
2f58c26
Improve formatting
argenisleon Jan 23, 2022
4600eab
Implement to to_items
argenisleon Jan 23, 2022
9da3d23
Fix _to_float()
argenisleon Jan 23, 2022
719092d
Change to param to coerce
argenisleon Jan 23, 2022
206048f
Merge remote-tracking branch 'origin/develop-22.1' into develop-22.1
argenisleon Jan 23, 2022
f602268
Implement replace_chars in cudf
argenisleon Jan 23, 2022
7b99664
Refactor in API functions
luis11011 Jan 20, 2022
e7bf784
Include session getter and setter on API functions
luis11011 Jan 20, 2022
a107658
Import preprocessing package inside required functions only
luis11011 Jan 24, 2022
9d94ccb
Update Dask-cuDF requirements
luis11011 Jan 24, 2022
d9a4eb4
Improve histogram code
argenisleon Jan 24, 2022
b323730
Fix cudf not supporting ((?i))
argenisleon Jan 24, 2022
f45144b
Update ipython requirement
luis11011 Jan 25, 2022
17d31af
Add python 3.7 support
luis11011 Jan 25, 2022
47a5984
Fix integer, float, numeric and string detection in cudf
argenisleon Jan 25, 2022
e03c4d2
Add missing contains functopm/Make cudf pandas compatible
argenisleon Jan 25, 2022
b961a87
Imrprove formatting and organization
argenisleon Jan 25, 2022
0c43b06
Merge remote-tracking branch 'origin/develop-22.1' into develop-22.1
argenisleon Jan 25, 2022
538e191
Add n=None support on pattern_counts
luis11011 Jan 28, 2022
84e1bd1
Remove buffer functions
luis11011 Feb 1, 2022
1b9d39b
Set profiling calculation function as private
luis11011 Feb 2, 2022
15d0ec1
Support range argument on histograms
luis11011 Feb 3, 2022
361b117
Order data types by hierarchy
luis11011 Feb 4, 2022
bc4fc0d
Update pandas version on requirements
luis11011 Feb 6, 2022
e9704b8
Calculate histograms and frequency when bins is greater than zero
luis11011 Feb 6, 2022
8bbddbf
Update frequency functions docstring
luis11011 Feb 6, 2022
a3e1ae2
Fix histogram bug
luis11011 Feb 6, 2022
4e0c1a8
Bump version
luis11011 Feb 7, 2022
c9412a6
Fix iloc bug on cuDF
luis11011 Feb 22, 2022
e9b4289
Add to_list to columns
argenisleon Feb 22, 2022
10deab3
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Feb 22, 2022
f1096a8
Rename to_list, to_list_one and to_dict functions
luis11011 Feb 23, 2022
de68d09
Implement to_list function on cuDF
luis11011 Feb 23, 2022
77267d2
Implement categorical data optimization on Pandas and cuDF
luis11011 Feb 23, 2022
af4c244
Fix to_dict internal function bug
luis11011 Feb 23, 2022
9ea3749
Bump version
luis11011 Feb 23, 2022
1d353db
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Feb 23, 2022
9d0d6e1
Support multiple columns output
argenisleon Mar 2, 2022
2b2dda6
Support keyword arguments on rows.apply
luis11011 Mar 2, 2022
c2c18da
Fix json file loading bug
luis11011 Mar 3, 2022
20775ee
Bump version
luis11011 Mar 3, 2022
2ca40f9
Organize imports
argenisleon Mar 7, 2022
d6eec15
Ensure count is python int when selecting a range
argenisleon Mar 7, 2022
568c89c
Declare missing functions
argenisleon Mar 7, 2022
9153152
WIP Ibis fixes
argenisleon Mar 7, 2022
5cd43ae
Fix Ibis special case
argenisleon Mar 7, 2022
2a794cf
Improve handling data type
argenisleon Mar 7, 2022
b45ba79
Improve docstrings
argenisleon Mar 7, 2022
4516a17
Improve params handling
argenisleon Mar 7, 2022
39737f9
Improve formatting
argenisleon Mar 7, 2022
c692644
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Mar 7, 2022
dbe96e0
Fix output column bug on multiple replace
luis11011 Mar 9, 2022
21f9c1d
Fix copied column bug
luis11011 Mar 9, 2022
672d26a
Fix json file name meta bug
luis11011 Mar 10, 2022
15a4b10
Fix pattern function bug
luis11011 Mar 11, 2022
db8d2c5
Improve docstrings
argenisleon Mar 14, 2022
3e42b45
Fix series handling, min and max fucntions
argenisleon Mar 14, 2022
b15cc16
Fix _iloc
argenisleon Mar 14, 2022
5b2be6a
Fix frecuency and datatype
argenisleon Mar 14, 2022
3d3c008
Fix database base database connection
argenisleon Mar 14, 2022
6dba70b
typo fixes
argenisleon Mar 14, 2022
f516162
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Mar 14, 2022
29a14a9
Fix load.file file_name bug
luis11011 Mar 17, 2022
606a3e1
Bump version
luis11011 Mar 17, 2022
9a6040b
Fix rows.append bug
luis11011 Mar 23, 2022
4b9fe04
set now can handle multiple conditions and values
argenisleon Mar 23, 2022
d856c09
set fucntion now can handle multiple conditions and values
argenisleon Mar 23, 2022
992430d
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
luis11011 Mar 24, 2022
983b959
Fix where param handling
argenisleon Mar 24, 2022
fa7eb5c
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Mar 24, 2022
99990a6
Fix where handling
argenisleon Mar 25, 2022
b1fcf49
Fix multiple set with default value bug
luis11011 Mar 25, 2022
a017972
Improve histogram and profile performance
argenisleon Mar 26, 2022
07f01ca
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Mar 26, 2022
7051eef
Add missing meta param
argenisleon Mar 29, 2022
c938e9f
bump fast-histogram version
argenisleon Mar 29, 2022
dde2dbe
Optimize pattern function
luis11011 Mar 31, 2022
46a3783
Allow uncopied iloc
luis11011 Apr 1, 2022
bed1962
Remove columns selection from iloc
luis11011 Apr 1, 2022
d1e4caa
Fix cache in patterns
argenisleon Apr 3, 2022
d3e3dce
Improve patterns calculation speed
argenisleon Apr 3, 2022
4bf47e5
Turns patterns count function private
argenisleon Apr 3, 2022
a21f917
Do not cast to string if already a string
argenisleon Apr 3, 2022
e9aa6ac
Implement base private select
argenisleon Apr 3, 2022
10e497c
General format improvements
argenisleon Apr 3, 2022
8e7833b
WIP Polars implementation
argenisleon Apr 3, 2022
455564c
WIP Vaex implementation
argenisleon Apr 3, 2022
097c3f3
Adding missing meta in spark dataframe creation
argenisleon Apr 3, 2022
e5a9516
Improve docstrings
argenisleon Apr 3, 2022
9f4562d
Order imports
argenisleon Apr 3, 2022
3fdf6e4
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Apr 3, 2022
bf6161d
WIP Vaex Engine
argenisleon Apr 4, 2022
2ee35ba
Fix so the name do not depends on the engine series name property
argenisleon Apr 4, 2022
69558c4
Implement strip function
luis11011 Apr 4, 2022
8fb5c11
Fix frequent patterns bug
luis11011 Apr 4, 2022
e1399db
Fix to_string bug
luis11011 Apr 4, 2022
4bc5da5
Fix normalize_chars bug
luis11011 Apr 4, 2022
81e38f0
Optimize replace function
luis11011 Apr 4, 2022
a90a478
Fix histogram bug
luis11011 Apr 5, 2022
4d65d66
Implement float types list
luis11011 Apr 5, 2022
3cbd2c9
Include logic operators in expression parser
luis11011 Apr 7, 2022
ad2d026
Fix rows.select bug
luis11011 Apr 7, 2022
a8b9652
Fix rows.select bug
luis11011 Apr 7, 2022
88253c8
Implement parse with optimus classes instead of internal series
luis11011 Apr 7, 2022
8eded69
Fix set function bug
luis11011 Apr 8, 2022
41b6409
Fix set bug
luis11011 Apr 8, 2022
0380ddd
Implement optimus expression support on set
luis11011 Apr 8, 2022
6506b0a
Fix set bug
luis11011 Apr 8, 2022
f6cddfb
Fix asynchronous function calling bug on Dask and Dask-cuDF
luis11011 Apr 12, 2022
c05810c
Fix repartition bug on Dask
luis11011 Apr 13, 2022
2cb35e8
Fix Dask series creation bug
luis11011 Apr 13, 2022
6c066a9
Fix replace function on Dask
luis11011 Apr 13, 2022
110644c
Fix pickling Dask client bug
luis11011 Apr 13, 2022
20eb752
Fix functions bug on dataframe pickling
luis11011 Apr 13, 2022
e3e7fb6
Fix state bug
luis11011 Apr 13, 2022
f8969d7
Remove time_it decorator
luis11011 Apr 14, 2022
5b21e2a
Fix profiler bug
luis11011 Apr 14, 2022
30f9033
Fix depickling bug
luis11011 Apr 14, 2022
6f1cd1f
Clean up Spark engine
argenisleon Apr 17, 2022
fbc96fd
Replace simple json by json standard library
argenisleon Apr 17, 2022
0c87aa9
Clean up and improve formatting
argenisleon Apr 17, 2022
adb28d5
Improve dict copy performance
argenisleon Apr 17, 2022
bbdf44b
Improve data type Inference via vectorization
argenisleon Apr 17, 2022
455ba16
Remove numba code
argenisleon Apr 17, 2022
fdb0b4f
Strip down requirements
argenisleon Apr 17, 2022
97a52d1
Now can load external models
argenisleon Apr 17, 2022
7456912
Remove orderedset library
argenisleon Apr 17, 2022
3bfe64c
Add mask new optimus datatypes
argenisleon Apr 17, 2022
7d33b1d
REplace fastfloat by float
argenisleon Apr 17, 2022
ded04be
General improvements
argenisleon Apr 17, 2022
4789157
Remove simplejson
argenisleon Apr 17, 2022
67e5351
Improve to_string performance
argenisleon Apr 17, 2022
292e86e
Fix .ip function rename
argenisleon Apr 17, 2022
46638e4
Doc for Installing libmagic
argenisleon Apr 17, 2022
de08954
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Apr 17, 2022
2aa3c7e
Remove packaging library
argenisleon Apr 17, 2022
ca7a666
Replace fastnumbers wirh pd.to_numeric
argenisleon Apr 18, 2022
f0a75c6
REmove wordninja
argenisleon Apr 18, 2022
bfc8330
Clean up imports
argenisleon Apr 18, 2022
6a6ebec
Move psutil to dask requirements
argenisleon Apr 18, 2022
8de14ed
Remove comment
argenisleon Apr 18, 2022
5580108
Fix replace string function typo
luis11011 Apr 18, 2022
0908be6
Fix nan values bug on dump_json
luis11011 Apr 18, 2022
d6bde59
Fix one hot encode bug
luis11011 Apr 18, 2022
c065a81
Now orderedset support intersect and is iterable
argenisleon Apr 18, 2022
ff4daa5
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Apr 18, 2022
2ad89d7
Use deepcopy in meta operations
luis11011 Apr 19, 2022
1071d11
Use deepcopy in Meta.get
luis11011 Apr 19, 2022
f4c5011
Add param to attach prediction to a dataframe
argenisleon Apr 19, 2022
cd38061
Fix deepcopy missed error in meta
argenisleon Apr 19, 2022
4a0722e
Merge remote-tracking branch 'origin/develop-22.2' into develop-22.2
argenisleon Apr 19, 2022
92abe7e
Support column rename in multiple columns
argenisleon Apr 20, 2022
f82a911
Support numpy on dataframe creation
argenisleon Apr 20, 2022
2efb4c4
Added parquet file infer
argenisleon Apr 21, 2022
d7357cb
Try standard deepcopy in case ujson failed
argenisleon Apr 21, 2022
51e6f13
Handle not detected datatypes
argenisleon Apr 22, 2022
5c291d9
Improve sample function
argenisleon Apr 22, 2022
335ae6d
Fix else position
argenisleon Apr 22, 2022
c32f66c
Fix is string implementation
argenisleon Apr 23, 2022
b94feed
Fix wrong str detection
argenisleon Apr 23, 2022
2afd352
Fix wrong string infer
argenisleon Apr 24, 2022
51ee274
Missing change to string constant
argenisleon Apr 24, 2022
02608cc
Improve string/zip code type handling
argenisleon Apr 24, 2022
1ef5bf2
Fix wrong tab
argenisleon Apr 24, 2022
92aa309
Handle right data type column
argenisleon Apr 24, 2022
6d10904
test reset index
argenisleon Apr 24, 2022
cb06b3e
Better Handel numeric vs string operations
argenisleon Apr 24, 2022
36056c7
Add index to operations
argenisleon Apr 24, 2022
f523afb
Fix string/numeric column arithmetic operation
argenisleon Apr 24, 2022
20ad218
Fix string/numeric column arithmetic operation
argenisleon Apr 24, 2022
084409f
Vaex WIP
argenisleon Apr 24, 2022
74dec13
Fix Saprk
argenisleon Apr 24, 2022
cf931e1
Model improve naming
argenisleon Apr 24, 2022
7f53c07
Merge remote-tracking branch 'refs/remotes/origin/develop-22.2'
argenisleon Apr 24, 2022
fa0484b
Add not operator
argenisleon Apr 24, 2022
ade1601
Increase buffer size to improve encoding detection
argenisleon May 4, 2022
8ac4003
Vaex WIP
argenisleon Jun 6, 2022
79230f6
Polars WIP
argenisleon Jun 6, 2022
0d2850d
Remove request to load remote files
argenisleon Jun 6, 2022
c4602cc
Format improvements
argenisleon Jun 6, 2022
5de5e05
Fix parquet loading
argenisleon Jun 6, 2022
1b1c6d2
WIP pyscript
argenisleon Jun 6, 2022
44480a5
Sum convert to float
argenisleon Jun 6, 2022
0699e9b
update docs
argenisleon Jun 6, 2022
be6f73d
Bug fixes unknow values
argenisleon Jun 14, 2022
4cadb11
Set imgkit requirement as optional
luis11011 Jun 19, 2022
180c5d2
Bump version
luis11011 Jun 19, 2022
3d7d29f
Re-raise ImportError
luis11011 Jun 19, 2022
c10339a
Update tensorflow from 2.6.1 to 2.9.1
pyup-bot Jun 20, 2022
3309ce5
Update tensorflow from 2.6.1 to 2.9.1
pyup-bot Jun 20, 2022
22b2745
Update numpy from 1.21.0 to 1.22.4
pyup-bot Jun 20, 2022
87752a7
Update numpy from 1.21.0 to 1.22.4
pyup-bot Jun 20, 2022
fa0452e
Fix issue when calculating hist with the exact same min and max values
argenisleon Jun 21, 2022
95053f9
Merge remote-tracking branch 'origin/develop-22.6' into develop-22.6
argenisleon Jun 21, 2022
dd5bb01
Clean up
argenisleon Jun 21, 2022
91624b4
Remove print
luis11011 Jun 21, 2022
2db1670
Fix profiling bug
luis11011 Jun 21, 2022
0dd74ae
Annotate possible bug on nulls count on profiling
luis11011 Jun 21, 2022
f637dd6
Bump version
luis11011 Jun 21, 2022
0456c68
Fix rename bug
luisboitas Sep 15, 2022
f49ddca
Bump version
luisboitas Sep 15, 2022
9ee0a52
Change orjsong by marshall to imporve copying speed
argenisleon Sep 16, 2022
6782897
Improve performance removing unnecessarydeepcpy from reset
argenisleon Sep 16, 2022
2388bd7
Fix missing deepcopy
argenisleon Sep 16, 2022
d2a7b4c
Bump glom version
argenisleon Sep 16, 2022
b034345
Merge remote-tracking branch 'origin/develop-22.6' into develop-22.6
argenisleon Sep 16, 2022
1b8a82c
Support connection with not password
argenisleon Sep 16, 2022
64e401e
Fix null columns issue
luisboitas Sep 16, 2022
b028be8
Fix meta reset bug
luisboitas Sep 16, 2022
1492183
Bump version
luisboitas Sep 16, 2022
42bc518
Fix statsmodel requirement version
luis11011 Oct 17, 2022
b51b579
Support array on rows drop operation
luis11011 Oct 17, 2022
2100afd
Bump version
luis11011 Oct 17, 2022
609b66b
Fix database connection bug
luis11011 May 1, 2023
22029c5
Bump version
luis11011 May 1, 2023
cb73842
Implement manually triggered Publish Python test Package Github action
luis11011 May 19, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
9 changes: 0 additions & 9 deletions .coveragerc

This file was deleted.

23 changes: 23 additions & 0 deletions .github/workflows/docker.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
name: docker-image

on:
workflow_dispatch:
release:
types: [published]

jobs:
build:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v2
- name: Login to Docker Hub
uses: docker/login-action@v1
with:
username: ${{ secrets.DOCKER_USERNAME }}
password: ${{ secrets.DOCKER_PASSWORD }}
- name: Build Docker image
run: |
docker build . -t hiprimus/optimus:latest
- name: Push Docker image
run: |
docker push hiprimus/optimus:latest
36 changes: 36 additions & 0 deletions .github/workflows/main.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
name: tests

on:
workflow_dispatch:
release:
types: [published]
pull_request:
branches:
- main
- develop-23.5

jobs:
build:
runs-on: ubuntu-latest
strategy:
matrix:
python-version: [3.7, 3.8]

steps:
- name: Checkout
uses: actions/checkout@v2
- name: Set up Python ${{ matrix.python-version }}
uses: actions/setup-python@v2
with:
python-version: ${{ matrix.python-version }}
- name: Install dependencies
run: |
python -m pip install --upgrade pip
if [ -f requirements.txt ]; then pip install -r requirements.txt; fi
if [ -f requirements/pandas-requirements.txt ]; then pip install -r requirements/pandas-requirements.txt; fi
if [ -f requirements/dask-requirements.txt ]; then pip install -r requirements/dask-requirements.txt; fi
if [ -f requirements/api-requirements.txt ]; then pip install -r requirements/api-requirements.txt; fi
python -m nltk.downloader punkt wordnet stopwords averaged_perceptron_tagger omw-1.4
- name: Test with unittest
run: |
python -m unittest
67 changes: 67 additions & 0 deletions .github/workflows/python-manually-publish-test.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,67 @@
name: Update Version and Publish to Test PyPI

on:
workflow_dispatch:
inputs:
new_version:
description: 'New PyOptimus version (TestPyPI)'
required: true
python_version:
description: 'Python version'
default: '3.10'
required: true

jobs:
update-version-and-create-tag:
runs-on: ubuntu-latest

steps:
- name: Checkout code
uses: actions/checkout@v2

- name: Set up Python ${{ github.event.inputs.python_version }}
uses: actions/setup-python@v2
with:
python-version: '${{ github.event.inputs.python_version }}'

- name: Update version in toml file
run: |
sed -i 's/version\s*=\s*".*"/version = "${{ github.event.inputs.new_version }}"/' pyproject.toml

- name: Commit changes
run: |
git config --local user.email "${{ github.actor }}@users.noreply.github.com"
git config --local user.name "${{ github.actor }}"
git commit -m "Update version in toml file" pyproject.toml

- name: Push changes
uses: ad-m/github-push-action@master
with:
github_token: ${{ secrets.GITHUB_TOKEN }}
branch: ${{ github.ref }}

- name: Create tag
run: |
git tag "test-${{ github.event.inputs.new_version }}"

- name: Push tag
run: |
git push origin "test-${{ github.event.inputs.new_version }}"

- name: Install dependencies
run: |
python -m pip install --upgrade pip
pip install poetry

- name: Configure Poetry
run: |
poetry config repositories.pyoptimus https://test.pypi.org/legacy/
poetry config http-basic.pyoptimus ${{ secrets.TEST_PYPI_USERNAME }} ${{ secrets.TEST_PYPI_PASSWORD }}

- name: Build and publish package
run: |
poetry build
poetry publish -r pyoptimus
env:
TEST_PYPI_USERNAME: ${{ secrets.TEST_PYPI_USERNAME }}
TEST_PYPI_PASSWORD: ${{ secrets.TEST_PYPI_PASSWORD }}
29 changes: 29 additions & 0 deletions .github/workflows/python-publish.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
name: Upload Python Package

on:
release:
types: [published]

jobs:
deploy:

runs-on: ubuntu-latest

steps:
- uses: actions/checkout@v2
- name: Set up Python
uses: actions/setup-python@v2
with:
python-version: '3.8'
- name: Install dependencies
run: |
python -m pip install --upgrade pip
pip install wheel
pip install build
- name: Build package
run: python setup.py sdist bdist_wheel --no-git
- name: Publish package
uses: pypa/gh-action-pypi-publish@release/v1
with:
user: ${{ secrets.PYPI_USERNAME }}
password: ${{ secrets.PYPI_TOKEN }}
44 changes: 8 additions & 36 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -5,13 +5,12 @@
# Directory-based project format:
.idea/

# Optimus

examples/checkPointFolder
# Visual Studio Code:
.vscode/

# Optimus
metastore_db
spark-warehouse
projectFilesBackup/tests.py

# Distribution / packaging
build/
Expand All @@ -28,39 +27,12 @@ __pycache__
.coverage*
.DS_Store
.pytest_cache/
data.json
.pytest_cache/README.md
examples/new-api-jar-test\.ipynb
venv/

# Test
tests/test.csv/
tests/test.json/
tests/test.parquet/
*.test

examples/new-api-optimus-jars\.ipynb
readme/credentials\.py
readme/data/foo\.json/
readme/data/foo\.parquet/
readme/data/foo\.csv/

examples/test\.parquet/
examples/test\.csv/
examples/test\.json/
examples/temp


node_modules


*bumblebee\.ini
examples/many-columns\.csv

examples/many=columns\.parquet/

examples/many-columns.parquet/

examples/sandbox.ipynb

examples/dask-worker-space/
node_modules/
**/dask-worker-space/
dask-worker-space/
conda/recipes/conda
conda/recipes/conda-build
3 changes: 3 additions & 0 deletions .pyup.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
update: insecure
branch: develop-23.5
schedule: "every two weeks on monday"
21 changes: 21 additions & 0 deletions .readthedocs.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# ..readthedocs.yaml
# Read the Docs configuration file
# See https://docs.readthedocs.io/en/stable/config-file/v2.html for details

# Required
version: 2

# Build documentation in the docs/ directory with Sphinx
sphinx:
configuration: docs/source/conf.py

# Optionally build your docs in additional formats such as PDF
formats:
- pdf

# Optionally set the version of Python and requirements required to build your docs
python:
version: "3.7"
install:
- requirements: requirements.txt
- requirements: docs/requirements.txt
45 changes: 0 additions & 45 deletions .travis.yml

This file was deleted.

2 changes: 1 addition & 1 deletion CODE_OF_CONDUCT.md
Original file line number Diff line number Diff line change
Expand Up @@ -40,7 +40,7 @@ Project maintainers who do not follow or enforce the Code of Conduct in good fai

## Developer & Administrator

[Favio Vázquez](https://github.com/faviovazquez)
[Argenis Leon](https://github.com/argenisleon)

## Attribution

Expand Down
2 changes: 1 addition & 1 deletion CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@ Please note we have a code of conduct, please follow it in all your interactions

### Developer and administrator

[Favio Vázquez](https://github.com/faviovazquez)
[Argenis León](https://github.com/argenisleon)

### Attribution

Expand Down
36 changes: 36 additions & 0 deletions Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
FROM ubuntu:20.04

RUN apt-get update && yes|apt-get upgrade && \
apt-get install -y git curl wget nano bzip2 sudo net-tools && \
apt-get install -y --no-install-recommends apt-utils

RUN wget https://repo.anaconda.com/archive/Anaconda3-2020.07-Linux-x86_64.sh && \
bash Anaconda3-2020.07-Linux-x86_64.sh -b && \
rm Anaconda3-2020.07-Linux-x86_64.sh

ENV PATH="/root/anaconda3/bin:${PATH}"

RUN sudo apt-get update --fix-missing && \
sudo apt-get install -y gcc g++ && \
sudo apt-get clean

RUN sudo rm -rf /var/lib/apt/lists/*

RUN sudo chown -R root ~/anaconda3/bin && \
sudo chmod -R +x ~/anaconda3/bin && \
conda install -c conda-forge jupyterlab && \
conda install -c conda-forge dask-labextension && \
jupyter serverextension enable dask_labextension && \
conda install -c conda-forge jupyter_kernel_gateway && \
conda clean -afy

RUN echo "Version 23.5.0-beta"

RUN pip install cytoolz && \
pip install llvmlite --ignore-installed && \
pip install git+https://github.com/hi-primus/[email protected]#egg=pyoptimus[pandas] && \
pip install git+https://github.com/hi-primus/[email protected]#egg=pyoptimus[dask]

CMD jupyter notebook --port=8888 --no-browser --ip=0.0.0.0 --allow-root

EXPOSE 8888:8888 8889:8889
28 changes: 28 additions & 0 deletions Dockerfile.coiled
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
FROM continuumio/miniconda3:4.9.2

RUN apt-get update --fix-missing && \
apt-get install -y gcc && \
apt-get install -y g++ && \
apt-get clean && \
rm -rf /var/lib/apt/lists/*

RUN conda install libmagic && \
conda clean -afy

RUN pip install aiohttp --no-cache-dir && \
pip install git+https://github.com/hi-primus/[email protected] --no-cache-dir















Loading