Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
33 commits
Select commit Hold shift + click to select a range
24b5003
Add Internet Archive data fetching functionality
jessbryte Oct 14, 2025
2fc1f72
Merge branch 'creativecommons:main' into main
jessbryte Oct 16, 2025
3d18a74
Cleaned up static analysis issues in internetarchive_fetch.py
jessbryte Oct 16, 2025
9cb806c
Merge pull request #1 from jessbryte/internet-archive-update
jessbryte Oct 16, 2025
b7873cc
Revert Pipfile and Pipfile.lock to state before last push
jessbryte Oct 20, 2025
c77e23f
Refactor Internet Archive fetch script: use pycountry, count by port,…
jessbryte Oct 20, 2025
0597061
Revert Pipfile.lock
jessbryte Oct 20, 2025
6924f4a
Merge pull request #2 from jessbryte/internet-archive-update
jessbryte Oct 20, 2025
a22ede1
Revert Pipfile to remove pycountry
jessbryte Oct 24, 2025
c47fd55
Merge branch 'creativecommons:main' into main
jessbryte Oct 24, 2025
c970d57
Improve normalization and filtering for license and language fields
jessbryte Oct 25, 2025
7c1f97a
Merge branch 'main' into internet-archive-update
jessbryte Oct 25, 2025
6d078a2
Merge pull request #3 from jessbryte/internet-archive-update
jessbryte Oct 25, 2025
903190f
Make internetarchive_fetch.py executable for direct script usage
jessbryte Oct 25, 2025
c1a1605
Refactor: improve language normalization, logging, and output sorting…
jessbryte Oct 28, 2025
27897f1
Merge pull request #4 from jessbryte/internet-archive-update
jessbryte Oct 28, 2025
68be354
Restore pre-automation scripts
jessbryte Oct 28, 2025
226bd34
Merge pull request #5 from jessbryte/preautomation_cleanup
jessbryte Oct 28, 2025
a22299a
Added internetarchive documentation, utf-8 csv outputting and package…
jessbryte Oct 29, 2025
b7a2aad
Merge pull request #6 from jessbryte/internet_archive_cleanup
jessbryte Oct 29, 2025
3e29579
Language normalization
jessbryte Oct 31, 2025
92e7629
Documentation Update for internetarchive
jessbryte Oct 31, 2025
6eebe1d
Merge branch 'main' into internet-archive-update
jessbryte Oct 31, 2025
5b257a4
Restore Pipfile.lock
jessbryte Oct 31, 2025
7db3e8b
Merge pull request #7 from jessbryte/internet-archive-update
jessbryte Oct 31, 2025
cc083f4
Removing aliases that are in iso639
jessbryte Oct 31, 2025
5581dd9
Merge branch 'main' into main
jessbryte Nov 4, 2025
2c9fa53
Reworked Language Normalization logic and some fixes
jessbryte Nov 5, 2025
ea7fa93
Pulling Changes - Conflict Resolved for Merge
jessbryte Nov 5, 2025
61da028
Merge branch 'creativecommons:main' into main
jessbryte Nov 5, 2025
990d535
Make changes to license identifiers + documentation update
jessbryte Nov 5, 2025
c93d87b
Update internetarchive fetching to use shared session
jessbryte Nov 6, 2025
4af14ac
Reordered sources in sources documentatiom
jessbryte Nov 6, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions Pipfile
Original file line number Diff line number Diff line change
Expand Up @@ -4,12 +4,13 @@ verify_ssl = true
name = "pypi"

[packages]
feedparser = "*"
babel = "*"
flickrapi = "*"
GitPython = "*"
google-api-python-client = "*"
h11 = ">=0.16.0" # Ensure dependency is secure
internetarchive = ">=5.5.1"
iso639-lang = "*"
jupyterlab = ">=3.6.7"
matplotlib = "*"
numpy = "*"
Expand All @@ -19,7 +20,6 @@ pillow = ">=11.3.0" # Ensure dependency is secure
Pyarrow = "*"
Pygments = "*"
python-dotenv = "*"
PyYAML = "*"
requests = ">=2.31.0"
seaborn = "*"
urllib3 = ">=2.5.0"
Expand Down
119 changes: 70 additions & 49 deletions Pipfile.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading