new poetry lock
apostolosfilippas committed Dec 20, 2024
2 parents aed96d4 + 387f0e7 commit b588446
Showing 589 changed files with 101,659 additions and 70,783 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/test_suite.yml
@@ -48,4 +48,4 @@ jobs:

- name: Run doctests
shell: bash
run: poetry run make test-doctests
run: poetry run make test-doctests
3 changes: 3 additions & 0 deletions .gitignore
@@ -6,6 +6,7 @@ _build/*
*db.bak
*db-shm
*db-wal
*docx
*gz
*json
*jsonl
@@ -40,3 +41,5 @@ venv
!.vscode/settings.json
!tests/serialization/data/*.json
.env*
docs/notebooks/drafts/*
test.py
170 changes: 164 additions & 6 deletions CHANGELOG.md
@@ -1,15 +1,173 @@
# Changelog

## [0.1.39] - TBD

## [0.1.38] - 2024-11-26
### Added
- `Results` are now automatically displayed in a scrollable table when you call `select()` on them. You can also call `table().long()` to display results in a long-view table. This replaces the need to call `print(format="rich")`. See examples in the starter tutorial.
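The long view reshapes each wide result row into (key, value) pairs. A minimal plain-Python sketch of that reshaping (illustrative only, not EDSL internals; the column names are invented):

```python
# Illustrative sketch of a "long" table view: each wide result row is
# melted into one (row, key, value) record per column.
wide_rows = [
    {"model": "gpt-4o", "answer.feeling": "great"},
    {"model": "claude-3", "answer.feeling": "fine"},
]

long_rows = [
    {"row": i, "key": key, "value": value}
    for i, row in enumerate(wide_rows)
    for key, value in row.items()
]
# long_rows[0] == {"row": 0, "key": "model", "value": "gpt-4o"}
```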

### Changed
- The progress bar is now web-based and a link to view it in a new tab is automatically returned when you call the `run()` method on a survey (`progress_bar=True` by default). See examples in the starter tutorial.

### Fixed
- `Results` objects were automatically appending the cache; this behavior was removed.


## [0.1.37] - 2024-11-14
### Added
- [In progress] `ScenarioList.from_sqlite` allows you to create a list of scenarios from a SQLite table.
- EDSL auth token: It will automatically retrieve your EXPECTED_PARROT_API_KEY and write it to your *.env* file. *How it works:* If you try to run a survey remotely without storing your EXPECTED_PARROT_API_KEY, you will see a message with a Coop login link that will automatically write your key to your *.env* file when you click on the link and log in. Example message:
```
You're missing some of the API keys needed to run this job:
🔑 OPENAI_API_KEY
You can either add the missing keys to your .env file, or use remote inference.
Remote inference allows you to run jobs on our server.
🚀 To use remote inference, sign up at the following link:
https://www.expectedparrot.com/login?edsl_auth_token=gVSxKvDcuB1x6d_dqa7xrA
Once you log in, we will automatically retrieve your Expected Parrot API key and continue your job remotely.
⠇ Waiting for login. Last checked: 2024-11-13 11:36:44 AM
```

Once you have logged in you will see a confirmation message:
```
✨ API key retrieved and written to .env file.
```

### Changed
- [In progress] `QuestionMultipleChoice` may be modified to allow combined options and free response "Other" option, as well as non-responsive answers. Previously, an error was thrown if the agent did not select one of the given options. Details TBD.
- The `AgentList` method `from_csv()` now allows you to (optionally) automatically specify the `name` parameters for agents by including a column "name" in the CSV. Other columns are (still) passed as agent `traits`. See an example: https://docs.expectedparrot.com/en/latest/agents.html#from-a-csv-file
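A plain-Python sketch of that CSV convention (illustrative only, not the actual EDSL implementation; the column names and values are invented):

```python
import csv
import io

# Sketch of the from_csv() convention: a "name" column (if present)
# becomes the agent name; every other column is passed as a trait.
csv_text = "name,age,occupation\nAlice,30,engineer\nBob,45,teacher\n"

agents = []
for row in csv.DictReader(io.StringIO(csv_text)):
    name = row.pop("name", None)  # the "name" column is optional
    agents.append({"name": name, "traits": row})
# agents[0] == {"name": "Alice", "traits": {"age": "30", "occupation": "engineer"}}
```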

- The `Jobs` method `run()` now takes a parameter `remote_inference_results_visibility` to set the visibility of results of jobs that are run remotely. The allowed visibility settings are `public`, `private` and `unlisted` (the default). This parameter has the same effect as passing the `visibility` parameter to the `push()` and `patch()` methods for posting and updating objects at the Coop. For example, these commands have the same effect when remote inference is activated:

```
Survey.example().run()
```
```
Survey.example().run(remote_inference_results_visibility="unlisted")
```

### Fixed
- Bug in using f-strings and scenarios at once. Example usage: https://docs.expectedparrot.com/en/latest/scenarios.html#using-f-strings-with-scenarios

- Bug in optional question parameters `answering_instructions` and `question_presentation`, which can be used to modify user prompts separately from modifying question texts. Example usage: https://docs.expectedparrot.com/en/latest/questions.html#optional-question-parameters


## [0.1.36] - 2024-10-28
### Added
- Method `show_prompts()` can be called on a `Survey` to display the user prompt and system prompt. This is in addition to the existing method `prompts()` that is called on a `Job` which will return the prompts and additional information about the questions, agents, models and estimated costs. Learn more: https://docs.expectedparrot.com/en/latest/prompts.html

- Documentation on storing API keys as "secrets" for using EDSL in Colab.

### Changed
- `Conversation` module works with multiple models at once.

- Improved features for adding new models.


## [0.1.35] - 2024-10-17
### Fixed
- Access to OpenAI o1 models


## [0.1.34] - 2024-10-15
### Added
- Survey Builder is a new interface for creating and launching hybrid human-AI surveys. It is fully integrated with EDSL and Coop. Get access by activating beta features from your Coop account profile page. Learn more: https://docs.expectedparrot.com/en/latest/survey_builder.html

- `Jobs` method `show_prompts()` returns a table showing the user and system prompts that will be used with a survey, together with information about the agent and model and estimated cost for each interview. `Jobs` method `prompts` returns the information in a dataset.

- `Scenario` objects can contain multiple images to be presented to a model at once (works with Google models).

### Fixed
- Bug in piping a `ScenarioList` containing multiple lists of `question_options` to use with questions.


## [0.1.33] - 2024-09-26
### Added

- Optional parameters for `Question` objects:
- `include_comment = False` prevents a `comment` field from being added to a question (default is `True`: all question types other than free text automatically include a field for the model to comment on its answer, unless this parameter is passed)
- `use_code = True` modifies user prompts for question types that take `question_options` to instruct the model to return the integer code for an option instead of the option value (default is `False`)
- `answering_instructions` and `question_presentation` allow you to control exact prompt language and separate instructions for the presentation of a question
- `permissive = True` turns off enforcement of question constraints (e.g., if min/max selections for a checkbox question have been specified, you can set `permissive = True` to allow responses that contain fewer or greater selections) (default is `False`)

- Methods for `Question` objects:
- `loop()` generates a list of versions of a question for a `ScenarioList` that is passed to it. Questions are constructed with a `{{ placeholder }}` for a scenario as usual, but each scenario value is added to the question when it is created instead of when a survey is run (which is done with the `by()` method). Survey results for looped questions include fields for each unique question but no `scenario` field. See examples: https://docs.expectedparrot.com/en/latest/starter_tutorial.html#adding-scenarios-using-the-loop-method and https://docs.expectedparrot.com/en/latest/scenarios.html#looping

- Methods for `ScenarioList` objects:
- `unpivot()` expands a scenario list by specified identifiers
- `pivot()` undoes `unpivot()`, collapsing scenarios by identifiers
- `give_valid_names()` generates valid Pythonic identifiers for scenario keys
- `group_by()` groups scenarios by identifiers or applies a function to the values of the specified variables
- `from_wikipedia_table()` converts a Wikipedia table into a scenario list. See examples: https://docs.expectedparrot.com/en/latest/notebooks/scenario_list_wikipedia.html
- `to_docx()` exports scenario lists as structured Docx documents

- Optional parameters for `Model` objects:
- `raise_validation_errors = False` causes exceptions to only be raised (interrupting survey execution) when a model returns nothing at all (default: `raise_validation_errors = True`)
- `print_exceptions = False` causes exceptions to not be printed at all (default: `print_exceptions = True`)

- Columns in `Results` for monitoring token usage:
- `generated_tokens` shows the tokens that were generated by the model
- `raw_model_response.<question_name>_cost` shows the cost of the result for the question, applying the token quantities and prices
- `raw_model_response.<question_name>_one_usd_buys` shows the number of results for the question that 1 USD will buy
- `raw_model_response.<question_name>_raw_model_response` shows the raw response for the question

- Methods for `Results` objects:
- `tree()` displays a nested tree for specified components
- `generate_html()` and `save_html()` generate and save HTML code for displaying results
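The construction-time expansion that `loop()` performs can be sketched in plain Python (an illustrative approximation, not EDSL's implementation; the scenario values are invented):

```python
# Illustrative approximation of loop(): expand a templated question over
# a list of scenarios when the questions are constructed, rather than
# binding scenarios at run time with by().
scenarios = [{"item": "coffee"}, {"item": "tea"}]
template = "How much do you enjoy {{ item }}?"

questions = []
for scenario in scenarios:
    text = template
    for key, value in scenario.items():
        # Substitute each scenario value into its placeholder.
        text = text.replace("{{ " + key + " }}", value)
    questions.append(text)
# questions == ["How much do you enjoy coffee?", "How much do you enjoy tea?"]
```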

### Changed
- General improvements to exception reports.

- General improvements to the progress bar: `survey.run(progress_bar=True)`

- Question validation methods no longer use JSON. This eliminates JSON-related exceptions that were previously common with certain models.

- The base agent instructions template is no longer added to a job if no agent is used with a survey (reducing token usage).

- The `select()` method (for `Results` and `ScenarioList`) now allows partial match on key names to save typing.

### Fixed
- Bug in enforcement of token/rate limits.

- Bug in generation of exceptions report that excluded agent information.


## [0.1.32] - 2024-08-19
### Added
- Models: AWS Bedrock & Azure

- Question: New method `loop()` allows you to create versions of questions when you are constructing a survey. It takes a `ScenarioList()` as a parameter and returns a list of `Question` objects.

### Fixed
- Bug in `Survey` question piping prevented you from adding questions after piping.


## [0.1.31] - 2024-08-15

- `ScenarioList.from_sqlite` allows you to create a list of scenarios from a SQLite table.

- Added LaTeX support to SQL outputs and ability to write to files: `Results.print(format="latex", filename="example.tex")`

- Options that we think of as "terminal", such as `sql()`, `print()`, `html()`, etc., now take a `tee` boolean that causes them to return `self`. This is useful for chaining: e.g., running `print(format = "rich", tee = True)` returns `self`, which allows you to also run `print(format = "rich", tee = True).print(format = "latex", filename = "example.tex")`.
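The chaining that `tee` enables relies on a common pattern: a terminal method returns `self` instead of `None`. A minimal standalone sketch (hypothetical class, not EDSL's implementation):

```python
class Printable:
    """Toy object whose print() can optionally return self for chaining."""

    def __init__(self, data):
        self.data = data
        self.calls = []  # record (format, filename) for illustration

    def print(self, format="rich", filename=None, tee=False):
        self.calls.append((format, filename))
        # A real implementation would render self.data here; with
        # tee=True the object returns itself so calls can be chained.
        return self if tee else None

p = Printable("results")
p.print(format="rich", tee=True).print(format="latex", filename="example.tex")
# p.calls == [("rich", None), ("latex", "example.tex")]
```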


## [0.1.30] - 2024-07-28
### Added
- Ability to create a `Scenario` for `question_options`. Example:
```
from edsl import QuestionMultipleChoice, Scenario

q = QuestionMultipleChoice(
    question_name = "capital_of_france",
    question_text = "What is the capital of France?",
    question_options = "{{question_options}}"
)
s = Scenario({'question_options': ['Paris', 'London', 'Berlin', 'Madrid']})
results = q.by(s).run()
```


## [0.1.29] - 2024-07-21
@@ -431,4 +589,4 @@ There is one for each question. The key is the question_name + `_raw_response_mo

## [0.1.0] - 2023-12-20
### Added
- Base feature
- Base feature
55 changes: 40 additions & 15 deletions Makefile
@@ -3,7 +3,7 @@
###############
GIT_ROOT ?= $(shell git rev-parse --show-toplevel)
PROJECT_NAME ?= $(shell basename $(GIT_ROOT))
.PHONY: bump docs docstrings find help integration
.PHONY: bump docs docstrings find help integration model-report

###############
##@Utils ⭐
@@ -28,7 +28,12 @@ clean: ## Clean temp files
[ ! -d .temp ] || rm -rf .temp
[ ! -d dist ] || rm -rf dist
[ ! -d htmlcov ] || rm -rf htmlcov
[ ! -f output.html ] || rm output.html
[ ! -d prof ] || rm -rf prof
[ ! -f test.dta ] || rm test.dta
[ ! -f *.docx ] || rm *.docx
[ ! -f *.html ] || rm *.html
[ ! -f *.json ] || rm *.json
find . -type d -name '.venv' -prune -o -type f -name '*.db' -exec rm -rf {} +
find . -type d -name '.venv' -prune -o -type f -name '*.db.bak' -exec rm -rf {} +
find . -type d -name '.venv' -prune -o -type f -name '*.log' -exec rm -rf {} +
@@ -53,6 +58,10 @@ clean-test: ## Clean test files
[ ! -f "$$file" ] || rm "$$file"; \
done

model-report: ## Generate a model report
python integration/test_all_questions_and_models.py | tee -a model_report.txt
echo "Model report generated in model_report.txt"

clean-all: ## Clean everything (including the venv)
@if [ -n "$$VIRTUAL_ENV" ]; then \
echo "Your virtual environment is active. Please deactivate it."; \
@@ -133,6 +142,10 @@ test: ## Run regular tests (no Coop tests)
make clean-test
pytest -xv tests --nocoop

test-token-bucket: ## Run token bucket tests
make clean-test
pytest -xv tests --nocoop --token-bucket

test-coop: ## Run Coop tests (no regular tests, requires Coop local server running)
make clean-test
pytest -xv tests --coop
@@ -171,11 +184,34 @@ test-doctests: ## Run doctests
pytest --doctest-modules edsl/language_models
pytest --doctest-modules edsl/data
pytest --doctest-modules edsl/study
pytest --doctest-modules edsl/conjure

test-services:
python integration/test_all_questions_and_models.py

# Directory containing the notebooks
NOTEBOOK_DIR := docs/notebooks

.PHONY: test-notebooks

# Test notebooks
test-notebooks:
@if [ -z "$(notebook)" ]; then \
echo "Testing all notebooks..."; \
pytest -v integration/active/test_notebooks.py; \
else \
echo "Testing notebook: $(notebook)"; \
pytest -v integration/active/test_notebooks.py -k "$(notebook)"; \
fi


# .PHONY: test-notebooks
# test-notebooks: ## Run the notebooks tests
# pytest -v integration/active/test_example_notebooks.py

test-integration: ## Run integration tests via pytest **consumes API credits**
cd integration/printing && python check_printing.py
pytest -v integration/test_example_notebooks.py
# cd integration/printing && python check_printing.py
pytest -v integration/active
# pytest -v integration/test_example_notebooks.py
pytest -v integration/test_integration_jobs.py
pytest -v integration/test_memory.py
pytest -v integration/test_models.py
@@ -187,14 +223,3 @@ integration-job-running: # DOES NOT WORK!

integration-tricky-questions: # DOES NOT WORK!
pytest -v --log-cli-level=INFO integration/test_tricky_questions.py

# ###############
# ##@COOP 🪺
# ###############
# env-chick:
# @echo "Setting up the environment"
# cp .env_chick .env

# env-local-coop:
# @echo "Setting up the environment"
# cp .env_local_coop .env
2 changes: 1 addition & 1 deletion README.md
@@ -19,7 +19,7 @@ A quick example:

```python
# Import a question type
from edsl.questions import QuestionMultipleChoice
from edsl import QuestionMultipleChoice

# Construct a question using the question type template
q = QuestionMultipleChoice(