Very much a work in progress; contact owner with questions.
This tool consists of two scripts. main.py
reads in a config file that
describes your class (names and values of assignments, etc) as well as some
number of source spreadsheets (roster, gradesheets, etc, in either csv or xlsx
form), and produces formatted grade reports for each student.
The second script, autoconf.py
, helps you build the config file required by
main.py
. It takes in source spreadsheets and outputs its best guess for the
config file, which you can then edit. It can also be run in an incremental
fashion, where it takes in an existing config file as well and adds to it.
Requires Python. (possibly even Python 3.7.2+ ?)
Requires wkhtmltopdf (if you want pdf reports instead of just html)
git clone https://github.com/BenjaminCosman/gradeReports.git
cd gradeReports
python3 -m venv .venv
If you do this, make sure to activate that environment for the rest of installation as well as whenever you run the scripts:
source .venv/bin/activate
(Unix zsh/bash)
.venv\Scripts\activate
(Windows PowerShell)
pip3 install -r requirements.txt
Check if the commands in the first few steps of the tutorial work (see below).
It is week 3 of the fictional class CSE777, and we'd like to generate a preliminary progress report for our students. We've already downloaded all relevant files to local folder examples/data
: the roster, gradescope gradebook, iclicker and discussion attendance spreadsheets, and google forms responses for quizzes, surveys, and clicker registrations.
- Later steps will show you how to produce a config file, but for now we'll use the one at
examples/config.json
. Once we have this config file, all we need to do to generate reports is run:
python3 main.py examples/config.json --pdf
You should see simplified text versions of the reports on your terminal, and formatted reports suitable for distribution in a reports
folder.
(The suggested way to distribute these reports would be to upload them all to gradescope as a 0-point assignment; gradescope can automatically match the files with students using the name and student ID fields.)
- Now we will work backwards and produce that config file needed for step 1. Run:
python3 autoconf.py examples/data
This will produce a config file at tempConfig.json
. Now try using it as in step 1 (except since we're not uploading these reports, there's no need to spend the time converting the html reports to pdfs so we leave out the --pdf
option from now on):
python3 main.py tempConfig.json
Open one of the generated reports (e.g. reports/A12345678.html
). It should look mostly correct and ready for distribution, with a few exceptions:
- There are some extra assignments, like clickerRegistrations, and discussion attendance through week 10.
- All headers and some assignments need to be renamed.
These are quick to fix manually:
- Open
tempConfig.json
- At the bottom of the file (
outputs
->content
), rename thetitle
values. - Find and delete unwanted assignments in the middle. For example, to remove weeks 4-10 of discussion attendance, find and delete the objects containing the text
Week 4
,Week 5
, etc. Make sure the resulting config file is still valid JSON: in particular, the last element of a list needs to NOT have a comma after it (and each other element does need a comma) - For any assignment where a Timestamp column was detected, autoconf has inserted a
due_date
field in the distant future. If there is a deadline you want to enforce, change these fields; late assignemnts will get 0 credit.
After editing the file you can re-generate the reports (python3 main.py tempConfig.json
) and you should see your changes reflected there. That's it for the most important features; at this point you should be able to generate a real report using your own data.
COMING SOON:
- Now it's week 5 and you need a config file that includes the last two weeks of assignments. TODO Right now incremental autoconfigure doesn't work well, so either edit your old config manually or create a brand new one with autoconf.py and edit that one. Soon however you will be able to automatically update your old config using
python3 autoconf.py -i oldConfig.json
Once you already have your sources and a config file, just run
python3 main.py CONFIG_FILE
First download all sources. Now you need to create a JSON config file. Full documentation follows, but the easiest way to do this may be to just run autoconf:
python3 autoconf.py SOURCE1 SOURCE2 ...
where each SOURCE is either a csv/xlsx file or a folder containing such files.
This will create a config file at tempConfig.json
(use -o
to choose a
different output filename). Then as long as your sources were formatted in ways
autoconf could understand, you should be able to immediately generate reports
with main.py
. You can then see directly what different parts of the config
file are doing, and edit them to get the result you want.
The example class config (examples/config.json
) may also be useful for
understanding these files.
The config file has three parts, described in more detail in the next section.
- "studentAttributes" describes what non-grade information you are tracking for each student, like their name, student ID, and email.
- "sources" describes how to read each of your raw data spreadsheets. From each row in the spreadsheet, we seek to extract information sufficient to identify one student, and then one or both of 1) additional non-grade information, and 2) one or more grades. For example, from a "clicker registration" spreadsheet we hope to identify a student by student ID and then extract a clicker ID that we can link to that student (and we are not extracting any grades). Then from a separate clickers participation spreadsheet, we identify a student by clicker ID and then extract multiple attendance scores.
- "outputs" describes what text and grades go on the report
key: "studentAttributes" value: A dictionary of non-grade attributes that are associated with each student, e.g. name, student ID. For example:
"studentAttributes": {
"Section": {"onePerStudent": true},
"Student ID": {"identifiesStudent": true, "onePerStudent": true, "filters": ["strip", "toUpper"]},
}
- "identifiesStudent" (default: false) should be set to true if the attribute can be (and is) used to uniquely identify a student. For example, a Student ID can identify a student; a name usually can NOT (two people can have the same name).
- "onePerStudent" (default: false) should be set to true if each student should only have one of this attribute. For example, each student probably has only one university-issued Student ID, but they may have multiple email addresses.
- "filters" (default: []) can be set to a list of operations that should be
performed to clean and validate this attribute whenever it is read. For example,
if students are asked to enter their student ids on a web form, then you may want
to strip off any whitespace and change any letters to uppercase. For a full list
of filter options (and to add your own, if needed), see the definition of
filtersAndChecks
inlib/munge.py
. - "onlyPrintIfPresent" (default: false): if set to true, then any student who does NOT have this attribute (e.g. they are not in the roster so they don't have a Roster Name) will not have their report printed.
key: "sources" value: a list of source config objects, each of which looks like this:
{
"file": FILEPATH,
"sheetName": SHEETNAME,
"isRoster": [true|false],
"attributes": {
ATTR_COLUMN1: ATTR_NAME1,
...
},
"assignments": [
ASSIGNMENT1_CONFIG,
...
]
}
- FILEPATH is the path to the spreadsheet file (either absolute, or relative to the directory from which you will run
main.py
). - SHEETNAME (optional): if source is an excel workbook, the sheet to read from
- isRoster (optional, default=false): is the sheet formatted like a UCSD roster, i.e. has an extra mini table in the first few rows with section info
- The "attributes" value says how to read student identifying data from the spreadsheet. For example, if the column header is "Type in your PID" and internally (see "studentAttributes" above) we call this attribute "Student ID", then the "attributes" object should include a
"Type in your PID":"Student ID"
entry. - Each ASSIGNMENT_CONFIG is an object with the following properties:
- "name": the display name you want to appear on the report, e.g. "Homework 1"
- "sheetName" (optional syntactic sugar): the sheet of the excel book to read from, specified here instead of at the top level so that you can have more than one sheet in the same source config object
- "scoreCol" (optional): the header of the column which contains how many points were awarded. If not specified, then all students in the spreadsheet will get full credit (useful when scoring just for completion, e.g. a survey)
- "max_points": the maximum score on the assignment
- "type": a string of your choice representing a category name, e.g. "homework" or "exam". Will be used below (see "outputs") to organize the report by category
- "due_date" (optional): a timestamp like "10/3/2018 23:59:59". Assignments received after this time get no credit. It is assumed this timestamp and all timestamps appearing in the spreadsheet are from the same timezone (which should NOT be specified explicitly)
- "timestampCol" (optional, used in conjunction with "due_date"): the header of the column containing submission timestamps
key: "outputs" value: an object describing what should appear on the reports, e.g.
{
"report-name": "CSE777 Preliminary Grade Report",
"disclaimer-text": "These are all the scores recorded for you in this course. If there are any discrepancies between the scores you see here and your own records, email...",
"content": [
{ "title": "Homework", "from": "homework" },
{ "title": "Surveys", "from": "survey" }
]
}
- "report-name": will appear at the top in large font
- "disclaimer-text": will appear next in smaller font
- "content": a list of objects where each object specifies the display title of a category and the 'type' chosen earlier (see "type" above). All grades of the given type will appear on the report under that title; grades of types not specified here will not appear at all.
- No two assignments can have the same name, even if they are in different categories (e.g. "Week 1" for both discussion attendance and weekly review quiz), or else the grades from one will silently overwrite the other's.
- Scored google forms display the score as SCORE / MAX but store only the raw score. So if you download it as a csv the formatting is turned into the canonical text and you will need the "stripDenominator" filter; if you download it as an xlsx then the formatting remains visible to you but hidden from the code, so you will not need that filter and also autoconf may do a worse job of guessing the intended max score.
- On Windows it seems to have a hard time finding the wkhtmltopdf binary, even if it is on your PATH. You can get around this for now by specifying the path manually using the -w flag, e.g.
python3 main.py examples/config.json --pdf -w "C:\Program Files\wkhtmltopdf\bin\wkhtmltopdf.exe"