-
-
Notifications
You must be signed in to change notification settings - Fork 155
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
API results for dockets include two different "description" fields #4886
Comments
@anseljh you said this caught you, and @johnhawkinson, you griped about it, so I'm trying to find an actionable thing that would have helped. The PACER API documentation already has a note about the two description fields: (From: https://www.courtlistener.com/help/api/rest/pacer/#docket-entry-endpoint) Should we put it somewhere else or is there a tweak that would help? |
An actionable thing that would have helped is not to have two fields named the same thing. Changing the schema such that both fields appear as peers is probably extremely challenging — and arguably not correct, although CM/ECF's data model has them as peer, so I don't think that argument holds water — but renaming one or both of the fields so they have different names seems worthy of consideration.
I think expecting people to read the API documentation as distinct from glancing at the API output and making decisions based on what they see — I think reading the documentation may unfortunately be an unreasonable expectation. |
So, I'm about 99.9% sure we won't be tweaking the model for this, but maybe explaining the design will help move the conversation forward. As I think you know, there's a 1-to-N relationship between docket entries and documents, and each object can have descriptions. This means you have something like this: Where you have one docket entry with four documents fk'ed to it. Each document has a short description, and the entry itself has a long one. Now, in our search engine, this all gets flattened, so we had to resolve it, and we wound up with
😭. Like I say, I don't think this is particularly a bad model, but even if it were, it probably wouldn't be worth changing (we'd need to release API v5, and that's big and difficult). I think people just need to read the docs instead of just playing with the APIs and hoping for the best. FWIW, this isn't a mistake I see often. |
I’ll have to look back at my data later, but I think the reason this
tripped me up is that I was getting empty text on the docket entry
`description` field, so I went a level deeper to the documents under them.
This may have been a weird fluke based on how or what I was querying,
because I’m mostly not running into this anymore with a different query.
I was also in a hurry and not reading the docs ultra carefully, so I’d
accept this is partially user error.
I do think the docs are really long and could use a fresh pair of eyes to
consider reorganizing them.
|
It's not so weird. We get a lot of content from the PACER RSS feeds, which only have the short description, no the full one, so this happens a lot, actually.
Yeah, maybe fresh eyes will help. I just did a huge documentation overhaul maybe six months ago. It made them a lot longer overall, but it did make the main API page significantly shorter. |
I think this winds up being a @s-taube issue, for user research, but I don't think it's a priority because we've recently invested so much time in the API docs. |
There are two different "description" fields in API results for dockets. Users would like to be able to differentiate between the two.
For example:
and
are both from:
{ "resource_uri": "https://www.courtlistener.com/api/rest/v4/docket-entries/411967648/", "id": 411967648, "docket": "https://www.courtlistener.com/api/rest/v4/dockets/69015293/", "recap_documents": [ { "resource_uri": "https://www.courtlistener.com/api/rest/v4/recap-documents/425199869/", "id": 425199869, "filepath_local": "recap/gov.uscourts.nysd.626017/gov.uscourts.nysd.626017.204.0.pdf", "document_number": "204", "attachment_number": null, "pacer_doc_id": "127036708025", "description": "Declaration in Opposition to Motion", … … … }, … … … … … ], … "entry_number": 204, "recap_sequence_number": "2024-12-24.004", "pacer_sequence_number": 570, "description": "DECLARATION of RUDOLPH W. GIULIANI in Opposition re: 170 MOTION for Order to Show Cause Why Defendant Rudolph W. Giuliani Should not be Held in Contempt for Failing to Comply with the Turnover Orders.. Document filed by Rudolph W. Giuliani. (Attachments: # 1 Exhibit EXHIBIT 1, # 2 Exhibit EXHIBIT 2, # 3 Exhibit EXHIBIT 3, # 4 Exhibit EXHIBIT 4, # 5 Exhibit EXHIBIT 5, # 6 Exhibit EXHIBIT 6, # 7 Exhibit EXHIBIT 7, # 8 Exhibit EXHIBIT 8, # 9 Exhibit EXHIBIT 9, # 10 Exhibit EXHIBIT 10, # 11 Exhibit EXHIBIT 11).(Cammarata, Joseph) (Entered: 12/24/2024)", }
The text was updated successfully, but these errors were encountered: