-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Entrez search result limit #16
Comments
I really need your help, my project is stuck and i'm really desperate |
Can you post a reproducible example code using |
thank you very much for the reply! |
actually i'll put an example, that I partly succeeded when I try to use in_batches_of(1_000).fetch I get the an error of 'raise ValueError( |
I am currently attempting to work with GEO series in the So instead of
I'd like to just say
and get useful results (which will then be summarized and get their |
|
Ok, and there is no work so far for adding batching to search? I don't understand the system very well yet, is there a special reason why |
Thanks for your interest! To implement batching for |
Alright, I just understodd why the batching doesn't work with |
Actually, why is there this limit to 100'000 records for (e)
The documentation talks about a 10'000 UID limit, but for Some further experimentation revealed the actual limit after which entrez refuses to send anything is 2'147'483'647, aka the maximum number for a 32 bit signed integer, so if this works for all databases, retrieving all UIDs for a query would just entail setting max_results to that.
|
Well the limit on
It is good to hear that it works for you with a specific database but I suspect it might not work for all databases.
Are there really as many records in the |
But that would be 10'000 not 100'000, or is there just a typo somewhere?
I am currently checking what I get back for the other ones, as defined in
In gds for that query there are ~210'000 records. In total though, gds comprises 6'961'960 records. I just wanted to find out what the actual maximum number possible is, because Entrez always returns the number of records that are the minimum of |
Yes, it appears so that currently it is to lax a limit and should have been 10k nor 100k. |
Should this limit then be dynamic, depending on the database? I can see if I can determine individual database limits by experimentation... |
Well these limits can change. I would be more inclined to have a separate argument |
This seems reasonable 👍 |
Could you look over #18 please, @krassowski? This should solve my immediate problem, and at least logically I don't see why the actual request limit for |
v0.3.7 is now released an available on PyPI: https://pypi.org/project/easy-entrez/0.3.7/ |
Great thank you very much! |
Hello!
I've come to this project since the BioPython entrez search fail me.
It used to return more than 9999 results but now there's this cursed limit.
so several question
I need the most simple use of these functions, I want to put a term ('T cell' for example) and get a list of the most 100k relevant articles PMIDs. That's the only thing standing in my project way
Cheers
The text was updated successfully, but these errors were encountered: