-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Twitter API suspension #26
base: master
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'd love to see a .describe()
on the all_tweets
data frame, is it possible to add it next to the .tail()
and .head()
methods calls?
.describe()
will give more insight whereas to the distribuition of data on the numeric columns =)
plt.show() | ||
|
||
|
||
# Using a linear regression, numbers suggest that by mid-October **the impact of Twitter's block has been negative** in terms of engagement with Rosie's tweets – as the slope is more skwed down after the block. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
typo here skwed
-> skewed
Question: Can we reliably use the Quadratic regression on the before 2018 data? I ask this because we have a huge gap in information (which I think is due to Rosie's sabbatical). Other than the data itself, I wonder if this is leading to an inconclusive result since the linear regression shows one thing and the quadratic another. Am I missing some mathematical/statistical concept here? |
@jtemporal thank you for the feedback. I am with you. I pointed out that a polynomial regression might not be the best approach here, specially because we have a reason to think that that time series is not a stationary process. That is why I used an autoregressive integrated moving average (ARIMA) model instead. Have you had the chance to look at my notebook? |
Probably not, but that was may naïve approach just to get started. As the mathematician who really adds values in the analysis is @g4brielvs, what about |
hi @g4brielvs Just started looking at yourt notebook. Bellow I'll write down some changes I think would be good to have:
I like that <3 I think is a better approach to the matter at hand
<3 null hypothesis validated: block = bad |
@jtemporal thank you! I made those changes |
@jtemporal Hey! I just wanted to check if this PR is still relevant. If more changes are needed, I'd be happy to work on those. |
Hi @g4brielvs I think we need to check with @sergiomario on this 😉 |
What is the purpose of this Pull Request?
This is an analysis (take 1) to start the conversation to understand how Twitter API suspension might have impacted Rosie's level of engagement.
What was done to achieve this purpose?
I used time series analysis, particularly an autoregressive model.
How to test if it really works?
An overview of the methodology would be a good start.
Who can help reviewing it?
@cuducos @jtemporal
TODO