Twitter API suspension #26

g4brielvs · 2019-10-14T03:42:34Z

What is the purpose of this Pull Request?

This is an analysis (take 1) to start the conversation to understand how Twitter API suspension might have impacted Rosie's level of engagement.

What was done to achieve this purpose?

I used time series analysis, particularly an autoregressive model.

How to test if it really works?

An overview of the methodology would be a good start.

Who can help reviewing it?

@cuducos @jtemporal

TODO

jtemporal

I'd love to see a .describe() on the all_tweets data frame, is it possible to add it next to the .tail()and .head() methods calls?

.describe() will give more insight whereas to the distribuition of data on the numeric columns =)

jtemporal · 2019-11-17T13:31:07Z

notebooks/2019-10-11-cuducos-impact-of-twitter-blocking-rosie.py

+plt.show()
+
+
+# Using a linear regression, numbers suggest that by mid-October **the impact of Twitter's block has been negative** in terms of engagement with Rosie's tweets – as the slope is more skwed down after the block.


typo here skwed -> skewed

jtemporal · 2019-11-17T13:35:20Z

Question: Can we reliably use the Quadratic regression on the before 2018 data? I ask this because we have a huge gap in information (which I think is due to Rosie's sabbatical).

Other than the data itself, I wonder if this is leading to an inconclusive result since the linear regression shows one thing and the quadratic another. Am I missing some mathematical/statistical concept here?

g4brielvs · 2019-11-17T14:21:43Z

@jtemporal thank you for the feedback. I am with you. I pointed out that a polynomial regression might not be the best approach here, specially because we have a reason to think that that time series is not a stationary process. That is why I used an autoregressive integrated moving average (ARIMA) model instead.

Have you had the chance to look at my notebook?

cuducos · 2019-11-17T17:52:44Z

Can we reliably use the Quadratic regression on the before 2018 data?

Probably not, but that was may naïve approach just to get started. As the mathematician who really adds values in the analysis is @g4brielvs, what about git rm my notebooks (which were merely warmups for his analysis)? We can also checkout to my commits to see what I've tried.

jtemporal · 2019-11-17T19:48:56Z

hi @g4brielvs Just started looking at yourt notebook. Bellow I'll write down some changes I think would be good to have:

I noticed some cells are run out of order, maybe you could re-run all of them and guarantee they all work if run sequentially? That would be a major improvement.
Remove @cuducos notebook. In that way we can focus on the "real" analysis =P and as he said it was just a warm up.
As I mentioned before add a .describe() after the .shape so we can have statistics on numeric features documented for anyone reading the notebook without running it.
I think there's a plot missing on the engagement part, maybe it wasn't displayed in time.

That is why I used an autoregressive integrated moving average (ARIMA) model instead.

I like that <3 I think is a better approach to the matter at hand

Note that the negative trend apparently started before and has been accentuated after the block. ... Between Fev/2019 and Apr/2019 - right after the block - the slope has higher negative value and continuously stabilizes, but in a negative trend.

<3 null hypothesis validated: block = bad

g4brielvs · 2019-11-22T00:20:07Z

@jtemporal thank you! I made those changes

g4brielvs · 2021-02-10T01:40:47Z

@jtemporal Hey! I just wanted to check if this PR is still relevant. If more changes are needed, I'd be happy to work on those.

jtemporal · 2021-02-10T16:28:05Z

Hi @g4brielvs I think we need to check with @sergiomario on this 😉

cuducos and others added 3 commits October 11, 2019 22:02

Analysis impact of Twitter API suspension

4d2bace

Analysis impact of Twitter API suspension

b2d011d

Update requirements.txt

ba5ade6

jtemporal reviewed Nov 17, 2019

View reviewed changes

g4brielvs added 2 commits November 21, 2019 19:13

Remove @cuducos notebook

71b31c8

Fix glitches

dbf0069

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Twitter API suspension #26

Twitter API suspension #26

g4brielvs commented Oct 14, 2019 •

edited

Loading

jtemporal left a comment •

edited

Loading

jtemporal Nov 17, 2019

jtemporal commented Nov 17, 2019

g4brielvs commented Nov 17, 2019 •

edited

Loading

cuducos commented Nov 17, 2019

jtemporal commented Nov 17, 2019

g4brielvs commented Nov 22, 2019

g4brielvs commented Feb 10, 2021

jtemporal commented Feb 10, 2021

		plt.show()


		# Using a linear regression, numbers suggest that by mid-October the impact of Twitter's block has been negative in terms of engagement with Rosie's tweets – as the slope is more skwed down after the block.

Twitter API suspension #26

Are you sure you want to change the base?

Twitter API suspension #26

Conversation

g4brielvs commented Oct 14, 2019 • edited Loading

jtemporal left a comment • edited Loading

Choose a reason for hiding this comment

jtemporal Nov 17, 2019

Choose a reason for hiding this comment

jtemporal commented Nov 17, 2019

g4brielvs commented Nov 17, 2019 • edited Loading

cuducos commented Nov 17, 2019

jtemporal commented Nov 17, 2019

g4brielvs commented Nov 22, 2019

g4brielvs commented Feb 10, 2021

jtemporal commented Feb 10, 2021

g4brielvs commented Oct 14, 2019 •

edited

Loading

jtemporal left a comment •

edited

Loading

g4brielvs commented Nov 17, 2019 •

edited

Loading