Exclude posts in own subs from spam detection #1946

Scroogey-SN · 2025-03-05T23:06:13Z

Description

Modify SQL function item_spam to ignore posts made in own subs:

diff --git a/prisma/migrations/20240416181215_item_spam_exclude_bios/migration.sql b/prisma/migrations/20240416181215_item_spam_exclude_bios/migration.sql
index 5ad205c7..1d4930f5 100644
--- a/prisma/migrations/20240416181215_item_spam_exclude_bios/migration.sql
+++ b/prisma/migrations/20240416181215_item_spam_exclude_bios/migration.sql
@@ -1,5 +1,5 @@
--- exclude bios from spam detection
-CREATE OR REPLACE FUNCTION item_spam(parent_id INTEGER, user_id INTEGER, within INTERVAL)
+-- exclude posts in own subs from spam detection
+CREATE OR REPLACE FUNCTION item_spam(parent_id INTEGER, user_id INTEGER, within INTERVAL, sub_name TEXT)
 RETURNS INTEGER
 LANGUAGE plpgsql
 AS $$
@@ -12,16 +12,22 @@ BEGIN
         RETURN 0;
     END IF;

+    IF sub_name IS NOT NULL AND user_id = (SELECT "Sub"."userId" FROM "Sub" WHERE "Sub"."name" = sub_name) THEN
+        RETURN 0;
+    END IF;
+
     SELECT count(*) INTO repeats
     FROM "Item"
+    LEFT JOIN "Sub" ON "Sub"."name" = "Item"."subName"
     WHERE (
         (parent_id IS NULL AND "parentId" IS NULL)
         OR
         ("parentId" = parent_id AND user_id <> (SELECT i."userId" FROM "Item" i WHERE i.id = "Item"."rootId"))
     )
-    AND "userId" = user_id
+    AND "Item"."userId" = user_id
     AND "bio" = 'f'
-    AND created_at > now_utc() - within;
+    AND ("Sub"."name" IS NULL OR "Sub"."userId" <> user_id)
+    AND "Item".created_at > now_utc() - within;

     IF parent_id IS NULL THEN
         RETURN repeats;

I tested by running the replaced function through ./sndev psql and

stackernews=# select item_spam(NULL, 15556, interval '1' hour);

And testing by posting as territory owner in the web interface.

Note that this will only ignore posts made by a territory owner in their own subs. Posts made in others' subs still count. As they should, IMO.

ekzyis

You found the right spot to fix #787!

I think it's almost there, but I found a bug regarding replies, see comment.

Will review more later, but I wanted to mention this before I go to lunch.

prisma/migrations/20250305175301/migration.sql

ekzyis

I noticed that since this filters out posts from own territories, it will still apply the spam fee to posts in your own territories if you posted before in other territories.

I think stackers will expect that they never have to pay the spam fee in their own territories. For this, we need to also pass subName to item_spam so we can check if we want to create a post in one of our own territories. If that is the case, we immediately return 0. That should also make the item_spam function easier to read and thus less prone to bugs.

Does this make sense?

Scroogey-SN · 2025-03-06T21:57:38Z

Yes, I agree that's what the territory owner would like, ideally. It's also a more invasive change. The middle ground would be better than nothing (the territory owner is unlikely to spam others' territories).

ekzyis · 2025-03-07T20:04:48Z

It's also a more invasive change. The middle ground would be better than nothing (the territory owner is unlikely to spam others' territories).

It's a bigger change, yes, but as-is, this does not fix #787 because a territory founder can still pay 10x in their own territories. For that to happen, they only need to post in another territory once and then they will pay 10x in their own territory for the next 10 minutes. I don't consider this to be an edge case.

Scroogey-SN · 2025-03-07T20:25:35Z

What I slightly dislike is how the additional parameter will change the semantics of the function.

Right now, the function tells you how badly a particular user has spammed in the interval.
Some callers will continue to use the function as such.
But by passing a particular sub, the caller asks instead "should the user be penalized for posting here?"

A territory owner who has spammed a foreign territory and subsequently wants to post in his own territory without penalty at the same time

has spammed
should not be penalized for a new post to in his own territory

So the function now returns two different kinds of information.

Scroogey-SN · 2025-03-07T20:52:46Z

To play devil's advocate:

If territory owners should be able to post to their territories for base cost (no matter how much they spammed), shouldn't the same apply to any user posting their bio (the existing exclusion AND bio = 'f'), too?

Right now, posting your bio doesn't increase the spam counter, but when you have otherwise spammed before, your cost of posting bio multiplies.

Maybe the function should continue to answer only the question "how much has the user spammed in the interval" (where bio posting and territory owner posting in own territories are not spam), but the caller should not always do the POWER multiplication (posting should only do it if the user is not the owner, bio posting should never do it)?

Co-authored-by: ekzyis <[email protected]>

ekzyis · 2025-03-13T01:24:10Z

Sorry for the late reply.

What I slightly dislike is how the additional parameter will change the semantics of the function. [...] So the function now returns two different kinds of information.

What we want for #787 is to never make founders pay spam fees in their territory. Everything else is secondary and can be updated accordingly.

Right now, posting your bio doesn't increase the spam counter, but when you have otherwise spammed before, your cost of posting bio multiplies.

This is a great point! Bios should not affect spam fees ( ✅ ) and not be affected by them ( ❌ ).

Scroogey-SN · 2025-03-14T17:27:57Z

Now I'm passing sub into item_spam() to return 0 for the owner.

I tested the following sequence:

Login as unrelated user

First post has fee 1
Second post has fee 10
Third post has fee 100
First comment on unrelated post has fee 1
Second comment on same post has fee 10
Third comment on same post has fee 100
Comment on own post has fee 1

Login as user who owns sub 'AGORA'

First post to 'AMA' costs 1
Second post to 'AMA' costs 10
Third post to 'AMA' costs 100
First post to 'AGORA' costs 1
Second post to 'AGORA' costs 1
Third post to 'AGORA' costs 1
First comment to unrelated post in 'AMA' costs 1
Second comment to unrelated post in 'AMA' costs 10
Third comment to unrelated post in 'AMA' costs 100
Comment to own post in 'AMA' costs 1

In short, the behaviour is now as before, except for two things:

posts of sub owners in own subs don't count as spam (subsequent posts in foreign subs are not punished)
posts and comments of sub owners in own subs always cost base (no matter how much they spammed in foreign subs before)

I believe that's what you asked for.

I wasn't exactly sure if any of the parameters should have been named subName instead of sub, there may be naming conventions to the layers that I mixed up, should be cosmetic, but let me know.

ekzyis

Not a full review yet because I found some issues just from looking at the code. Will continue review when they are fixed.

components/post.js

ekzyis · 2025-03-18T02:44:15Z

components/reply.js

@@ -71,7 +71,7 @@ export default forwardRef(function Reply ({
        // no lag for itemRepetition
        if (!item.mine && me) {
          cache.updateQuery({
-            query: gql`{ itemRepetition(parentId: "${parentId}") }`
+            query: gql`{ itemRepetition(parentId: "${parentId}", sub: "${sub?.name}") }`


See other comment

I left this as-is, assuming that was a hint how to solve the other comment.

I meant that you have the same issue ~~with searching for~~ updating the query with a sub named 'undefined' here, too.

See comment above this one

prisma/migrations/20250314104901_pass_sub_to_item_spam/migration.sql

components/fee-button.js

ekzyis

This works but the code can be improved, see my two comments.

ekzyis · 2025-03-18T21:04:13Z

components/fee-button.js

+  const query = parentId && sub
+    ? gql`{ itemRepetition(parentId: "${parentId}", sub: "${sub}") }`
+    : (parentId
+        ? gql`{ itemRepetition(parentId: "${parentId}") }`
+        : (sub
+            ? gql`{ itemRepetition(sub: "${sub}") }`
+            : gql`{ itemRepetition }`
+          ))


As mentioned in the other comment, this should use GraphQL variables.

ekzyis · 2025-03-18T21:10:28Z

components/reply.js

@@ -71,7 +71,7 @@ export default forwardRef(function Reply ({
        // no lag for itemRepetition
        if (!item.mine && me) {
          cache.updateQuery({
-            query: gql`{ itemRepetition(parentId: "${parentId}") }`
+            query: gql`{ itemRepetition(parentId: "${parentId}", sub: "${sub?.name}") }`


See comment above this one

issue stackernews#787: exclude posts in own subs from spam detection

6d05682

ekzyis self-requested a review March 6, 2025 16:42

ekzyis changed the title ~~Fix issue #787: Exclude posts in own subs from spam detection~~ Exclude posts in own subs from spam detection Mar 6, 2025

Merge branch 'master' into pr/1946

af4470a

ekzyis requested changes Mar 6, 2025

View reviewed changes

prisma/migrations/20250305175301/migration.sql Outdated Show resolved Hide resolved

ekzyis reviewed Mar 6, 2025

View reviewed changes

Merge branch 'stackernews:master' into issue_787

8250c2f

Scroogey-SN and others added 2 commits March 7, 2025 21:16

Merge branch 'stackernews:master' into issue_787

9bb516f

Update prisma/migrations/20250305175301/migration.sql

991c6bc

Co-authored-by: ekzyis <[email protected]>

Scroogey-SN added 2 commits March 14, 2025 15:06

Merge branch 'stackernews:master' into issue_787

7eaae87

pass sub to item_spam() to make owner except from escalation

5b6cddf

Scroogey-SN added 3 commits March 14, 2025 17:37

remove debug values from sql

cdbd615

rename migration folder with details

55e7a0a

fix lint: space before closing curly brace

9411bf5

Scroogey-SN requested a review from ekzyis March 14, 2025 18:33

ekzyis requested changes Mar 18, 2025

View reviewed changes

Scroogey-SN added 3 commits March 18, 2025 10:31

sub may be undefined, adjust SQL function parameter name

d5ab608

Merge branch 'stackernews:master' into issue_787

84eac60

fix lint: indent

52d6b92

Scroogey-SN requested a review from ekzyis March 18, 2025 10:45

ekzyis requested changes Mar 18, 2025

View reviewed changes

ekzyis added territories ui/ux labels Mar 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exclude posts in own subs from spam detection #1946

Exclude posts in own subs from spam detection #1946

Scroogey-SN commented Mar 5, 2025 •

edited by ekzyis

Loading

ekzyis left a comment

ekzyis left a comment

Scroogey-SN commented Mar 6, 2025

ekzyis commented Mar 7, 2025

Scroogey-SN commented Mar 7, 2025

Scroogey-SN commented Mar 7, 2025

ekzyis commented Mar 13, 2025

Scroogey-SN commented Mar 14, 2025

ekzyis left a comment

ekzyis Mar 18, 2025

Scroogey-SN Mar 18, 2025

ekzyis Mar 18, 2025 •

edited

Loading

ekzyis Mar 18, 2025

ekzyis left a comment

ekzyis Mar 18, 2025 •

edited

Loading

ekzyis Mar 18, 2025

Exclude posts in own subs from spam detection #1946

Are you sure you want to change the base?

Exclude posts in own subs from spam detection #1946

Conversation

Scroogey-SN commented Mar 5, 2025 • edited by ekzyis Loading

Description

ekzyis left a comment

Choose a reason for hiding this comment

ekzyis left a comment

Choose a reason for hiding this comment

Scroogey-SN commented Mar 6, 2025

ekzyis commented Mar 7, 2025

Scroogey-SN commented Mar 7, 2025

Scroogey-SN commented Mar 7, 2025

ekzyis commented Mar 13, 2025

Scroogey-SN commented Mar 14, 2025

ekzyis left a comment

Choose a reason for hiding this comment

ekzyis Mar 18, 2025

Choose a reason for hiding this comment

Scroogey-SN Mar 18, 2025

Choose a reason for hiding this comment

ekzyis Mar 18, 2025 • edited Loading

Choose a reason for hiding this comment

ekzyis Mar 18, 2025

Choose a reason for hiding this comment

ekzyis left a comment

Choose a reason for hiding this comment

ekzyis Mar 18, 2025 • edited Loading

Choose a reason for hiding this comment

ekzyis Mar 18, 2025

Choose a reason for hiding this comment

Scroogey-SN commented Mar 5, 2025 •

edited by ekzyis

Loading

ekzyis Mar 18, 2025 •

edited

Loading

ekzyis Mar 18, 2025 •

edited

Loading