No instruct dp #20

Open · wants to merge 8 commits into main

Conversation

AndrewRWilliams (Collaborator):

Adds support for using Direct Prompt with models that do not support instructions in HF.

  • Checks if the tokenizer has a chat template.
  • If not, the context is simply concatenated into one long string and fed to the model.
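The dispatch described above can be sketched roughly as follows; the function name `build_prompt` and the `context_parts` argument are illustrative assumptions, not the PR's actual API:

```python
def build_prompt(tokenizer, context_parts):
    """Use the tokenizer's chat template when it has one; otherwise
    fall back to Direct Prompt: plain concatenation of the context."""
    text = "\n".join(context_parts)
    if getattr(tokenizer, "chat_template", None):
        # Instruction-tuned model: wrap the context as a user message.
        messages = [{"role": "user", "content": text}]
        return tokenizer.apply_chat_template(
            messages, tokenize=False, add_generation_prompt=True
        )
    # Base model with no template: feed one long string to the model.
    return text
```

The key design point is that the check is on the tokenizer (HF tokenizers expose a `chat_template` attribute), so no model-specific allowlist is needed.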

Comment on lines +108 to +123

# Build one pattern line per required timestamp:
# (YYYY-MM-DD HH:MM:SS,[-+]?\d+(?:\.\d+)?)
# No spaces allowed anywhere, so everything is literally "fixed"
# except for the numeric portion.
lines = [
    # NOTE: braces must be doubled inside the f-string so the regex
    # quantifiers {1,20} and {0,20} survive interpolation.
    rf"\({re.escape(ts)},[-+]?\d{{1,20}}(?:\.\d{{0,20}})?\)"
    for ts in required_timestamps
]

# Join lines with exactly one "\n".
body = r"\n".join(lines)

# Return the full pattern, ensuring a single newline
# after <forecast> and before </forecast>.
return rf"<forecast>\n{body}\n</forecast>"
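For reference, the snippet above can be wrapped into a self-contained function and sanity-checked against a sample completion. The function name `make_forecast_pattern` is an assumption; note the doubled braces needed so the quantifiers survive f-string interpolation:

```python
import re

def make_forecast_pattern(required_timestamps):
    # One pattern line per required timestamp; braces are doubled so the
    # regex quantifiers {1,20} and {0,20} survive f-string interpolation.
    lines = [
        rf"\({re.escape(ts)},[-+]?\d{{1,20}}(?:\.\d{{0,20}})?\)"
        for ts in required_timestamps
    ]
    body = r"\n".join(lines)  # regex "\n" matches a real newline
    return rf"<forecast>\n{body}\n</forecast>"

pattern = make_forecast_pattern(["2025-01-01 00:00:00", "2025-01-01 01:00:00"])
good = "<forecast>\n(2025-01-01 00:00:00,1.5)\n(2025-01-01 01:00:00,-2)\n</forecast>"
bad = "<forecast>\n\n\n</forecast>"  # runs of bare newlines, as the looser pattern allowed
```

The stricter pattern accepts `good` but rejects `bad`, which is the behavior change discussed below.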
Collaborator:

Is there a need to replace the original function? This might break reproducibility.

Collaborator (Author):

The original function allowed for returns such as \n \n \n \n \n \n \n ..., so I fixed it to this when working with Hymba. I think it would only break reproducibility if this code somehow feeds into controlling the RNG for the LLM or for the tasks, right?

Collaborator (Author):

@marcotet what are your thoughts?

Collaborator:

Hmm, all results of our paper were with the previous code so I guess it was fine - we didn't get any errors.

If you think it's better maybe good to reproduce a model's results on CiK to confirm it doesn't break reproducibility. What do you think?

Collaborator:

Does mamba need this change?

@@ -19,5 +19,6 @@ termcolor
tenacity
h5py
transformers>4.4.1
tokenizers
Collaborator:

Was this necessary for mamba or something?

Collaborator (Author):

Yeah, I can remove it though since I didn't add the mamba models

Collaborator:

Yeah, that'd be great cause it might break the already-fragile requirements otherwise :D Unless you tested that it didn't :)

ashok-arjun (Collaborator) commented Feb 14, 2025:

By the way, I'm testing if the code works for other base models. Will get back on that.
Update: It works!! Yay!
