Skip to content

Conversation

@mohsen1
Copy link
Owner

@mohsen1 mohsen1 commented Jan 25, 2025

No description provided.

Copy link
Contributor

@greptile-apps greptile-apps bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR Summary

This PR significantly refactors the repository's core functionality, removing chunking in favor of single-file output and adding token-based processing.

  • Critical bug in src/parallel.rs: Both token mode and byte mode truncation logic have incorrect conditions that would prevent any files from being processed (if total_tokens_needed > 0 and if header_size + content_size > 0)
  • Removal of test_chunk_order.rs without adequate replacement tests for file ordering validation in the new single-file output system
  • Shell scripts in root directory (.sh) are now ignored in .gitignore, which could affect critical installation scripts
  • Significant behavioral change in test_normalize_path.rs where paths outside base directory now only return filename instead of full path

28 file(s) reviewed, 36 comment(s)
Edit PR Review Bot Settings | Greptile

Comment on lines +47 to +48
tokio = { version = "1.0", features = ["rt", "time", "macros"] }
env_logger = "0.11"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

style: tokio is added as a dev dependency but its purpose isn't clear from the changes shown. Check if async testing is actually needed

README.md Outdated
```

> [!NOTE]
> When max-size is reached, `yek` will throw away all of the less important files and generate
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

logic: This line appears to be cut off mid-sentence. Complete the explanation of what happens when max-size is reached.

README.md Outdated
Comment on lines 168 to 169
# Tokenizer model for token counting (defaults to 'deepseek-reasoner')
tokens = "deepseek"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

logic: The default model is stated as 'deepseek-reasoner' here but 'openai' is shown as default in the CLI help (line 141). This inconsistency needs to be resolved.

Comment on lines +24 to +33
// Verify all models are listed
let models = [
"openai", // OpenAI models
"claude", // Anthropic Claude models
"mistral", // Mistral models
"mixtral", // Mixtral models
"deepseek", // DeepSeek models
"llama", // Meta Llama models
"codellama", // CodeLlama models
];
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

style: Models array should be imported from model_manager.rs SUPPORTED_MODEL_FAMILIES to avoid duplication and ensure consistency

Comment on lines +8 to +9
"^.github/.*",
"^.cargo/.*",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

logic: Redundant ignore pattern - .github/ is already covered by the pattern on line 13

@mohsen1 mohsen1 force-pushed the refactor-input-handling-and-config branch from 1b27189 to f10d33e Compare January 25, 2025 17:32
@mohsen1 mohsen1 closed this Jan 26, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants