Profanity filter for Nova #771
Replies: 3 comments 6 replies
-
Thanks for asking your question about Deepgram! If you didn't already include it in your post, please be sure to add as much detail as possible so we can assist you efficiently, such as:
-
@robfig I can pass this need on to our product team for feedback.
-
@robfig, thank you for sharing this feedback, and I apologize that you and your customer were put in this situation. As you noted, Deepgram only supports profanity filtering for our Base model, which we wouldn't recommend because Nova-2 is newer and more accurate. Our models are trained on very large volumes of data, which may include instances of profane language. We don't apply any cleaning to our training data or models that would prevent certain words from being predicted. For other customers who have wanted to implement their own profanity post-processing, we have compiled a list of 1,200 terms covering a range of English profanity (cursing, offensive language, etc.). Let me know if you would be interested in using this list in your own post-processing, and I can share it with you privately. Alternatively, you could censor a shorter list of terms that you never want displayed in captions. Please be assured that I've also raised this broader product question internally for further discussion.
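For anyone who wants to try the post-processing route, here is a minimal sketch of what that could look like on the client side. It assumes you already have the caption text as a plain string and your own list of blocked terms. The names `BLOCKED_TERMS`, `build_pattern`, and `censor` are hypothetical and not part of any Deepgram SDK, and the placeholder terms are not the list mentioned above.

```python
import re

# Hypothetical placeholder list; replace with the terms you want hidden.
BLOCKED_TERMS = ["darn", "heck", "son of a gun"]


def build_pattern(terms):
    """Compile one case-insensitive regex that matches any blocked term as a whole word."""
    cleaned = {t.strip().lower() for t in terms if t.strip()}
    if not cleaned:
        # Empty list: return a pattern that can never match.
        return re.compile(r"(?!)")
    # Longest terms first so multi-word phrases win over shorter overlaps.
    ordered = sorted(cleaned, key=len, reverse=True)
    alternation = "|".join(re.escape(t) for t in ordered)
    # Lookarounds keep "darn" from matching inside "darned".
    return re.compile(r"(?<!\w)(?:" + alternation + r")(?!\w)", re.IGNORECASE)


def censor(text, pattern, mask="*"):
    """Replace every non-space character of each matched term with the mask."""
    def _mask(match):
        return "".join(c if c.isspace() else mask for c in match.group(0))
    return pattern.sub(_mask, text)


if __name__ == "__main__":
    pattern = build_pattern(BLOCKED_TERMS)
    print(censor("Well, darn it. Heck!", pattern))
```

Compiling the pattern once and reusing it for each caption keeps the per-caption cost low. Whole-word matching is a design choice here: it avoids masking innocent words that merely contain a blocked term, at the cost of missing inflected forms unless you list them explicitly.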
-
We use Deepgram for Closed Captioning in meetings. I got feedback from a customer today:
I see that Deepgram supports a Profanity filter, but that only applies to Base models. We use Nova-2. Do you have any plans to add support, or is there an alternative approach for handling this that you'd recommend?
Thanks for your help
Rob