template-scala-parallel-multimodal-recommendation

This recommender is designed to take account of a wide range of user behavior, item content, and contextual information to make real time recommendations. It contains highly flexible mechanisms for dealing with events that incorporate any important part of the user's entire click-stream. It can also mix content based recommendations in several ways to augment collaborative filtering and to account for important context.

It is provided as a PredicitonIO template for easy installation and application integration.

This engine is designed to preform the following functions:

Collaborative filtering and content based recommendations
Serve recommendations in real time from usage data gathered in real time
Based on a "cooccurrence" type recommender
Creates predictive models from any number of user interaction types—full click-stream recommendations
Uses real time context to affect recommendations
Creates predictive models from content and metadata that deliver personalized recommendations
Is responsive as new models are recalculated in the background
Creates personalized, similar item, and shopping cart type recommendations
Can mix usage, content, context, and metadata in one query for extremely flexible recs targeting
Based on highly scalable fault-tolerant technology
Queries allow biasing or filtering recs by any metadata field—get recs filtered or slanted towards categories, tags, subject, or any item attribute.

Near future

Streaming input through Kafka
Log parsing to extract user click-stream data—often replacing SDK integration for user interaction data
Automatically calculates "trending" items to help solve the cold start problem.

##Architecture

We follow the lambda architectural model as embodies in the DASE archetecture and tool set. Streaming input comes through the PreditionIO SDKs (or optionally files), which is turned immediately into user history used to make near real time recommendations. The delay in creating this usable user history is minimal--on the order of a seconds. Recommendations are returned in real time. The predictive models are calculated in the background based on all collected data and is updated as often as requested with no server downtime.

The actual components of the architecture are chosen for their speed, reliability, scalability, and high-level of support. They are the at the forefront in their class:

Spark-Mahout: provides needed linear algebra calculations for creating the predictive models
Spark: used for streaming input and batch calculations. Keeping the batch and streaming code automatically in sync.
PrecitionIO: Used to scaffold the recommender with data ingestion, model calculation infrastructure and a real time recs server.
Elasticsearch: Used as the core of the real time query engine.

##References

A slide deck, which talks about mixing actions or other indicators: A Multimodal Streaming Recommender
A free ebook, which talks about the general idea: Practical Machine Learning
Two blog posts: What's New in Recommenders: part #1 and What's New in Recommenders: part #2
A post describing the loglikelihood ratio: Surprise and Coinsidense LLR is used to reduce noise in the data while keeping the calculations O(n) complexity.
A Guide to Online Video site that demonstrates the use of many of the above techniques

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
project		project
src/main/scala		src/main/scala
.gitignore		.gitignore
README.md		README.md
build.sbt		build.sbt
engine.json		engine.json
template.json		template.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

template-scala-parallel-multimodal-recommendation

About

Uh oh!

Releases

Packages

Languages

pferrel/defunct-template-scala-parallel-universal-recommendation

Folders and files

Latest commit

History

Repository files navigation

template-scala-parallel-multimodal-recommendation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages