GitHub - Pa123313/opentelemetry-demo: OpenTelemetry demo project for observability and tracing setup

🌌 Cloud-Native Observability: OpenTelemetry Microservices Demo

A comprehensive implementation of OpenTelemetry (OTel) across a distributed microservices architecture. This project showcases the ability to instrument, collect, and visualize the "Three Pillars of Observability" (Metrics, Logs, and Traces) in a complex, real-world environment.

🚀 Project Overview

This project utilizes the Astronomy Shop microservice suite to demonstrate how to achieve full-stack visibility. By integrating Prometheus, Grafana, and Locust, I’ve established a proactive monitoring system capable of detecting latency spikes and resource exhaustion.

🛠️ Observability & SRE Stack

Instrumentation: OpenTelemetry SDKs (Java, Python, Go, Node.js).

Telemetry Collection: OpenTelemetry Collector (Contrib) for processing and exporting data.

Metrics & Storage: Prometheus for time-series data.

Visualization: Grafana (Custom Dashboards for Span Metrics and Service Health).

Load Testing: Locust (Distributed load generation to simulate real-user traffic).

Infrastructure: Kubernetes / Docker-Compose.

📊 Performance Analysis & Insights

Real-Time Distributed Tracing & Span Metrics I configured custom Grafana dashboards to monitor Service Latency (P95) and Mean Rate across 10+ microservices. This allows for immediate identification of "hot paths" in the distributed system.

OTel Collector Health & Throughput Monitoring the OpenTelemetry Collector itself to ensure zero data loss. This dashboard tracks spans/sec and logs/sec being processed through the pipeline.

Automated Load Testing Using Locust, I simulated concurrent user behavior to stress-test the /api/checkout and /api/cart endpoints, capturing failure rates and response times under load.

Resource Utilization (SRE View) Monitoring CPU and Memory consumption per service (e.g., product-catalog, frontend) to establish performance baselines and right-size Kubernetes resource limits.

💡 Key Accomplishments

Bottleneck Identification: Used Span Metrics to identify that the accounting service had a P95 latency of 15s during peak load.

Unified Pipeline: Successfully routed logs and metrics through a single OTel Collector, reducing agent overhead on the infrastructure.

Stress Resilience: Verified system stability up to 50+ RPS with a 78% success rate on high-complexity transaction paths.

🛠️ How to Deploy

Clone the Repo: git clone

Launch Stack: docker-compose up -d (or kubectl apply -f ./k8s)

Access Dashboards:

Grafana: http://localhost:8080/grafana

Locust: http://localhost:8080/loadgen

Name		Name	Last commit message	Last commit date
Latest commit History 1,719 Commits
.github		.github
internal/tools		internal/tools
pb		pb
src		src
telemetry-schema		telemetry-schema
test		test
.dockerignore		.dockerignore
.env		.env
.env.override		.env.override
.gitattributes		.gitattributes
.gitignore		.gitignore
.licenserc.json		.licenserc.json
.lychee.toml		.lychee.toml
.markdownlint.yaml		.markdownlint.yaml
.yamlignore		.yamlignore
.yamllint		.yamllint
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
buildkitd.toml		buildkitd.toml
docker-compose-tests.yml		docker-compose-tests.yml
docker-compose-tests_include-override.yml		docker-compose-tests_include-override.yml
docker-compose.minimal.yml		docker-compose.minimal.yml
docker-compose.yml		docker-compose.yml
docker-gen-proto.sh		docker-gen-proto.sh
ide-gen-proto.sh		ide-gen-proto.sh
otel-config.yml		otel-config.yml
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages