Skip to content
View WangHewei16's full-sized avatar
  • Carnegie Mellon University
  • Pittsburgh, United States
  • LinkedIn in/stephenw624

Block or report WangHewei16

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
WangHewei16/README.md

Hi! I am Stephen Hewei Wang👋, a master's student at Carnegie Mellon University, School of Computer Science (CMU SCS). I have previously interned at AppleVision Pro as Machine Learning Research Engineer. My interests lie in AR/VR, LLMs, VLM, computer vision, machine learning, and foundation models. I graduated from University College Dublin (UCD) in Ireland, earning a Bachelor’s in Software Engineering with Cum Laude. During UCD, I took on a role as an ML/CV research intern in THEIA lab and collaborated with Nanyang Technological University (NTU), supervised by Assoc Prof Yee Hui Lee and Dr. Soumyabrata Dev. My endeavors have culminated in 10+ works with 200+ citations published and delivered in esteemed conferences, journals, and workshops including CVPR, AAAI, CIKM, BMVC, IEEE, ACM, SCI, and Elsevier.

🔍 Linkedin | Google Scholar | ResearchGate

Github stats

Pinned Loading

  1. DMCNet-for-Video-Engagement-Understanding DMCNet-for-Video-Engagement-Understanding Public

    [Elsevier SOCL'22] Investigate in ML/DL-ensembled models and visualize features by dimension reduction techniques like PCA and t-SNE, measure performance via multiple metrics (e.g., Gini Index, AGF…

    Jupyter Notebook 22 2

  2. AMDCNet-for-Stereo-Matching AMDCNet-for-Stereo-Matching Public

    [Elsevier Displays'22] Propose visual sensitivity regularization in cost calculation stage and multi-directional aggregation template, optimize parallax through left-right consistency detection.

    C++ 20 6

  3. 3D-VistaNet-Purificatory-LoFTR-and-DKM-with-Test-Time-Augmentation-for-Scalable-Image-Matching 3D-VistaNet-Purificatory-LoFTR-and-DKM-with-Test-Time-Augmentation-for-Scalable-Image-Matching Public

    [CVPR'22 Image Matching Workshop (IMW) Top6%] Propose a model registering two images from diverse viewpoints with coarse-to-fine attention, use LoFTR for local feature matching and DKM for regressi…

    Jupyter Notebook 15 3

  4. DAANet-for-Salient-Object-Detection DAANet-for-Salient-Object-Detection Public

    Forked from Att100/DAANet

    [ROBIO'23] Propose a dual attention aggregating network, which adopt an ImageNet pre-trained backbone as an encoder and use FPN architecture with DAAM we introduced.

    Jupyter Notebook

  5. MENTOR MENTOR Public

    Forked from Jinfeng-Xu/MENTOR

    [AAAI'25] MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation

    Python

  6. UCloudNet-for-Cloud-Segmentation UCloudNet-for-Cloud-Segmentation Public

    Forked from Att100/UCloudNet

    [IGARSS'24] Propose a model named UCloudNet where we introduce a residual U-Net with deep supervision for cloud segmentation.

    Jupyter Notebook