Md Mubtasim Ahasan
  • Paper Reviews on Energy-Based Models for Learning and Reasoning

    A review of two works that explore energy-based modeling for scalable learning and iterative reasoning.

    12 min read   ·   July 13, 2025

    2025   ·   paper-review

  • Paper Reviews on Vector Quantization and Discrete Representation Learning

    A review of two key works on discrete representation learning and neural audio compression: VQ-VAE and EnCodec.

    6 min read   ·   March 11, 2025

    2025   ·   paper-review

  • Paper Reviews on Advances in Audio-Video Generation (VATT, AV-Link, Frieren)

    A review of three recent works on controllable video-to-audio generation, unified cross-modal diffusion, and flow-based approaches to audio synthesis from video.

    12 min read   ·   February 23, 2025

    2025   ·   paper-review

  • Essential Commands for Docker, Hugging Face CLI, Git, and Vim

    A practical guide with frequently used commands across Docker, Hugging Face CLI, Git, and Vim.

    5 min read   ·   November 04, 2024

    2024   ·   code

  • Paper Review - Moshi: A Speech-Text Foundation Model for Real-Time Dialogue

    6 min read   ·   October 20, 2024

    2024   ·   paper-review

© Copyright 2025 Md Mubtasim Ahasan. Last updated: September 06, 2025.