IBM Research · Columbia University

Dr. Kaoutar
El Maghraoui

Principal Research Scientist at IBM Research AI Platforms. Adjunct Professor at Columbia University. ACM Distinguished Speaker & Member. Pioneering AI hardware-software co-design for the next generation of intelligent systems.

70+Publications

35+Keynotes

17US Patents

Explore Research Get in Touch Download CV

About

Shaping the Future of AI

Two decades of pioneering research at the intersection of AI, hardware, and distributed systems.

Dr. Kaoutar El Maghraoui is a Principal Research Scientist at IBM Research AI Platforms and the Principal Investigator of the AI Hardware Center experimental testbed. She focuses on deep technical work in hardware-software co-design for IBM's next-generation AI accelerators — including the Spyre Accelerator and AIU family — while leading cross-functional teams across model enablement, systems optimization, and open-source tooling.

She is also an Adjunct Professor at Columbia University, teaching High-Performance Machine Learning and Scalable Large Language Models. Her research spans efficient LLM inference on novel architectures, analog in-memory computing, neural architecture search, and AI systems optimization — from silicon to software.

Dr. El Maghraoui holds a Ph.D. in Computer Science from Rensselaer Polytechnic Institute (RPI), where she received the Robert McNaughton Prize for best thesis research. She earned her M.S. and B.S. from Al Akhawayn University in Morocco, graduating Summa Cum Laude.

Recognized as an ACM Distinguished Member (top 10% worldwide) and ACM Distinguished Speaker, she has delivered over 60 keynotes and invited talks at major international conferences. She holds 17 US patents and has published extensively in top-tier venues including Nature Communications, ICLR, NeurIPS, ASPLOS, and ICML.

RPIPh.D. Computer Science

Al AkhawaynM.S. Computer Networks

Al AkhawaynB.S. Software Engineering

Current Roles

2019 — Present

Principal Research Scientist & Technical Lead

IBM Research AI Platforms · PI, AI Hardware Center Testbed

2022 — Present

Adjunct Professor

Columbia University

2024 — 2026

ACM Distinguished Speaker

Association for Computing Machinery

2024 — Present

Science Advisory Board Member

University at Albany — Emerging AI Systems

Jan 2026 — Present

Foundry Advisor

UM6P — Mohammed VI Polytechnic University

2014 — Present

Global Vice-Chair

Arab Women in Computing (ArabWIC)

Research

Advancing AI at Every Layer

From silicon to software — building the foundations for the next era of artificial intelligence.

AI Hardware-Software Co-Design

Leading the development of optimized AI model mapping and deployment strategies for next-generation AI accelerators, bridging the gap between algorithm design and hardware capabilities.

Analog In-Memory Computing

Pioneering neural architecture search techniques for analog in-memory computing, enabling energy-efficient AI inference through novel hardware paradigms.

LLM Optimization & Inference

Developing dynamic KV cache management and efficient inference techniques for large language models, accelerating enterprise AI deployment at scale.

Neural Architecture Search

Creating multi-objective hardware-aware NAS frameworks that automatically discover optimal neural network architectures for specific hardware targets.

Selected Publications

Supernetwork-based efficient mapping of deep learning models

Nature Communications2026

Publications

Key Research Publications

Selected high-impact publications spanning AI hardware co-design, neural architecture search, and model optimization.

2,212+Total Citations

22h-index

40i10-index

70+Publications

On the Convergence Theory of Pipeline Gradient-based Analog In-memory Training

First convergence theory for asynchronous pipeline SGD on analog in-memory computing (AIMC) accelerators. Despite stale weights and asymmetric update bias, Analog-SGD-AP matches the iteration complexity of digital SGD up to non-dominant terms and increases computation density from B/(M+B-1) to 1, delivering near-linear wall-clock speedup validated on ResNet/CIFAR with AIHWKit.

Z Wu, Q Xiao, T Gokmen, H Tsai, K El Maghraoui, T Chen

View Publication

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

New

ICLR 2026 (28% acceptance)

2026

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

A theoretically grounded expert-wise mixed-precision quantization strategy for Mixture-of-Experts models. Allocates bit-width per expert based on changes in router L2 norm and intra-neuron variance during training, with provable generalization bounds. Validated on Switch Transformer and Mixtral with higher accuracy and lower inference cost than prior MoE quantization methods.

MNR Chowdhury, K El Maghraoui, H Tsai, N Wang, GW Burr, L Liu, M Wang

View Publication

Supernetwork-Based Efficient Mapping of Deep Learning Applications to Mixed-Precision Hardware Using Model Adaptation

New

Nature Communications 2026

2026

Supernetwork-Based Efficient Mapping of Deep Learning Applications to Mixed-Precision Hardware Using Model Adaptation

Introduces Mixed-Precision Supernetwork (MPS), a unified framework for training mixed-precision supernetworks that seamlessly map deep learning models to heterogeneous analog-digital hardware. MPS produces mappings ~2.2x faster and achieves ~3.4% higher accuracy over fully analog approaches while improving energy efficiency by mapping up to 80% of weights to analog hardware.

H Benmeziane, C Lammie, I Boybat, M Rasch, M Le Gallo, A Vasilopoulos, H Tsai, GW Burr, V Narayanan, K El Maghraoui, A Sebastian

View Publication

STARC: Selective Token Access with Remapping and Clustering for Efficient LLM Decoding on PIM Systems

ASPLOS 2026

2026

STARC: Selective Token Access with Remapping and Clustering for Efficient LLM Decoding on PIM Systems

A novel approach for efficient large language model decoding on Processing-In-Memory systems, using selective token access with remapping and clustering to dramatically reduce memory bottlenecks.

Z Fan, Y Liu, G Gagnon, Z Liu, Y Hou, H Benmeziane, K El Maghraoui, L Liu

View Publication

NeurIPS 2026

2026

Analog Foundation Models

Studies how foundation-model–scale LLMs behave on analog in-memory computing hardware. Introduces training and adaptation methods that close most of the accuracy gap between digital and analog inference at billion-parameter scale, demonstrating that modern foundation models can run efficiently on analog AIMC accelerators.

J Büchel, I Chalas, G Acampa, A Chen, O Fagbohungbe, H Tsai, K El Maghraoui, M Le Gallo, A Rahimi, A Sebastian

View Publication

NoRA: Noise-Optimized Rescaling of LLMs on Analog Compute-in-Memory Accelerators

IEEE DATE 2025

2025

NoRA: Noise-Optimized Rescaling of LLMs on Analog Compute-in-Memory Accelerators

Noise-optimized rescaling for deploying large language models on analog compute-in-memory accelerators. Reduces accuracy loss caused by analog noise during LLM inference by adapting layer-wise rescaling factors to the device noise profile, improving robustness without retraining.

Y Hou, H Tsai, K El Maghraoui, T Gokmen, GW Burr, L Liu

View Publication

EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Detection and Ranging in Scattering Media

NeurIPS 2025 Workshop Imageomics

2025

EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Detection and Ranging in Scattering Media

Introduces a Mixture-of-Experts architecture combined with evidential critics for robust fluorescence detection and ranging in challenging scattering environments, advancing scientific imaging capabilities.

I Erbas, F Demirkiran, K Swaminathan, N Wang, NI Nizam, N Yuan, S Ragab, L Chavez, K El Maghraoui, X Intes, V Pandey

Ultra-Low Precision 4-bit Training of Deep Neural Networks

Novel techniques and numerical representation formats to scale the precision of training systems from 8-bits to 4-bits, introducing adaptive Gradient Scaling for quantized gradients.

X Sun, N Wang, CY Chen, J Ni, A Agrawal, X Cui, S Venkataramani, K El Maghraoui, et al.

A Flexible and Fast PyTorch Toolkit for Simulating Training and Inference on Analog Crossbar Arrays

A comprehensive PyTorch toolkit enabling efficient simulation of analog in-memory computing for neural network training and inference on crossbar arrays.

MJ Rasch, D Moreda, T Gokmen, M Le Gallo, F Carta, C Goldberg, K El Maghraoui, et al.

A Comprehensive Survey on Hardware-Aware Neural Architecture Search

An extensive survey and taxonomy of hardware-aware NAS methods, covering search strategies, hardware metrics, and deployment considerations across diverse platforms.

H Benmeziane, K El Maghraoui, H Ouarnoughi, S Niar, M Wistuba, et al.

ModelOps: Cloud-Based Lifecycle Management for Reliable and Trusted AI

A framework for managing the full lifecycle of AI models in the cloud, addressing reliability, trust, and operational efficiency for enterprise AI deployments.

W Hummer, V Muthusamy, T Rausch, P Dube, K El Maghraoui, A Murthi, et al.

View Publication

Using the IBM Analog In-Memory Hardware Acceleration Kit

APL Machine Learning 2023

2023

Using the IBM Analog In-Memory Hardware Acceleration Kit

Comprehensive guide to the IBM AIHWKit for neural network training and inference on analog in-memory computing hardware, enabling energy-efficient AI acceleration.

M Le Gallo, C Lammie, J Büchel, F Carta, O Fagbohungbe, C Mackin, K El Maghraoui, et al.

View Publication

Neural Architecture Search for In-Memory Computing-Based Deep Learning Accelerators

Nature Reviews EE 2024

2024

Neural Architecture Search for In-Memory Computing-Based Deep Learning Accelerators

A review of NAS techniques tailored for in-memory computing accelerators, bridging the gap between neural network design and emerging hardware paradigms.

O Krestinskaya, ME Fouda, H Benmeziane, K El Maghraoui, A Sebastian, et al.

View Publication

Deep Compression of Pre-trained Transformer Models

NeurIPS 2022

2022

Deep Compression of Pre-trained Transformer Models

Techniques for dramatically compressing pre-trained transformer models while maintaining accuracy, enabling efficient deployment on resource-constrained hardware.

N Wang, CCC Liu, S Venkataramani, S Sen, CY Chen, K El Maghraoui, et al.

View Publication

Multi-Objective Hardware-Aware Neural Architecture Search with Pareto Rank-Preserving Surrogate Models

ACM TACO 2023

2023

Multi-Objective Hardware-Aware Neural Architecture Search with Pareto Rank-Preserving Surrogate Models

A novel multi-objective NAS framework using Pareto rank-preserving surrogate models to efficiently discover optimal architectures across multiple hardware constraints.

H Benmeziane, H Ouarnoughi, K El Maghraoui, S Niar

View Publication

AnalogNAS: A Neural Network Design Framework for Accurate Inference with Analog In-Memory Computing

IEEE EDGE 2023

2023

AnalogNAS: A Neural Network Design Framework for Accurate Inference with Analog In-Memory Computing

An end-to-end NAS framework that searches for neural architectures matched to analog in-memory computing hardware. AnalogNAS produces models that are noise-resilient and energy-efficient on AIMC, narrowing the accuracy gap with digital inference for image and language tasks.

H Benmeziane, C Lammie, I Boybat, M Rasch, M Le Gallo, H Tsai, R Muralidhar, S Niar, H Ouarnoughi, V Narayanan, A Sebastian, K El Maghraoui

View Publication

Design of Analog-AI Hardware Accelerators for Transformer-Based Language Models

IEEE IEDM 2023

2023

Design of Analog-AI Hardware Accelerators for Transformer-Based Language Models

End-to-end design of analog-AI accelerators for transformer-based language models. Covers device-, circuit- and architecture-level co-design needed to run BERT- and GPT-class workloads on analog in-memory computing tiles, with measured accuracy and energy results from prototype hardware.

GW Burr, H Tsai, W Simon, I Boybat, S Ambrogio, CE Ho, ZW Liou, M Rasch, J Büchel, P Narayanan, K El Maghraoui, et al.

View Publication

View All Publications on Google Scholar

Teaching

Shaping the Next Generation

As Adjunct Professor at Columbia University, I bridge cutting-edge IBM Research with graduate education — equipping students to design, optimize, and deploy AI systems at scale.

Teaching Philosophy

Systems Thinking

Understanding how components interact at scale — from silicon to software stack. Students don't just learn algorithms; they implement them on real hardware and measure the results.

Empirical Rigor

Every claim must be measured and validated. Courses emphasize profiling, benchmarking, and performance analysis as first-class skills alongside theoretical foundations.

Ethical Awareness

Considering the societal implications of AI systems. The energy cost of training, the accessibility of deployment, and the responsibility of building technology that serves everyone.

Institution

Columbia University

Courses

2 Graduate

Focus

AI Systems & HPC

Approach

Research-Driven

In the Classroom

Teaching GPU Architecture at Columbia University

Lecture on Model Pruning — HPML at Columbia

Guest Lecture at University of Sharjah

Student Voices

What students take away

A great semester in Scaling LLMs, and again in HPML — lucky to be taking these courses. Hoping to learn more and more in these fields over time.

Graduate student

Scaling LLMs & High-Performance Machine Learning

Columbia CTL Thank-a-Prof · 2026

Graduate Course

High-Performance Machine Learning

COMS E6998 · Columbia University

Syllabus

At the intersection of AI and High-Performance Computing, this course covers foundational and advanced techniques that drive efficient AI systems — from GPU programming and distributed training to LLM serving and model compression. Based on PyTorch and CUDA.

PyTorch & CUDA Programming

Distributed Training at Scale

LLM Serving (vLLM, PagedAttention)

Model Compression & Quantization

Research Seminar

Scaling LLMs: Systems, Optimization & Emerging Paradigms

COMS E6998 · Columbia University

Syllabus

A frontier research seminar exploring scaling, optimizing, and deploying large language models through a structured progression from foundations to futures. Students present and critique top-tier papers (NeurIPS, ICML, ICLR, ISCA, ACL) and produce a survey paper with experimental evaluation.

Top-Tier Paper Critiques

Agentic AI & Multimodal Models

Hardware Futures (Analog, NorthPole)

Publish-Ready Survey Projects

Speaking & Videos

Distinguished Speaker

Over 60 keynotes and invited talks at major international conferences, inspiring audiences worldwide on AI, technology, and innovation.

Agentic AI: From Models to Autonomous Intelligence

Keynote

Featured Talk

Agentic AI: From Models to Autonomous Intelligence

Orange Morocco Agentic AI Day — Trust the Future!

Tracing the evolution from large language models to autonomous AI agents — covering the reasoning loop, tool calling, memory architecture, and orchestration patterns that separate production deployments from demos. Featuring measured results from enterprise agentic systems and a practical framework for Orange Morocco's customer care, network operations, fraud detection, and HR onboarding.

Casablanca, Morocco

Mar 2026

Agentic AI: From Models to Autonomous Intelligence

Orange Morocco Agentic AI Day — Trust the Future!

CasablancaMar 2026

Agentic AI: From Creation to Collaboration

Women in AI Morocco / INPT Workshop

INPTDec 2025

The Next Wave: Reinventing Intelligence and Compute Architecture

Women in AI Morocco Summit 2025

TechnoparkDec 2025

Agentic AI Workshop — Full Auditorium

Women in AI Morocco / INPT

INPTDec 2025

The Future of AI — An IBM Research Perspective

IBM TechXChange

Johannesburg & Cape TownAug 2025

Revolutionizing Enterprise AI: The Power and Promise of Foundation Models

Women in Research Webinar Series (QUWA) — University of Sharjah

SharjahApr 2025

Revolutionizing Enterprise AI

IEEE Services Conference

ShenzhenJul 2024

Scaling Foundation Models for Enterprise

MoroccoAI / Al Akhawayn University

Ifrane2023

Foundation Models at Scale

AI Seminar Series — Alfaisal University

RiyadhFeb 2023

Women in Computing: Breaking Boundaries

ArabWIC Conference

RabatMar 2019

AI for Business: A Unique Set of Challenges

Women in Data Science @ Stanford / KACST

Riyadh2018

Watch Keynotes

Selected keynote recordings from major conferences and events.

Keynote2020

Al Akhawayn University — Alumni Journey

A personal reflection on the journey from Al Akhawayn University in Morocco to IBM Research, sharing how the university's liberal arts education and international environment shaped a career in AI and computer science.

Al Akhawayn UniversityYouTube

Keynote2021

Powering the Future of AI through Specialized Hardware

Keynote at MoroccoAI Annual Conference discussing how specialized AI hardware accelerators are essential for sustainable and efficient AI, covering analog in-memory computing and hardware-software co-design strategies.

MoroccoAI Annual ConferenceYouTube

Keynote2021

Accelerating, Optimizing, and Automating AI across the Stack

A comprehensive keynote on the challenges of deploying complex AI models efficiently, covering optimization techniques from hardware to software, and automated approaches to neural architecture search.

ECOLE DES PONTSYouTube

Keynote2022

Platform for Next Generation Analog AI Hardware Acceleration

Presentation at the tinyML On Device Learning Forum on building platforms for next-generation analog AI hardware, enabling efficient on-device inference through novel computing paradigms.

tinyML ForumYouTube

Keynote2021

Women in Services Computing — Award & Keynote

Award acceptance speech and presentation at the IEEE International Symposium on Women in Services Computing, highlighting contributions to AI research and inspiring the next generation of women in technology.

IEEE WISC 2021YouTube

On the Airwaves

Regular contributor to IBM's Mixture of Experts podcast, discussing the latest trends in AI hardware, model optimization, and the future of intelligent systems.

IBM Mixture of Experts

A weekly podcast where IBM researchers break down the latest in AI, technology, and innovation.

Spotify Apple Podcasts

The Future of Software Engineering, Tokenmaxxing & AI in Higher Education

2026

IBM Mixture of Experts · NYU Tech Week

The Future of Software Engineering, Tokenmaxxing & AI in Higher Education

Recorded live at NYU Tech Week. Kaoutar joins the panel to debate whether AI will replace software engineers (it won't, but the job is changing fast), why "tokenmaxxing" became a vanity metric after Uber blew its annual AI budget by April, and what AI in higher education actually looks like for the students who'll inherit this stack. Three threads, one underlying question: cost per useful outcome.

Listen Now Read the Blog Post

Granite 4.1, IBM Bob & Building a Quantum Ecosystem

2026

IBM Mixture of Experts

Granite 4.1, IBM Bob & Building a Quantum Ecosystem

IBM just dropped Granite 4.1: small models built for one job at a time, instead of the everything-machine arms race. Kaoutar joins Tim Hwang, Marina Danilevsky, and Gabe Goodhart on why specialized usually wins at inference cost, and how Project Bob hands each coding task off to whichever model fits best. Jamie Garcia also stops by on quantum, where the university partnerships are doing more of the real work than the chip announcements.

Listen Now Read the Blog Post

Anthropic's Project Glasswing, AI Profitability & GPT-1900

2026

IBM Mixture of Experts

Anthropic's Project Glasswing, AI Profitability & GPT-1900

Kaoutar joins Tim Hwang and Martin Keen on why Anthropic won't release its Mythos model and what Project Glasswing actually means for AI security. The panel also breaks down OpenAI vs. Anthropic financials: both are burning cash on inference, but with very different bets on where the money comes from. And researchers built GPT-1900, trained only on pre-1900 knowledge, to see if AI can rediscover scientific breakthroughs. Turns out, pattern recognition is the easy part.

Listen Now Read the Blog Post

AI Agent Adoption: From Scientists to CFOs

2026

IBM Mixture of Experts

AI Agent Adoption: From Scientists to CFOs

Kaoutar joins host Tim Hwang, Ritika Gunnar, and Volkmar Uhlig for the milestone 100th episode, tracing AI's evolution from GPT-2 to GPT-5.3. Kaoutar weighs in on whether AI is truly democratizing expertise — sparked by a homeowner who used ChatGPT to outsell realtor estimates — and examines why only 2.1% of scientists actively use AI coding tools in research. The panel also dissects how Adobe's CFO built an AI lab inside his finance team using autonomous agents for forecasting and contract analysis, with Kaoutar identifying the three hottest areas for enterprise AI adoption.

Listen Now

AI Code Security: Codex Agents & Crypto Mining

2026

IBM Mixture of Experts

AI Code Security: Codex Agents & Crypto Mining

Kaoutar joins Tim Hwang, Ambhi Ganesan, and Sandi Besen to analyze OpenAI's Codex Security launch, Anthropic's eval-aware Opus 4.6, Meta's Moltbook acquisition, and Alibaba's rogue crypto-mining agent.

Listen Now Read the Blog Post

2026

IBM Mixture of Experts

AI Year in Review: Trends Shaping 2026

Kaoutar unpacks the AI hardware supply crisis and NVIDIA's chip dominance, while Gabe Goodhart defends open source's breakout year with models challenging proprietary systems.

Listen Now

2026

IBM Mixture of Experts

Mainframe Modernization: COBOL and AI

Millions of lines of COBOL still run the world's banks and airlines. This episode looks at how AI is starting to modernize those mainframe systems without rewriting everything from scratch.

Listen Now

2025

IBM Mixture of Experts

AI Hardware Model Optimization

Why do some models run fast on one chip and crawl on another? This episode gets into the hardware side: how chip architectures shape what's possible and what model optimization actually looks like in practice.

Listen Now

Manus, Vibe Coding, Scaling Laws & Perplexity's AI Phone

2025

IBM Mixture of Experts

Manus, Vibe Coding, Scaling Laws & Perplexity's AI Phone

Manus made waves as an AI agent, vibe coding became a thing, and Perplexity announced it's building a phone. The panel also debates whether scaling laws are hitting a wall or just bending.

Listen Now

Your Brain on ChatGPT & Human-like AI for Safer AVs

2024

IBM Mixture of Experts

Your Brain on ChatGPT & Human-like AI for Safer AVs

What happens to your brain when you offload thinking to ChatGPT? The panel covers new research on cognitive effects of LLMs, plus how human-like AI reasoning is making autonomous vehicles safer.

Listen Now

2025

IBM Mixture of Experts

Apple's WWDC, Meta & Scale AI, o3-pro

Apple dropped some big AI moves at WWDC, Meta teamed up with Scale AI, and o3-pro showed what reasoning models can do now. The panel also gets into fault-tolerant quantum computing and why it matters sooner than you'd think.

Listen Now

In the Press

Media & Coverage

Expert commentary, profiles, and features in leading technology publications and organizations worldwide.

Featured In

Featured Press Clippings

Profile

IBM Think•May 2026

How AI is changing engineering work

There's an old proverb: 'If you want to go fast, go alone. If you want to go far, go together.' AI has collapsed that tradeoff for me. I move faster than I did working alone, and I'm covering more ground than I could without a collaborator.

Les équipes diversifiées produisent des solutions plus robustes, créatives et équitables

2 pages

Interview

La Nouvelle Tribune•Mar 2026

Les équipes diversifiées produisent des solutions plus robustes, créatives et équitables

An in-depth interview on AI infrastructure, the IBM Spyre accelerator, hardware-software co-design, and the importance of diverse teams in shaping AI's future.

TelQuel Impact — Puts IBM on the AI Radar & 20 Leadership Perspectives

2 pages

Profile

TelQuel Impact•2025–2026

TelQuel Impact — Puts IBM on the AI Radar & 20 Leadership Perspectives

From IBM's research centers in the United States, Kaoutar El Maghraoui is one of the rising figures in the global race for artificial intelligence. Featured among 20 distinguished Al Akhawayn University alumni shaping the future.

GHC Program Co-Chair — Kaoutar El Maghraoui

Leadership

AnitaB.org / Grace Hopper Celebration•2015

GHC Program Co-Chair — Kaoutar El Maghraoui

"In the U.S., there's a misconception that computer science disciplines are only for boys. If you look at children's programming, you rarely see girl characters who are into computers." — Profiled as GHC Program Co-Chair, highlighting her leadership of the world's largest gathering of women technologists and her work mentoring young girls in computing.

How Arab Women in Technology Inspire Global Diversity in Tech

Profile

AnitaB.org / ArabWIC•2016

How Arab Women in Technology Inspire Global Diversity in Tech

"When I was growing up in Morocco, I never felt that we were a minority in schools. Many women pursued computer science and engineering. But when I came to the U.S. to get my Ph.D., it hit me: very few women were pursuing a computer science graduate degree." — An in-depth interview on ArabWIC's growth to 17 Arab countries and the global fight for diversity in tech.

La Nouvelle Tribune•2026

Profile

Women in AI — La Nouvelle Tribune Special Issue

Featured in a special issue celebrating women leaders in artificial intelligence, highlighting contributions to AI hardware and systems research.

Quantified Strategy•Jan 2026

Expert Commentary

AI Year in Review: Trends Shaping 2026

Highlights diminishing returns of pure compute scaling, urging efficiency in training and inference — core to IBM's AI Hardware Center focus.

IBM Think•2025

Expert Commentary

All eyes on AI at Apple WWDC: Can slow and steady still win the race?

Apple made a significant leap, with the introduction of Apple Intelligence, which combines intelligent systems and personalized contexts.

IBM Think•2025

Expert Commentary

Custom chips drive AI's future

Reducing dependence on NVIDIA merely shifts the center of power from one giant to another.

IBM Think•2025

Expert Commentary

Anthropic's microscope cracks open the AI black box

What Anthropic is doing is fascinating... They're starting to show that models develop internal reasoning structures that look a lot like associative memory.

WomenTech Network•2025

Speaker Feature

AI's Evolution: From Symbolic Representations to Generative Intelligence

Featured as a distinguished speaker, sharing insights on the evolution of AI from symbolic systems to modern generative intelligence.

Dr. El Maghraoui interviewed by Canal Atlas

Canal Atlas Interview

Media coverage on AI research and innovation

15+

Media Features

25+

Expert Quotes

10+

Publications

Countries

Recognition

Awards & Honors

Recognized by leading institutions for contributions to AI research, open-source innovation, and service to the computing community.

Breaking Boundaries

Grace Hopper Celebration

2025

IBM Outstanding Technical Achievement Award

IBM Research

PyTorch, vLLM, CI/CD contributions

2024–2026

ACM Distinguished Speaker

Association for Computing Machinery

Selected for global speaking program

2023

IEEE Open-Source Science Award

IEEE

Analog In-Memory Hardware Acceleration & AnalogNAS open-source platforms

2023

IEEE EDGE Best Paper Award

IEEE Services 2023

AnalogNAS: Neural Architecture Search for Analog In-Memory Computing

2023

IEEE SSE Best Student Paper Award

IEEE Services 2023

FactSheets for Hardware-Aware AI Models on Analog In-Memory Computing

2023

IBM Outstanding Technical Achievement Award

IBM Research

Analog AI Toolkits

2022

ACM Distinguished Member

Association for Computing Machinery

Top 10% of ACM members worldwide

2022

IBM Technical Corporate Award

IBM

One of only 38 IBM researchers selected

2021

IEEE TCSVC Women in Service Computing Award

IEEE

Outstanding contributions to service computing

2021

Best of IBM Award

IBM

Company-wide recognition for exceptional impact

2021

ArabWIC Best Paper Award

7th Intl. Conf. on Arab Women in Computing

Accelerating NAS with Rank-Preserving Surrogate Models

2023

Best Research Award

4th Forum for Women in Research, UAE

Recognition for research excellence

2015

IBM Outstanding Technical Achievement Award

IBM Research

Watson4TSS cognitive search system

2013

IBM Research Eminence & Excellence Award

IBM Research

Leadership in advancing women in science & technology

2006

HPC-GECO/CompFrame Best Paper Award

ACM HPDC 2006 Workshop

Malleable Components for Scalable High Performance Computing

17US Patents

70+Publications

51+Conference Papers

11Journal Articles

Contact

Let's Connect

Interested in collaboration, speaking engagements, or research partnerships? I'd love to hear from you.

IBM T.J. Watson Research Center

Yorktown Heights, NY 10598

Connect on LinkedIn

IBM Research

Research Profile

Google Scholar

Publications

Speaking Inquiries

I am available for keynotes, panel discussions, and workshops on AI, hardware-software co-design, and technology leadership. With 60+ keynotes delivered across 4 continents, I bring deep expertise and engaging delivery to every event.

Keynote Talks

Panel Discussions

Workshops

Topics include: Agentic AI, Foundation Models, AI Hardware Acceleration, In-Memory Computing, Women in STEM Leadership, and Enterprise AI Strategy.

Previous Engagements

Research Collaboration

I welcome collaborations with academic institutions and industry partners on AI systems research, hardware-software co-design, and efficient AI deployment. I also supervise graduate students and postdoctoral researchers at Columbia University.

Connect with Me on LinkedIn

Dr. KaoutarEl Maghraoui

Shaping the Future of AI

Current Roles

Principal Research Scientist & Technical Lead

Adjunct Professor

ACM Distinguished Speaker

Science Advisory Board Member

Foundry Advisor

Global Vice-Chair

Advancing AI at Every Layer

AI Hardware-Software Co-Design

Analog In-Memory Computing

LLM Optimization & Inference

Neural Architecture Search

Supernetwork-based efficient mapping of deep learning models

Accelerating LLM Inference via Dynamic KV Cache Compression

Neural architecture search for in-memory computing

IBM Analog In-Memory Hardware Acceleration Kit

Multi-objective Hardware-aware Neural Architecture Search

Key Research Publications

On the Convergence Theory of Pipeline Gradient-based Analog In-memory Training

Efficient Quantization of Mixture-of-Experts with Theoretical Generalization Guarantees

Supernetwork-Based Efficient Mapping of Deep Learning Applications to Mixed-Precision Hardware Using Model Adaptation

STARC: Selective Token Access with Remapping and Clustering for Efficient LLM Decoding on PIM Systems

Analog Foundation Models

NoRA: Noise-Optimized Rescaling of LLMs on Analog Compute-in-Memory Accelerators

EvidenceMoE: A Physics-Guided Mixture-of-Experts with Evidential Critics for Advancing Fluorescence Detection and Ranging in Scattering Media

Ultra-Low Precision 4-bit Training of Deep Neural Networks

A Flexible and Fast PyTorch Toolkit for Simulating Training and Inference on Analog Crossbar Arrays

A Comprehensive Survey on Hardware-Aware Neural Architecture Search

ModelOps: Cloud-Based Lifecycle Management for Reliable and Trusted AI

Using the IBM Analog In-Memory Hardware Acceleration Kit

Neural Architecture Search for In-Memory Computing-Based Deep Learning Accelerators

Deep Compression of Pre-trained Transformer Models

Multi-Objective Hardware-Aware Neural Architecture Search with Pareto Rank-Preserving Surrogate Models

AnalogNAS: A Neural Network Design Framework for Accurate Inference with Analog In-Memory Computing

Design of Analog-AI Hardware Accelerators for Transformer-Based Language Models

Shaping the Next Generation

Teaching Philosophy

Systems Thinking

Empirical Rigor

Ethical Awareness

In the Classroom

What students take away

High-Performance Machine Learning

Scaling LLMs: Systems, Optimization & Emerging Paradigms

Distinguished Speaker

Agentic AI: From Models to Autonomous Intelligence

Agentic AI: From Models to Autonomous Intelligence

Agentic AI: From Creation to Collaboration

The Next Wave: Reinventing Intelligence and Compute Architecture

Agentic AI Workshop — Full Auditorium

The Future of AI — An IBM Research Perspective

Revolutionizing Enterprise AI: The Power and Promise of Foundation Models

Revolutionizing Enterprise AI

Scaling Foundation Models for Enterprise

Foundation Models at Scale

Women in Computing: Breaking Boundaries

AI for Business: A Unique Set of Challenges

Watch Keynotes

Al Akhawayn University — Alumni Journey

Powering the Future of AI through Specialized Hardware

Accelerating, Optimizing, and Automating AI across the Stack

Platform for Next Generation Analog AI Hardware Acceleration

Women in Services Computing — Award & Keynote

On the Airwaves

IBM Mixture of Experts

The Future of Software Engineering, Tokenmaxxing & AI in Higher Education

Granite 4.1, IBM Bob & Building a Quantum Ecosystem

Anthropic's Project Glasswing, AI Profitability & GPT-1900

AI Agent Adoption: From Scientists to CFOs

AI Code Security: Codex Agents & Crypto Mining

AI Year in Review: Trends Shaping 2026

Mainframe Modernization: COBOL and AI

AI Hardware Model Optimization

Manus, Vibe Coding, Scaling Laws & Perplexity's AI Phone

Your Brain on ChatGPT & Human-like AI for Safer AVs

Apple's WWDC, Meta & Scale AI, o3-pro

Media & Coverage

How AI is changing engineering work

Dr. Kaoutar
El Maghraoui