Skip to main content
Learning Hub

Computer Architecture Topics

Master the fundamentals and dive deep into advanced concepts. Our comprehensive topic library covers everything from basic CPU design to cutting-edge parallel architectures.

26
Topics
12
Categories
31m
Avg Read

CPU

Processor design, microarchitecture, and execution pipelines

3 topics
1 beginner2 intermediate
CPUintermediate
5

Bit Manipulation Mastery

Essential bit manipulation techniques for computer architecture interviews.

25 min
2 prereqs
Learn →

Understanding how multiprocessor systems maintain data consistency across multiple cache levels and cores.

15 min
2 prereqs
Learn →

Essential foundations of computer architecture from instruction sets to pipelined execution, covering ISA design, the classic 5-stage pipeline, and hazard management.

30 min
3 prereqs
Learn →

GPU

Graphics processing, SIMD architectures, and compute shaders

4 topics
2 intermediate2 advanced

Modern GPU microarchitecture, SIMT execution model, and performance optimization for senior-level design.

35 min
3 prereqs
Learn →

Deep dive into graphics processing unit design, SIMD execution, and parallel computing architectures.

20 min
1 prereq
Learn →

GPU sparsity patterns from unstructured CSR to structured 2:4 for modern tensor cores.

25 min
3 prereqs
Learn →

GEMM optimization on GPUs: tiling, memory hierarchy, coalescing, and tensor cores for peak performance.

20 min
3 prereqs
Learn →

Memory

Cache hierarchies, memory systems, and storage technologies

1 topics
1 beginner
Memorybeginner
5

Memory Hierarchy 101

Understand why memory hierarchy exists, from registers to DRAM. Learn about cache levels, locality principles, and bandwidth vs latency trade-offs.

32 min
3 prereqs
Learn →

NoC

Network-on-chip design, routing, and interconnect architectures

2 topics
2 advanced

NoC virtual channels for deadlock-free routing in mesh topologies and many-core systems.

30 min
3 prereqs
Learn →

Understanding the phases of transformer architecture and their NoC traffic patterns, from embedding to self-attention to feedforward layers.

25 min
4 prereqs
Learn →

Parallel

Parallel processing, threading, and synchronization mechanisms

1 topics
1 beginner

Introduction to parallelism fundamentals: data vs task parallelism, SIMD vector operations, throughput-oriented architectures, and essential parallelism metrics.

35 min
3 prereqs
Learn →

Performance

Optimization techniques, benchmarking, and performance analysis

3 topics
1 beginner1 intermediate1 advanced

Essential algorithm patterns: sliding window, prefix sums, monotonic structures, binary search.

45 min
2 prereqs
Learn →

Progressive optimization of matrix multiplication: from naive O(N³) to cache-blocked SIMD implementations.

25 min
2 prereqs
Learn →

Master essential performance metrics including IPC, throughput, latency, and bandwidth. Learn to identify bottlenecks using Amdahl's Law and the Roofline Model.

28 min
3 prereqs
Learn →

Software

Programming languages, frameworks, containers, and system software design

5 topics
2 intermediate3 advanced
Softwareadvanced
4

C++ Coroutines (C++20)

Comprehensive guide to C++20 coroutines: syntax, implementation patterns, and SystemC integration.

30 min
1 prereq
Learn →

Essential C++ techniques, modern features, and performance optimization patterns for systems programming.

25 min
1 prereq
Learn →

Deep dive into implementing STL containers: linked lists, hash tables, red-black trees, and their performance characteristics.

60 min
2 prereqs
Learn →
Softwareintermediate
3

SystemC Fundamentals

Introduction to SystemC for hardware modeling and system-level design.

10 min
1 prereq
Learn →

Comprehensive guide to SystemC TLM-2.0 for high-level system modeling and communication.

15 min
2 prereqs
Learn →

Machine Learning

Core ML architectures, algorithms, and computational approaches for modern AI systems

3 topics
3 advanced

Key-Value cache optimization techniques for transformer inference: compression, retention, and memory efficiency.

35 min
3 prereqs
Learn →
Machine Learningadvanced
4

Model Size Reduction Techniques

Comprehensive overview of quantization, pruning, and compression techniques for deploying large neural networks efficiently.

25 min
2 prereqs
Learn →

Technical comparison of Vision Transformer and Stable Diffusion architectures and their convergence.

45 min
3 prereqs
Learn →

DatacenterArch

Large-scale system design, cluster management, and distributed architectures

2 topics
2 advanced

Complete technical deep-dive into Google TPU Pod's optical circuit switching architecture, 3D torus topology, collective communication optimization, and datacenter-scale AI infrastructure

90 min
4 prereqs
Learn →

Comprehensive comparison of Google TPU Pod optical interconnects with NVIDIA NVSwitch, InfiniBand, Ethernet, and emerging datacenter interconnect technologies for AI infrastructure

75 min
4 prereqs
Learn →

Start with beginner topics and progress to advanced concepts