New techniques efficiently accelerate sparse tensors for massive AI models

"improved performance and energy efficiency of the massive machine-learning models that drive generative AI... skips over zeros, saving computation and memory... finds non-zero values for a wider variety of sparsity patterns, and handles the case where the data doesn't fit in memory... still maintains flexibility and adaptability while being specialized and efficient... divides tensor values into smaller blocks, each with its own simple sparsity pattern, then combines the blocks into even larger levels, remaining simple at each step"

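The block-then-combine idea in the summary above can be illustrated with a small software sketch. This is not the researchers' hardware design. It assumes an "N of M" pattern at each level (keep `n1` of every `m1` values, then `n2` of every `m2` blocks) and picks survivors by magnitude; both choices are assumptions made for illustration. All function names (`keep_top_n`, `hierarchical_prune`, `compress`, `sparse_dot`) are hypothetical.

```python
def keep_top_n(scores, n):
    """Return the indices of the n largest scores (ties go to the lower index)."""
    order = sorted(range(len(scores)), key=lambda i: (-scores[i], i))
    return set(order[:n])


def hierarchical_prune(values, n1, m1, n2, m2):
    """Two-level sketch: keep n1 of every m1 values, then n2 of every m2 blocks.

    The N-of-M rule and magnitude-based selection are illustrative assumptions.
    """
    if len(values) % (m1 * m2) != 0:
        raise ValueError("length must be a multiple of m1 * m2")

    # Level 1: inside each block of m1 values, keep the n1 largest magnitudes.
    blocks = []
    for start in range(0, len(values), m1):
        block = values[start:start + m1]
        keep = keep_top_n([abs(v) for v in block], n1)
        blocks.append([v if i in keep else 0.0 for i, v in enumerate(block)])

    # Level 2: inside each group of m2 blocks, keep the n2 blocks with the
    # largest total magnitude and zero out the rest.
    pruned = []
    for start in range(0, len(blocks), m2):
        group = blocks[start:start + m2]
        keep = keep_top_n([sum(abs(v) for v in b) for b in group], n2)
        for i, b in enumerate(group):
            pruned.extend(b if i in keep else [0.0] * m1)
    return pruned


def compress(values):
    """Store only the non-zero values, each paired with its position."""
    return [(i, v) for i, v in enumerate(values) if v != 0.0]


def sparse_dot(compressed, dense):
    """Dot product that visits only stored non-zeros, skipping every zero."""
    return sum(v * dense[i] for i, v in compressed)


weights = [1.0, -4.0, 2.0, 0.5, 0.1, 0.2, -0.3, 0.05]
pruned = hierarchical_prune(weights, n1=2, m1=4, n2=1, m2=2)
stored = compress(pruned)
result = sparse_dot(stored, [1.0] * 8)
```

Each level applies the same simple rule, so the overall pattern stays easy to describe even as the levels stack. In this example only two of the eight weights are stored, and the dot product performs two multiplications instead of eight.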