Model Merging via Data-Free Covariance Estimation
Marawan Gamal Abdel Hameed, Derek Tam, Pascal Jr Tikeng Notsawo, Colin Raffel, Guillaume Rabusseau
Preprint · 2026
Model MergingCovariance EstimationTransfer Learning
Grokking Finite-Dimensional Algebra
Pascal Jr. Tikeng Notsawo, Guillaume Dumas, Guillaume Rabusseau
Forty-Third International Conference on Machine Learning (ICML 2026) · 2026
GrokkingAlgebraRepresentation LearningGeneralization
Grokking Beyond the Euclidean Norm of Model Parameters
Pascal Jr. Tikeng Notsawo, Guillaume Dumas, Guillaume Rabusseau
Forty-Second International Conference on Machine Learning (ICML) · 2025
GrokkingGeneralizationRegularizationDeep Learning
Lost in Translation: The Algorithmic Gap Between LMs and the Brain
Tommaso Tosato, Pascal Jr. Tikeng Notsawo, Saskia Helbling, Irina Rish, Guillaume Dumas
Workshop on Large Language Models and Cognition, ICML · 2024
Language ModelsNeuroscienceCognitionBrain
Predicting Grokking Long Before it Happens: A look into the loss landscape of models which grok
Pascal Jr. Tikeng Notsawo, Hattie Zhou, Mohammad Pezeshki, Irina Rish, Guillaume Dumas
ICLR 2024 Workshop on Mathematical and Empirical Understanding of Foundation Models · 2023
GrokkingGeneralizationLoss LandscapeFourier Analysis
Adaptive Discrete Communication Bottlenecks with Dynamic Vector Quantization for Heterogeneous Representational Coarseness
Dianbo Liu, Alex Lamb, Xu Ji, Pascal Jr. Tikeng Notsawo, Mike Mozer, Yoshua Bengio, Kenji Kawaguchi
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023) · 2023
Vector QuantizationDiscrete RepresentationsReinforcement LearningCommunication
Stochastic Average Gradient : A Simple Empirical Investigation
Pascal Junior Tikeng Notsawo
IFT6512, Stochastic programming, Université de Montréal · 2023
OptimizationStochastic GradientConvergenceSAG
On the use of linguistic similarities to improve Neural Machine Translation for African Languages
Pascal Jr. Tikeng Notsawo, Brice Nanda, James Assiene
5th Black in AI Workshop @ NeurIPS · 2021
NLPMachine TranslationAfrican LanguagesMultilingualism