News
Entertainment
Science & Technology
Sport
Business & Money
Life
Culture & Art
Hobbies
8 | Follower
Apple Machine Learning Research
11.07.2025
Effectively representing 3D scenes for Multimodal Large Language Models (MLLMs) is crucial yet challenging. Existing approaches commonly…
Large Language Models (LLMs) are increasingly being deployed on edge devices for long-context settings, creating a growing need for fast and…
10.07.2025
This paper was accepted at the Workshop on Reliable and Responsible Foundation Models (RRFMs) Workshop at ICML 2025. Uncertainty…
The adoption of text-to-image diffusion models raises concerns over reliability, drawing scrutiny under the lens of various metrics like…
The success of large language models in text processing has inspired their adaptation to speech modeling. However, since speech is…
Discrete diffusion is a promising framework for modeling and generating discrete data. In this work, we present Target Concrete Score…
Large Language Models (LLMs) are increasingly used in applications requiring long context lengths, but the key-value (KV) cache often…
09.07.2025
Driven by steady progress in deep generative modeling, simulation-based inference (SBI) has emerged as the workhorse for inferring the…
08.07.2025
We design new differentially private algorithms for the problems of adversarial bandits and bandits with expert advice. For adversarial…
Wearable devices record physiological and behavioral signals that can improve health predictions. While foundation models are increasingly…
This paper was presented at the Workshop on Reliable and Responsible Foundation Models at ICML 2025. Large Language Models (LLMs) have…
We design differentially private algorithms for the problem of prediction with expert advice under dynamic regret, also known as tracking…
Large language models (LLMs) have demonstrated impressive performance on several tasks and are increasingly deployed in real-world…
05.07.2025
Large-scale models are routinely trained on a mixture of different data sources. Different data mixtures yield very different downstream…
04.07.2025
State-Space Models (SSMs), and particularly Mamba, have recently emerged as a promising alternative to Transformers. Mamba introduces input…
The recent rapid adoption of large language models (LLMs) highlights the critical need for benchmarking their fairness. Conventional…
People who are blind or have low vision (BLV) may hesitate to travel independently in unfamiliar environments due to uncertainty about the…
03.07.2025
Recent works have shown a surprising result: a small fraction of Large Language Model (LLM) parameter outliers are disproportionately…
02.07.2025
Imitation learning for manipulation has a well-known data scarcity problem. Unlike natural language and 2D computer vision, there is no…
01.07.2025
With the rapid scaling of large language models (LLMs), structured pruning has become a widely used technique to learn efficient, smaller…
28.06.2025
As language models support larger and larger context sizes, evaluating their ability to make effective use of that context becomes…
Precisely evaluating semantic alignment between text prompts and generated videos remains a challenge in Text-to-Video (T2V) Generation…
In recent years, there have been remarkable breakthroughs in image-to-video generation. However, the 3D consistency and camera…
27.06.2025
Egocentric Video Question Answering (QA) requires models to handle long-horizon temporal reasoning, first-person perspectives, and…
With advances in generative AI, there is increasing work towards creating autonomous agents that can manage daily tasks by operating user…
22.06.2025
We present STARFlow, a scalable generative model based on normalizing flows that achieves strong performance in high-resolution image…
Existing paradigms for ensuring AI safety, such as guardrail models and alignment training, often compromise either inference efficiency or…
21.06.2025
We introduce a set of training-free ABX-style discrimination tasks to evaluate how multilingual language models represent language identity…
Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density…
End-to-end (E2E) Automatic Speech Recognition (ASR) models are trained using paired audio-text samples that are expensive to obtain, since…
A widespread strategy for obtaining a language model that performs well in a target domain is to fine-tune it by training it to do…
Uncertainty Quantification (UQ) in Language Models (LMs) is key to improving their safety and reliability. Evaluations often use metrics…
20.06.2025
Accommodating human preferences is essential for creating aligned LLM agents that deliver personalized and effective interactions. Recent…
Recent research demonstrated that training large language models involves memorization of a significant fraction of training data. Such…
We study Variational Rectified Flow Matching, a framework that enhances classic rectified flow matching by modeling multi-modal velocity…
Flow matching models have emerged as a powerful method for generative modeling on domains like images or videos, and even on irregular or…
11.06.2025
Apple researchers are advancing AI and ML through fundamental research, and to support the broader research community and help accelerate…
10.06.2025
With Apple Intelligence, we're integrating powerful generative AI right into the apps and experiences people use every day, all while…
06.06.2025
Chain-of-thought (CoT) reasoning in vision language models (VLMs) is crucial for improving interpretability and trustworthiness…
Recent generations of frontier language models have introduced Large Reasoning Models (LRMs) that generate detailed thinking processes…