News
Entertainment
Science & Technology
Sport
Business & Money
Life
Culture & Art
Hobbies
8 | Follower
Apple Machine Learning Research
08.05.2024
This paper has been accepted at the Data Problems for Foundation Models workshop at ICLR 2024. Large language models are trained on massive…
07.05.2024
Apple is sponsoring the International Conference on Learning Representations (ICLR), which is taking place in person from May 7 to 11 in…
Apple is sponsoring the ACM Human-Computer Interaction Conference (CHI), which is taking place in person from May 11 to May 16, 2024 in…
04.05.2024
Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or…
03.05.2024
Large Language Models (LLMs) with billions of parameters have drastically transformed AI applications. However, their demanding computation…
Rendering scenes observed in a monocular video from novel viewpoints is a chal- lenging problem. For static scenes the community has studied…
We show that large language models (LLMs) can be adapted to be generalizable policies for embodied visual tasks. Our approach, called Large…
In this paper, we introduce a novel approach to automatically assign entity labels to images from existing noisy image-text pairs. The…
Despite their remarkable achievements, modern Large Language Models (LLMs) encounter exorbitant computational and memory footprints…
Contrastive learning typically matches pairs of related views among a number of unrelated negative views. Views can be generated (e.g. by…
This paper was accepted at the How Far Are We from AGI? workshop at ICLR 2024. Vision-Language Models (VLMs) such as GPT-4V have recently…
Instruction-based image editing improves the controllability and flexibility of image manipulation via natural commands without elaborate…
02.05.2024
We investigate the capabilities of transformer models on relational reasoning tasks. In these tasks, models are trained on a set of strings…
30.04.2024
Neural knowledge-to-text generation models often struggle to faithfully generate descriptions for the input facts: they may produce…
Pretrained language models are commonly adapted to comply with human intent and downstream tasks via finetuning. The finetuning process…
Sleep staging is a clinically important task for diagnosing various sleep disorders but remains challenging to deploy at scale because it…
26.04.2024
Contrastive learning has emerged as a transformative method for learning effective visual representations through the alignment of image and…
25.04.2024
Inspired by the advancements in foundation models for language-vision modeling, we explore the utilization of transformers and large-scale…
On-device machine learning (ML) moves computation from the cloud to personal devices, protecting user privacy and enabling intelligent user…
The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of…
On-device machine learning (ML) promises to improve the privacy, responsiveness, and proliferation of new, intelligent user experiences by…
24.04.2024
Adaptive gradient methods, notably Adam, have become indispensable for optimizing neural networks, particularly in conjunction with…
17.04.2024
Preference based Reinforcement Learning (PbRL) has shown great promise in learning from human preference binary feedback on agent's…
Existing vision-language models exhibit strong generalization on a variety of visual domains and tasks. However, such models mainly perform…
Long prompts present a significant challenge for practical LLM-based systems that need to operate with low latency and limited resources. We…
09.04.2024
Keeping large foundation models up to date on latest data is inherently expensive. To avoid the prohibitive costs of constantly retraining…
05.04.2024
*Equal Contributors Contrastive pretraining of image-text foundation models, such as CLIP, demonstrated excellent zero-shot performance and…
03.04.2024
Neural Network Language Models (NNLMs) of Virtual Assistants (VAs) are generally language-, region-, and in some cases, device-dependent…
02.04.2024
Streaming neural network models for fast frame-wise responses to various speech and sensory signals are widely adopted on…
30.03.2024
Apple is sponsoring the International Conference on Acoustics, Speech and Signal Processing (ICASSP), which is taking place in person from…
23.03.2024
We present an architecture for device-directed speech detection that treats the task as a text-generation problem. We use a multi-modal…
21.03.2024
In this work, we discuss building performant Multimodal Large Language Models (MLLMs). In particular, we study the importance of various…
16.03.2024
In the fast-evolving world of natural language processing (NLP), there is a strong demand for generating coherent and controlled text, as…
Recent research has explored [clinical monitoring,](https://pubmed.ncbi.nlm.nih.gov/32706685/) [cardiovascular…
Datasets that pair Knowledge Graphs (KG) and text together (KG-T) can be used to train forward and reverse neural models that generate text…
15.03.2024
Wearable sensors have permeated into people's lives, ushering impactful applications in interactive systems and activity recognition…
While Automatic Speech Recognition (ASR) systems are widely used in many real-world applications, they often do not generalize well to new…
14.03.2024
Human following serves an important human-robotics interaction feature, while real-world scenarios make it challenging particularly for a…
13.03.2024
This paper was accepted at The 5th AAAI Workshop on Privacy-Preserving Artificial Intelligence. Personalized recommendations form an…
12.03.2024
As the repository of publicly available pre-trained vision foundation models (VFMs) — such as CLIP, DINOv2, and SAM — grows, users face…