The selection of the assumed effect size (AES) critically determines the duration of an experiment, and hence its accuracy and efficiency. Traditionally, experimenters determine AES based on domain knowledge. However, this method becomes impractical for online experimentation services managing…
Sellers on online marketplaces such as Amazon.com use a variety of retail and retail media advertising services to improve their brand performance, including awareness, consideration, and revenue. But how can they measure their progress and drive these metrics? For 122,000 brands, we measure Amazon…
The problem of search relevance in the E-commerce domain is a challenging one since it involves understanding the intent of a user’s short nuanced query and matching it with the appropriate products in the catalog. This problem has traditionally been addressed using language models (LMs) and graph…
We propose a personalization framework to adapt compact models to test time environments and improve their speech enhancement performance in noisy and reverberant conditions. The use-cases are when the end-user device encounters only one or a few speakers and noise types that tend to reoccur in the…
The Net Promoter Score (NPS) is often used in customer experience programs for measuring customer loyalty. Increasingly more companies seek to automatically process millions of pieces of customer feedback from social media per month in order to estimate their NPS, leveraging advanced analytics like…
While data selection methods have been studied extensively in active learning, data pruning, and data augmentation settings, there is little evidence for the efficacy of these methods in industry scale settings, particularly in low-resource languages. Our work presents ways of assessing prospective…
Grounding-based vision and language models have been successfully applied to low-level vision tasks, aiming to pre- cisely locate objects referred in captions. The effectiveness of grounding representation learning heavily relies on the scale of the training dataset. Despite being a useful data en-…
Product information in e-commerce is usually localized using machine translation (MT) systems. The Arabic language has rich morphology and dialectal variations, so Arabic MT in e-commerce training requires a larger volume of data from diverse data sources; Given the dynamic nature of e-commerce,…
Nowadays, VPN technology is widely used in cloud and hybrid network communication that makes use of algorithms and tunneling to meet different security requirements. However, existing cloud VPN gateways often lack advanced monitoring capabilities and struggle to identify and resolve network…
The multitude of makeup products available can make it challenging to find the ideal match for desired attributes. An intelligent approach for product discovery is required to enhance the makeup shopping experience to make it more convenient and satisfying. However, enabling accurate and efficient…
Pre-trained language models (PLMs) such as BERT, RoBERTa, and DeBERTa have achieved state-of-the-art performance on various downstream tasks. The enormous sizes of PLMs hinder their deployment in resource-constrained scenarios, e.g., on edge and mobile devices. To address this issue, many model…
Construction auditing is a vital step in designing new sites in fulfillment centers (FCs). The audits include inspection processes for newly launched buildings which ensure that the buildings meet workplace standards. In current practice the process of scheduling construction audits and…
Computer science (CS) is special among STEM subjects: it aims at an industry sector that has the most job growth but has a constant shortage in the workforce; it is a relatively young and burgeoning subject in K-12 education that has a shortage of classroom teachers; and it is one of a very few…
Conversational, multi-turn, text-to-SQL (CoSQL) tasks map natural language utterances in a dialogue to SQL queries. State-of-the-art (SOTA) systems use large, pre-trained and finetuned language models, such as the T5-family, in conjunction with constrained decoding. With multi-tasking (MT) over…
Biodiversity loss and ecosystem degradation are global challenges demanding creative and scalable solutions. Recent increases in data collection coupled with machine learning have the potential to expand landscape monitoring capabilities. We present a computer vision solution to the problem of…
We introduce two models for high precision sound event detection leveraging transfer learning. The sound events we detect include “speech”, “music”, and “chime”. Both models consist of a CNN backbone pre-trained using AudioSet for audio classification. To get high precision detection results, the…
With ever-growing digital adoption in the society and increasing demand for businesses to deliver to customers doorstep, the last mile hop of transportation planning poses unique challenges in emerging geographies with unstructured addresses. One of the crucial inputs to facilitate effective…
Automatic speech recognition (ASR) models are typically designed to operate on a single input data type, e.g. a single or multi-channel audio streamed from a device. This design decision assumes the primary input data source does not change and if an additional (auxiliary) data source is…
Variational Bayesian posterior inference often requires simplifying approximations such as mean-field parametrisation to ensure tractability. However, prior work has associated the variational mean-field approximation for Bayesian neural networks with underfitting in the case of small datasets or…
Low-quality listings and bad actor behavior in online retail websites threatens e-commerce business as these result in sub-optimal buying experience and erode customer trust. When a new listing is created, how to tell it has good quality? Is the method effective, fast, and scalable? Previous…
Learn about Amazon's approach to customer-obsessed science. Amazon Science provides the latest news, research and innovation in artificial intelligence (AI) and machine learning (ML), including jobs, publications, conferences and events.