by Jessie Jiao, Yanwei Cui, Michelle Hong, and Gary Lo on 28 JUL 2025 in Amazon Bedrock, Amazon Machine Learning, Artificial Intelligence, Customer Solutions Permalink Comments Share
Modern AI assistants—artificial intelligence systems designed to interact with users through natural language, answer questions, and even perform tasks—face increasingly complex challenges in production environments. Beyond handling basic FAQs, they must now execute meaningful actions, adhere to company policies, implement content filtering, escalate to human operators when needed, and manage follow-up tasks. These requirements demand sophisticated systems capable of handling diverse scenarios while maintaining consistency and compliance. In this post, we explore how we used user and system feedback to continuously improve and optimize our instruction prompts. This feedback-driven approach has enabled us to create more effective prompts that adapt to various subsystems while maintaining high performance across different use cases.
by Tony Wong, Ken Tsui, Edward Tsoi Po Wa, Mickey Yip, Yanwei Cui, and Zhihao Lin on 15 MAY 2025 in Amazon SageMaker, Amazon SageMaker HyperPod, Artificial Intelligence, Customer Solutions, Generative AI Permalink Comments Share
The banking industry has long struggled with the inefficiencies associated with repetitive processes such as information extraction, document review, and auditing. These tasks, which require significant human resources, slow down critical operations such as Know Your Customer (KYC) procedures, loan applications, and credit analysis. As a result, banks face operational challenges, including limited scalability, slow processing speeds, and high costs associated with staff training and turnover. In this post, we present our work and step-by-step code on fine-tuning the Qwen2-VL-7B-Instruct model using LLaMA-Factory on SageMaker HyperPod. Our results demonstrate significant improvements in table structure recognition accuracy and efficiency compared to the original base model and traditional methods, with particular success in handling complex financial tables and multi-page documents.
by Gordon Wang, Yanwei Cui, and Gary Lo | on 26 SEP 2024 | in Amazon SageMaker HyperPod, Best Practices, Generative AI, Technical How-to | Permalink | Comments | Share
In this post, we share an ML infrastructure architecture that uses SageMaker HyperPod to support research team innovation in video generation. We will discuss the advantages and pain points addressed by SageMaker HyperPod, provide a step-by-step setup guide, and demonstrate how to run a video generation algorithm on the cluster.
by Jacky Wu, Yanwei Cui, and Michelle Hong | on 25 JUN 2024 | in Amazon Bedrock, Amazon QuickSight, AWS Step Functions, Intermediate (200), Technical How-to | Permalink | Comments | Share
In this post, we explore how to integrate LLMs into enterprise applications to harness their generative capabilities. We delve into the technical aspects of workflow implementation and provide code samples that you can quickly deploy or modify to suit your specific requirements. Whether you’re a developer seeking to incorporate LLMs into your existing systems or a business owner looking to take advantage of the power of NLP, this post can serve as a quick jumpstart.
by Yanwei Cui, Bin Wang, Michelle Hong, and Gordon Wang | on 06 OCT 2023 | in Advanced (300), Amazon SageMaker, Generative AI, Technical How-to | Permalink | Comments | Share
In this post, we elucidate the simple yet powerful idea of combining user profiles and item attributes to generate personalized content recommendations using LLMs. As demonstrated throughout the post, these models hold immense potential in generating high-quality, context-aware input text, which leads to enhanced recommendations. To illustrate this, we guide you through the process of integrating a feature store (representing user profiles) with an LLM to generate these personalized recommendations.
by Yanwei Cui, Dhawalkumar Patel, Gordon Wang, Melanie Li, Raghu Ramesha, and Sam Edwards | on 06 OCT 2023 | in Advanced (300), Amazon SageMaker, Generative AI, Technical How-to | Permalink | Comments | Share
In this post, we provide an overview of popular multimodality models. We also demonstrate how to deploy these pre-trained models on Amazon SageMaker. Furthermore, we discuss the diverse applications of these models, focusing particularly on several real-world scenarios, such as zero-shot tag and attribution generation for ecommerce and automatic prompt generation from images.
by Gordon Wang, Yanwei Cui, and Melanie Li | on 05 SEP 2023 | in Amazon SageMaker, Amazon SageMaker JumpStart, Artificial Intelligence, Generative AI* | Permalink | Comments | Share
In this post, we introduce a novel method to perform content moderation on image data with multi-modal pre-training and a large language model (LLM). With multi-modal pre-training, we can directly query the image content based on a set of questions of interest and the model will be able to answer these questions. This enables users to chat with the image to confirm if it contains any inappropriate content that violates the organization’s policies. We use the powerful generating capability of LLMs to generate the final decision including safe/unsafe labels and category type. In addition, by designing a prompt, we can make an LLM generate the defined output format, such as JSON format. The designed prompt template allows the LLM to determine if the image violates the moderation policy, identify the category of violation, explain why, and provide the output in a structured JSON format.
by Gordon Wang, Yanwei Cui, Melanie Li, and Guang Yang | on 01 JUN 2023 | in Advanced (300), Amazon SageMaker, Amazon SageMaker Ground Truth, Artificial Intelligence, Technical How-to | Permalink | Comments | Share
The demand for multi-object tracking (MOT) in video analysis has increased significantly in many industries, such as live sports, manufacturing, and traffic monitoring. For example, in live sports, MOT can track soccer players in real time to analyze physical performance such as real-time speed and moving distance. Since its introduction in 2021, ByteTrack remains to […]
by Yanwei Cui and Gordon Wang | on 22 NOV 2022 | in Amazon Forecast, Best Practices, Intermediate (200), Retail, Technical How-to | Permalink | Comments | Share
Amazon Forecast is a fully managed service that uses statistical and machine learning (ML) algorithms to deliver highly accurate time-series forecasts. Recently, based on Amazon Forecast, we helped one of our retail customers achieve accurate demand forecasting, within 8 weeks. The solution improved the manual forecast by an average of 10% in regards to the […]
by Yanwei Cui, Junyi Liu, and Yi-An Chen | on 08 JUL 2022 | in Amazon SageMaker, Artificial Intelligence | Permalink | Comments | Share
Optical character recognition (OCR) is the task of converting printed or handwritten text into machine-encoded text. OCR has been widely used in various scenarios, such as document electronization and identity authentication. Because OCR can greatly reduce the manual effort to register key information and serve as an entry step for understanding large volumes of documents, […]
by Yanwei Cui and Will Badr | on 12 JAN 2022 | in Amazon Machine Learning, Amazon Neptune, Artificial Intelligence | Permalink | Comments | Share
Recommendation systems are one of the most widely adopted machine learning (ML) technologies in real-world applications, ranging from social networks to ecommerce platforms. Users of many online systems rely on recommendation systems to make new friendships, discover new music according to suggested music lists, or even make ecommerce purchase decisions based on the recommended products. […]
by Yanwei Cui | on 10 AUG 2021 | in Amazon Machine Learning, Amazon SageMaker, Artificial Intelligence, Tensorflow on AWS | Permalink | Comments | Share
As more machine learning (ML) workloads go into production, many organizations must bring ML workloads to market quickly and increase productivity in the ML model development lifecycle. However, the ML model development lifecycle is significantly different from an application development lifecycle. This is due in part to the amount of experimentation required before finalizing a […]
by Yanwei Cui and Raghu Ramesha | on 01 OCT 2020 | in Amazon SageMaker, Artificial Intelligence | Permalink | Comments | Share
This blog was reviewed and updated June, 2022 to address latest changes to steps and User Interface on Studio and Okta. In 2019, AWS announced Amazon SageMaker Studio, a unified integrated development environment (IDE) for machine learning (ML) development. You can write code, track experiments, visualize data, and perform debugging and monitoring within a single, […]