AWS Launches New Chips for AI Training and Its Own AI Chatbot

We Keep you Connected

AWS Launches New Chips for AI Training and Its Own AI Chatbot

AWS Launches New Chips for AI Training and Its Own AI Chatbot
Your email has been sent
At AWS re:Invent, NVIDIA contributed GPUs to Amazon’s cloud efforts and added a retriever system to its AI Enterprise Software platform on AWS Marketplace.
Amazon Web Services announced an AI chatbot for enterprise use, new generations of its AI training chips, expanded partnerships and more during AWS re:Invent, held from November 27 to December 1, in Las Vegas.
The focus of AWS CEO Adam Selipsky’s keynote held on day two of the conference was on generative AI and how to enable organizations to train powerful models through cloud services.
Jump to:
AWS announced new generations of its Graviton chips, which are server processors for cloud workloads and Trainium, which provides compute power for AI foundation model training.
Graviton4 (Figure A) has 30% better compute performance, 50% more cores and 75% more memory bandwidth than Graviton3, Selipsky said. The first instance based on Graviton4 will be the R8g Instances for EC2 for memory-intensive workloads, available through AWS.
Trainium2 is coming to Amazon EC2 Trn2 instances, and each instance will be able to scale up to 100,000 Trainium2 chips. That provides the ability to train a 300-billion parameter large language model in weeks, AWS stated in a press release.
Figure A
Anthropic will use Trainium and Amazon’s high-performance machine learning chip Inferentia for its AI models, Selipsky and Dario Amodei, chief executive officer and co-founder of Anthropic, announced. These chips may help Amazon muscle into Microsoft’s space in the AI chip market.
Selipsky made several announcements about Amazon Bedrock, the foundation model building service, during re:Invent:
Amazon launched its own generative AI assistant, Amazon Q, designed for natural language interactions and content generation for work. It can fit into existing identities, roles and permissions in enterprise security permissions.
Amazon Q can be used throughout an organization and can access a wide range of other business software. Amazon is pitching Amazon Q as business-focused and specialized for individual employees who may ask specific questions about their sales or tasks.
Amazon Q is especially suited for developers and IT pros working within AWS CodeCatalyst because it can help troubleshoot errors or network connections. Amazon Q will exist in the AWS management console and documentation within CodeWhisperer, in the serverless computing platform AWS Lambda, or in workplace communication apps like Slack (Figure B).
Figure B
Amazon Q has a feature that allows application developers to update their applications using natural language instructions. This feature of Amazon Q is available in preview in AWS CodeCatalyst today and will soon be coming to supported integrated development environments.
SEE: Data governance is one of the many factors that needs to be considered during generative AI deployment. (TechRepublic)
Many Amazon Q features within other Amazon services and products are available in preview today. For example, contact center administrators can access Amazon Q in Amazon Connect now.
The Amazon S3 Express One Zone, now in general availability, is a new S3 storage class purpose-built for high-performance and low-latency cloud object storage for frequently-accessed data, Selipsky said. It’s designed for workloads that require single-digit millisecond latency such as finance or machine learning. Today, customers move data from S3 to custom caching solutions; with the Amazon S3 Express One Zone, they can choose their own geographical availability zone and bring their frequently accessed data next to their high-performance computing. Selipsky said Amazon S3 Express One Zone can be run with 50% lower access costs than the standard Amazon S3.
On Nov. 27, AWS announced Salesforce’s partnership with Amazon will expand to certain Salesforce CRM products accessed on AWS Marketplace. Specifically, Salesforce’s Data Cloud, Service Cloud, Sales Cloud, Industry Clouds, Tableau, MuleSoft, Platform and Heroku will be available for joint customers of Salesforce and AWS in the U.S. More products are expected to be available, and the geographical availability is expected to be expanded next year.
New options include:
“Salesforce and AWS make it easy for developers to securely access and leverage data and generative AI technologies to drive rapid transformation for their organizations and industries,” Selipsky said in a press release.
Conversely, AWS will be using Salesforce products such as Salesforce Data Cloud more often internally.
ETL can be a cumbersome part of coding with transactional data. Last year, Amazon announced a zero-ETL integration between Amazon Aurora, MySQL and Amazon Redshift.
Today AWS introduced more zero-ETL integrations with Amazon Redshift:
All three are available globally in preview now.
The next thing Amazon wanted to do is make search in transactional data more smooth; many people use Amazon OpenSearch Service for this. In response, Amazon announced DynamoDB zero-ETL with OpenSearch Service is available today.
Plus, in an effort to make data more discoverable in Amazon DataZone, Amazon added a new capability to add business descriptions to data sets using generative AI.
Amazon One Enterprise enables security management for access to physical locations in industries such as hospitality, education or technologies. It’s a fully-managed online service paired with the AWS One palm scanner for biometric authentication administered through the AWS Management Console. Amazon One Enterprise is currently available in preview in the U.S.
NVIDIA announced a new set of GPUs available through AWS, the NVIDIA L4 GPUs, NVIDIA L40S GPUs and NVIDIA H200 GPUs. AWS will be the first cloud provider to bring the H200 chips with NV link to the cloud. Through this link, the GPU and CPU can share memory to speed up processing, NVIDIA CEO Jensen Huang explained during Selipsky’s keynote. Amazon EC2 G6e instances featuring NVIDIA L40S GPUs and Amazon G6 instances powered by L4 GPUs will start to roll out in 2024.
In addition, the NVIDIA DGX Cloud, NVIDIA’s AI building platform, is coming to AWS. An exact date for its availability hasn’t yet been announced.
NVIDIA brought on AWS as a primary partner in Project Ceiba, NVIDIA’s 65 exaflop supercomputer including 16,384 NVIDIA GH200 Superchips.
Another announcement made during re:Invent is the NVIDIA NeMo Retriever, which allows enterprise customers to provide more accurate responses from their multimodal generative AI applications using retrieval-augmented generation.
Specifically, NVIDIA NeMo Retriever is a semantic-retrieval microservice that connects custom LLMs to applications. NVIDIA NeMo Retriever’s embedding models determine the semantic relationships between words. Then, that data is fed into an LLM, which processes and analyzes the textual data. Business customers can connect that LLM to their own data sources and knowledge bases.
NVIDIA NeMo Retriever is available in early access now through the NVIDIA AI Enterprise Software platform wherever it can be accessed through the AWS Marketplace.
Early partners working with NVIDIA on retrieval-augmented generation services include Cadence, Dropbox, SAP and ServiceNow.
Note: TechRepublic is covering AWS re:Invent virtually.
Stay up to date on the latest in technology with Daily Tech Insider. We bring you news on industry-leading companies, products, and people, as well as highlighted articles, downloads, and top resources. You’ll receive primers on hot tech topics that will help you stay ahead of the game. Delivered Weekdays
Stay up to date on the latest in technology with Daily Tech Insider. We bring you news on industry-leading companies, products, and people, as well as highlighted articles, downloads, and top resources. You’ll receive primers on hot tech topics that will help you stay ahead of the game. Delivered Weekdays
AWS Launches New Chips for AI Training and Its Own AI Chatbot
Your email has been sent
TechRepublic Premium content helps you solve your toughest IT issues and jump-start your career or next project.
This is a comprehensive list of the best AI art generators. Explore the advanced technology that transforms imagination into stunning artworks.
Find the perfect payroll service for your business without breaking the bank. Discover the top cheap payroll services, features, pricing and pros and cons.
Is NordVPN worth it? How much does it cost and is it safe to use? Read our NordVPN review to learn about pricing, features, security, and more.
Free project management software provides flexibility for managing projects without paying a cent. Check out our list of the top free project management tools.
Australian and New Zealand enterprises in the public cloud are facing pressure to optimize cloud strategies due to a growth in usage and expected future demand, including for artificial intelligence use cases.
Food stands as a fundamental necessity for human survival. Our bodies rely on the nutrients found in food that enable us to carry out our day-to-day activities and responsibilities. However, in our modern, fast-paced world, the convenience of readily available food is often prioritized, notwithstanding the expense of food safety due to potential contaminants. This …
Recruiting a compliance analyst with the right combination of industry compliance expertise and collaboration skills will require a comprehensive screening process. This hiring kit from TechRepublic Premium provides an adjustable framework your business can use to find the right person for the job. From the hiring kit: DETERMINING FACTORS, DESIRABLE PERSONALITY TRAITS AND SKILLSETS Regulatory …
While modern video games rely on a full array of multimedia to keep players engrossed and engaged, there is little doubt that the visual medium is the driving force for most games. It is the visuals of art, effects, colors, etc., that keep players glued to their displays. This undeniable fact is why the video …
Choosing a vendor to provide cloud-based data warehouse services requires a certain level of due diligence on the part of the purchaser. This cloud data warehouse guide and checklist from TechRepublic Premium will help businesses choose the vendor that best fits its data storage needs based on offered features and key elements. From the guide: …
Get the web’s best business technology news, tutorials, reviews, trends, and analysis—in your inbox. Let’s start with the basics.
* – indicates required fields
Lost your password? Request a new password
Please enter your email adress. You will receive an email message with instructions on how to reset your password.
Check your email for a password reset link. If you didn’t receive an email don’t forgot to check your spam folder, otherwise contact support.
This will help us provide you with customized content.
Thanks for signing up! Keep an eye out for a confirmation email from our team. To ensure any newsletters you subscribed to hit your inbox, make sure to add newsletters@nl.technologyadvice.com to your contacts list.

source

GET THE LATEST UPDATES, OFFERS, INFORMATION & MORE