Meta partners with Microsoft, unveils open-source AI model ‘Llama 2’ for research, commercial use

Meta announced on July 18 that the open-source large language model will be available in the Microsoft Azure AI catalog and through Amazon Web Services, among other providers.

Expanding its efforts in generative AI, Meta, in partnership with Microsoft, announced on July 18 the availability of Llama 2, its open-source large language model. Llama 2 is a family of language models that can be used to power generative AI tools. In a blog post, Meta stated that Llama 2 will be available free of charge for research and commercial use.

Llama 2 can now be accessed in the Microsoft Azure AI catalog, through Amazon Web Services (AWS), and on Hugging Face, among other providers. According to Meta, the language model is also “optimised to run locally on Windows”.

In a research paper titled ‘Llama 2: Open Foundation and Fine-Tuned Chat Models’, the team detailed the methods used to train the Llama 2 models, the techniques used for safety evaluation, and the limitations and responsible use of such models.

How’s it different from Llama? In February this year, Meta released LLaMA (Large Language Model Meta AI) under a non-commercial license focused on “research use cases”. While access to Llama was restricted and granted to academic researchers on a case-by-case basis, Llama 2 is an open-source resource that developers and businesses can use to build their own tools.


On the capabilities of Llama 2: In its research paper, Meta claimed, “These models have demonstrated their competitiveness with existing open-source chat models, as well as competency that is equivalent to some proprietary models on evaluation sets we examined, although they still lag behind other models like GPT-4.”

As AI commentators pit Meta’s language models against OpenAI’s GPT-4, which powers ChatGPT and Microsoft Bing, Jim Fan, an AI scientist at NVIDIA, noted on Twitter, “Llama-2 is NOT yet at GPT-3.5 level, mainly because of its weak coding abilities.” That said, experts, including Fan, have lauded Meta for providing a significant amount of detail on its language models in the paper.

On training data: The lack of transparency about the data used to train large language models remains a point of contention here as well. Abeba Birhane, Senior Research Fellow at Mozilla Trustworthy AI, pointed out on Twitter that Meta’s research paper does not clearly identify the sources that were used to train the models.

MediaNama is the premier source of information and analysis on Technology Policy in India. More about MediaNama, and contact information, here.

© 2008-2021 Mixed Bag Media Pvt. Ltd. Developed By PixelVJ
