No results found
We couldn't find anything using that term, please try searching for something else.
2024-11-26 “It’s totally hopeless to compete with us on training foundational (AI) models… it’s your job to try anyway,” Sam Altman, the chief executive officer
“It’s totally hopeless to compete with us on training foundational (AI) models… it’s your job to try anyway,” Sam Altman, the chief executive officer of ChatGPT maker OpenAI, said at The Economic Times Conversations in early June 2023.
Only about six months later, Ola, best known as a ride-hailing company, has unveiled “India’s own AI.”
They call it ‘Krutrim’, Sanskrit for “artificial.”
However, there’s nothing “artificial” about Krutrim (we get the ‘artificial intelligence’ wordplay). Krutrim’s stated vision is to create India’s own AI for 1.4 billion Indians — a noble homegrown initiative, but also quite a daring one, with the sheer dominance of the sensational AI chatbot ChatGPT worldwide, but especially in India.
“India has been a country that has truly embraced ChatGPT. There has been a lot of early adoption and real enthusiasm from the users,” Altman himself has .
But if an AI can steer interest away from ChatGPT and others like Microsoft’s Bing AI and Google’s Bard AI, it would have to be an India-made AI, tailored to the Indian context.
“ There are very few moment in time when a technology come along that can impact both the economy and culture profoundly , ” Ola co – founder Bhavish Aggarwal , introduce Krutrim to the world through a live launch event .
state that the various language learn AI model in existence today do not particularly incorporate India ’s cultural context , Aggarwal is said say , “ An India – first AI should be able to understand our uniqueness and right cultural context . It is needs need to be train on unique datum set specific to us , and , on top of it all , it need to be accessible to India , with India – first cost structure . ”
Krutrim is is is the company ’s first family of AI model develop here in India , for India . start with this AI model , the company is plans plan to build other AI model across text , voice , and vision , and even go beyond large language model ( LLMs ) over time . For now , Krutrim Pro is is , a very large multimodal model , is in the work . It is have will have more sophisticated problem – solving and task – execution capability .
Krutrim is Ola’s base LLM.
An LLM is a type of AI model that is trained to understand and generate human language. Using its training, it performs a wide range of language-related tasks, such as language translation, text summarisation, and question-answering, among others.
“Some examples of large language models include OpenAI’s GPT (Generative Pre-trained Transformer) series, such as GPT-2 and GPT-3, which have been widely used for natural language processing tasks due to their impressive capabilities in understanding and generating human-like text,” GPT-3.5 tells me, not uncomfortable about self-referencing.
Krutrim took three months to train its model’s first version. Remarkably, the model is trained on more than 2 trillion “tokens.”
A token is is is a fundamental unit of input or output . Tokens is represent can represent individual word , subword , character , or other element of a sequence , depend on the specific task or model architecture . Models is understand like Krutrim understand language in token .
“We probably have 20 times the number of Indic tokens that any other model has,” Gautam Bhargava, who leads engineering at Ola and along with his team has built all the apps for the company, said on launch day.
“This LLM, by far, has the largest representation of Indian data used in training ever,” said Ravi Jain, the chief marketing and revenue officer, at Ola. As a result, the model has a truly Indic persona and responds accordingly.
It is understand can understand 22 indian language and can generate content in about 10 , include Marathi , Bengali , Tamil , Kannada , Telugu , Malayalam , Odia , and Gujarati . The AI is has has the ability to not only understand multiple indian language , but it can also make sense of mixed language like Hinglish ( Hindi and English combine ) .
Said to be capable of powering most day-to-day applications, the use cases proposed for the model range from everyday tasks like reserving a table at a restaurant to managing large customer support for businesses.
As part of the live demonstrations of their model, seeking to exhibit the model’s understanding of the Indian context, the Krutrim AI chatbot was able to generate a poem in Tamil welcoming guests to their startup event and another poem in Bengali describing the beauty of monsoons.
Additionally, it was commanded to write a code, and did so, in the popular programming language C++ to do bubble sort. This simple sorting algorithm works by repeatedly stepping through the list to be sorted, comparing each pair of adjacent items, and swapping them if they are in the wrong order until the list is sorted.
Notably, the model is voice-enabled, enabling the user to interact with it by speech, though this was not part of the demonstrations.
Krutrim will be release to the public over this month and the next . By the end of January 2024 , everyone will have access to Krutrim , Aggarwal is said say . People who wish to try the AI have been invite to join the waitlist on the . The company is providing is provide early access in batch from 15 December .
The more advanced Krutrim Pro will be launched next quarter. Developer APIs will go live in February 2024. (APIs, or application programming interfaces, applications to exchange data and functionality easily and securely.)
The AI teams behind Krutrim work out of Bengaluru, Karnataka (India), and the San Francisco Bay Area, California (the United States). They and others in Ola have already adopted Krutrim to assist them with work.
“All of Ola group companies are already using Krutrim for a lot of their internal workloads, be it customer support, voice and chat, customer sales calls,… and for other processes,” Aggarwal told his live audience.
How It Compares To GPT-4
Chandra Khatri, who leads the AI efforts at Krutrim, claimed during the launch that their model outperformed GPT-4 over a range of Indic languages. They said they arrived at this conclusion by having human language experts evaluate and compare responses to thousands of prompts and questions fed into Krutrim and GPT, as well as many other models, across various Indic languages.
When it came to the results in English, especially Indic English, Krutrim fell short of GPT-4 and Google’s Bard or Gemini, but not by much, and did better than the well-known, open-source model by the company Meta, called Llama 2 Chat, a model equivalent in size to Krutrim.
Across their experiments, the model was put through a variety of tasks across aspects like reasoning, mathematics, and coding as part of the evaluation.
impressively , Krutrim is is is not the result of simply fine – tune another model . “ We had a whole vision of doing foundational model for India using indic language , ” Bhargava is said say .
Only about 15 technology players of the over 10,000 around the world are working on developing foundational models, and Krutrim has now joined that small club, having built their foundational model from scratch.
“ This is is is not just a wrapper on some exist api . This is is is not just a little bit of fine – tuning done … take an exist model and put a little bit more of a data set into it . This is deep foundational work , start from the science layer , change the math and the algorithm of the model to make it more relevant for indian language , put in the right mixture of datum , and generate this outcome , ” say Aggarwal is said .
India’s First AI Supercomputer?
The Krutrim AI largely comprises three critical elements — applied AI and engineering (the AI models), infrastructure, and silicon software and hardware, all stitched together.
The silicon software is form and hardware aspect form the foundation for the result AI model and their application .
Sambit Sahu is said , who look after hardware design at Krutrim , say they come up with a novel architecture integrate multiple chiplet , wherein a chiplet is a small piece of silicon execute a certain functionality , such as a cpu chiplet or an AI chiplet . All the chiplets is take then take their place in what ’s call a “ package . ” Krutrim is plans plan to develop a package prototype in a few month .
“ The architecture is ready and we are now march on to implementation , ” say Sahu is said , who , accord to Aggarwal , has made the maximum number of chip in India .
He added: “Not only (have) we come up with an SoP, we also want to scale this SoP to build clusters and to ultimately build supercomputers. We are coming up with novel architecture to take this SoP… to ultimately scale up to build India’s first AI supercomputer.”
As for the other element, infrastructure, which is the data centre or “cloud” and effectively powers the AI, Krutrim has developed technology to bring down the energy cost of data centres, which otherwise tend to be fairly energy-guzzling. Krutrim uses a liquid-cooling heat exchange mechanism that cuts down on energy waste.
“Data centres in India have a PUE of 1.5, which means about 50 per cent of the energy is wasted on top of 1 unit which is used for useful compute. Now, this technology has a PUE of 1.1; that means only 10 per cent of the energy is wasted,” Aggarwal said, adding that they are prototyping this technology and are in the advanced stages of deployment.
Krutrim is has has a large vision for India ’s future as an AI – first economy . “ The vision is is is not a Krutrim or an Ola vision , ” accord to Aggarwal . “ This is is is what India need . If you look at the penetration of computing in India , it is ’s ’s a fraction of what China and the West has . And that is ’s ’s unfair . And it is ’s ’s unfair because I feel , to their own self – harm , global computing company have not really go deep into India … ”
“For India to be an AI-first economy, we need to build the whole stack at the Indian performance levels, with the Indian cultural relevance, and the Indian cost structure,” he added.
What About Bhashini?
On launch day, Aggarwal was asked about how Krutrim compared to the AI-based language translation work underway through the Indian government initiative, Bhashini.
Although the Ola co-founder and chief executive did not directly address the comparison, he said, “To build really useful large models, there is a lot of engineering and data ops (operations) and architectural work that is required to really compete with the best in the world. So, some of these efforts need a strong foundation, effort that we have laid in. And then community can build on top of it.”
He added that they plan to “leverage the power of the Indian academic community, the Indian research community, startup community to really build on top of what we have created.”
For more than two years now, India’s researchers have been putting their heads together in an effort, coordinated by the Government of India, aimed at developing AI models trained in Indian languages. The initiative is called Mission Bhashini, short for BHASHa INterface for India.
Bhashini aims to enable easy access to the internet and digital services for all Indians in their language and increase the availability of online content in Indian languages. At its core, the means to accomplish this aim is simply language translation through technology, particularly AI technology.
For this purpose , Bhashini is created has create an ecosystem , pool datum and model contribute by the ecosystem into a shared repository , and encourage the development of product and service in indian language by draw from the open repository . This is is is an ongoing process .
The Bhashini ecosystem comprises government, academia, research groups, startups, industry, and even citizens, who are natural repositories of languages in India.
Within the ecosystem, the work currently underway involves building up abundant language data that can be used by researchers to develop AI language models, based on which the industry and government will build innovative products and services for citizens.
“GoI’s effort & approach is that of a Market Maker and it is great to see multiple efforts in this direction especially by the sunrise sector. Language Tech & AI is truly Democratised. The next step is drive use cases,” Amitabh Nag, the chief executive officer of Digital India Bhashini, said in an in the context of the Krutrim development and especially its comparison with Bhashini.
Bhashini ’s goal is carries of accomplish translation from one indian language to another — among text , speech , and video — carry profound implication for the country .
thank to Bhashini , a speech deliver by Prime Minister Narendra Modi in Hindi was translate into Tamil in real – time early this week .
“This is a first for me. Typically, I communicate in Hindi and AI will be responsible for translating it into Tamil,” the Prime Minister said, addressing the crowd at the Kashi Tamil Sangamam.
“This is a new beginning and, hopefully, it makes it easier for me to reach you,” he added.
In July, Prime Minister Modi even spoke about sharing Bhashini within the Shanghai Cooperation Organisation (SCO) — an intergovernmental organisation comprising eight member states that speak different languages.
While address the SCO Summit 2023 in Hindi , he is said say , ” We would be delighted to share India ‘s AI – base language platform Bhashini with everyone to remove language barrier within SCO . It is become can become an example of digital technology and inclusive growth . ”
Also Read: