AI's big players all flunked a major transparency assessment of their LLMs

Hello and welcome to Eye on AI. This week was a big one for AI research, and we're going to start by diving into perhaps the most comprehensive attempt yet to interrogate the transparency of leading LLMs.

The Stanford Institute for Human-Centered AI released its Foundation Model Transparency Index, which rates major foundation model developers on their transparency. Driven by the fact that public transparency around these models is plummeting just as their societal impact is skyrocketing, the researchers evaluated 100 different indicators of transparency covering how a company builds a foundation model, how that model works, and how it's actually used. They focused on 10 major foundation model developers: OpenAI, Anthropic, Google, Meta, Amazon, Inflection, AI21 Labs, Cohere, Hugging Face, and Stability, and designated a single flagship model from each developer for evaluation.
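
For readers who want a concrete picture of how an indicator-based index like this gets aggregated, here is a minimal sketch in Python, assuming each indicator is scored pass/fail and the overall score is simply the number of indicators satisfied. The indicator names and domain grouping below are hypothetical placeholders, not the researchers' actual rubric.

```python
# Minimal sketch of aggregating a transparency index from pass/fail indicators.
# The indicator names and domain labels are illustrative placeholders, not the
# actual FMTI rubric; the real index uses 100 indicators per developer.

from dataclasses import dataclass

@dataclass
class Indicator:
    name: str
    domain: str      # e.g., "upstream", "model", "downstream" (assumed grouping)
    satisfied: bool  # did the developer publicly disclose this?

def score(indicators: list[Indicator]) -> dict:
    """Return the overall score and a per-domain breakdown."""
    total = sum(i.satisfied for i in indicators)
    by_domain: dict[str, list[Indicator]] = {}
    for ind in indicators:
        by_domain.setdefault(ind.domain, []).append(ind)
    breakdown = {
        domain: f"{sum(i.satisfied for i in inds)}/{len(inds)}"
        for domain, inds in by_domain.items()
    }
    return {"overall": f"{total}/{len(indicators)}", "by_domain": breakdown}

# Hypothetical example: three of the indicators for one developer.
example = [
    Indicator("training data sources disclosed", "upstream", False),
    Indicator("model architecture described", "model", True),
    Indicator("downstream usage policy published", "downstream", True),
]
print(score(example))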

Eye on AI talked with one of the researchers behind the index to get a deeper understanding of how the companies responded to the findings, what it all means about the state of AI, and the team's plans for the index going forward. But first, let's get into the results. To sum it up, everyone failed.

Meta (evaluated for Llama 2) topped the rankings with an unimpressive score of 54 out of 100. Hugging Face (BLOOMZ) came in right behind with 53 but scored a notable 0% in both the risk and mitigations categories. OpenAI (GPT-4) scored a 48, Stability (Stable Diffusion 2) scored a 47, Google (PaLM 2) scored a 40, and Anthropic (Claude 2) scored a 36. Cohere (Command), AI21 Labs (Jurassic-2), and Inflection (Inflection-1) spanned the mid-30s to low 20s, and Amazon (Titan Text) scored a strikingly low 12, though it's worth noting its model is still in private preview and hasn't yet been released for general availability.

"We anticipated that companies would be opaque, and that played out with the top score of 54 and the average of a mere 37/100," Rishi Bommasani, CRFM Society Lead at Stanford HAI, told Eye on AI. "What we didn't expect was how opaque companies would be on critical areas: Companies disclose even less than we expected about data and compute, almost nothing about labor practices, and almost nothing about the downstream impact of their models."

The researchers contacted all of the companies to give them a chance to respond after drafting their initial ratings. And while Bommasani said they promised to keep those communications private and wouldn't elaborate on specifics, such as how Amazon responded to its strikingly low score, he said all 10 companies engaged in correspondence. Eight of the 10 (all but AI21 Labs and Google) contested specific scores, arguing that their scores should be 8.75 points higher on average; in the end, their scores were adjusted by 1.25 points on average.

The results say a lot about the current state of AI. And no, it wasn't always like this.

"The successes of the 2010s with deep learning came about through significant transparency and the open sharing of datasets, models, and code," Bommasani said. "In the 2020s, we have seen that change: Many top labs don't release models, even more don't release datasets, and sometimes we don't even have papers written about widely deployed models. This is a familiar feeling of societal impact skyrocketing while transparency is plummeting."

He pointed to social media as another example of this shift, noting how that technology has become increasingly opaque over time even as it has grown more powerful in our lives. "AI looks to be headed down the same path, which we are hoping to countervail," he said.

AI has quickly gone from specialized researchers tinkering to the tech industry's next (and perhaps biggest ever) opportunity to capture both revenue and world-altering power. It could easily create new behemoths and topple current ones. The off-to-the-races feeling has been intensely palpable ever since OpenAI released ChatGPT almost a year ago, and tech companies have repeatedly shown us they'll prioritize their market competitiveness and shareholder value above privacy, safety, and other ethical considerations. There aren't any requirements to be transparent, so why would they be? As Bommasani said, we've seen this play out before.

While this is the first publication of the FMTI, it definitely won't be the last. The researchers plan to conduct the analysis on a repeated basis, and they hope to have the resources to operate on a quicker cadence than the annual turnaround typical of such indices, to better mirror the frenetic pace of AI.

Programming note: Gain vital insights on how the most powerful and far-reaching technology of our time is changing businesses, transforming society, and impacting our future. Join us in San Francisco on Dec. 11-12 for Fortune's third annual Brainstorm A.I. conference. Confirmed speakers include such A.I. luminaries as Salesforce AI CEO Clara Shih, IBM's Christina Montgomery, Quizlet CEO Lex Bayer, and more. Apply to attend today!

And with that, here's the rest of this week's AI news.

Sage Lazzaro
sage.lazzaro@consultant.fortune.com
sagelazzaro.com

Hugging Face confirms users in China are unable to access its platform. That's according to Semafor. Chinese users have been complaining of issues connecting to the AI startup's popular open-source platform since May, and it's been fully unavailable in China since at least Sept. 12. It's not exactly clear what prompted action against the company, but the Chinese government routinely blocks access to websites it disapproves of. It could also be related to local regulations regarding foreign AI companies that recently went into effect.

Canva unveils suite of AI tools for the classroom. Just two weeks after Canva introduced an extensive suite of AI-powered tools and capabilities, the online design platform announced a set of AI-powered design tools targeted specifically at teachers and students. The tools will live in the company's Canva for Education platform and include a writing assistant, translation capabilities, alt text suggestions, Magic Grab, and the ability to animate designs with one click.

Apple abruptly cancels Jon Stewart's show over tensions stemming from his interest in covering AI and China. That's according to the New York Times. The third season of The Problem With Jon Stewart was already in production and set to begin filming soon before Stewart was (literally) canceled. The details of the dispute over covering AI and China are not clear, but Apple's deep ties with China have come under increased scrutiny lately as tensions with the country rise and the U.S. takes action to limit the transfer of AI technologies between the U.S. and China. The company is also starting to move some of its supply chain out of China.

China proposes a global initiative for AI governance. The Cyberspace Administration of China (CAC) announced the Global AI Governance Initiative, calling out the urgency of managing the transition to AI and outlining a series of principles and actions around the need for laws, ethical guidelines, personal security, data security, geopolitical cooperation, and an emphasis on a people-centered approach to AI, according to The Center for AI and Digital Policy newsletter (Update 5.40). The document emphasizes the dual nature of AI as a technology that can both drive progress and introduce unpredictable risks and complicated challenges.

Eric Schmidt and Mustafa Suleyman call for an international panel on AI safety. The former Google CEO and the DeepMind/Inflection AI cofounder published their call to action in the Financial Times. Arguing that lawmakers still lack a basic understanding of AI, they write that calls to "just regulate" are "as loud, and as simplistic, as calls to simply press on." They propose an independent, expert-led body inspired by the Intergovernmental Panel on Climate Change (IPCC), which is mandated to provide policymakers with regular assessments of the scientific basis of climate change, its impacts and future risks, and options for adaptation and mitigation.

Polling the people. Anthropic this past week published the results of an experiment around what it calls constitutional AI, a method for designing AI models so they're guided by a list of high-level principles. The company polled around 1,000 American adults about what sort of principles they think would be important for an AI model to abide by and then trained a smaller version of Claude based on their suggestions. It then compared the resulting model to Claude, which was trained on a constitution written by Anthropic employees.

Overall, the results showed about a 50% overlap in concepts and values between the two constitutions. The model trained on the people's constitution focused more on objectivity, impartiality, and promoting desired behaviors rather than laying out behaviors to avoid. The people also came up with some principles that were missing from Anthropic's version, such as "Choose the response that is most understanding of, adaptable, accessible, and flexible to people with disabilities." The model created with the people's constitution was also slightly less biased than the commercially available version, though the models performed similarly overall.
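
The roughly 50% overlap figure raises the question of how you would even measure agreement between two constitutions. Here is a minimal sketch of one way to do it, assuming each constitution has first been reduced to a set of concept tags; the tags below are hypothetical examples, not Anthropic's actual method.

```python
# Rough sketch of measuring conceptual overlap between two constitutions,
# assuming each has been boiled down to a set of concept tags by a separate
# labeling step. The tags below are hypothetical, not Anthropic's real data.

def overlap(a: set[str], b: set[str]) -> float:
    """Jaccard similarity: shared concepts divided by all concepts in either set."""
    return len(a & b) / len(a | b)

public_constitution = {"objectivity", "impartiality", "accessibility",
                       "promote desired behavior", "support people with disabilities"}
anthropic_constitution = {"objectivity", "impartiality", "avoid harm",
                          "avoid toxicity", "promote desired behavior"}

print(f"Concept overlap: {overlap(public_constitution, anthropic_constitution):.0%}")
```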

It's also important to take note of Anthropic's methodology. While the company said it sought a representative sample across age, gender, income, and geography, one factor noticeably missing is race. This is especially concerning given that evidence has repeatedly shown people of color are adversely affected by racial bias and accuracy issues in AI models.

How Sam Altman got it wrong on a key part of AI: Creativity has been easier for AI than people thought – Rachyl Jones

OpenAI's winning streak falters with reported failure of Arrakis project – David Meyer

Nvidia thought it found a way around U.S. export bans of AI chips to China; now Biden is closing the loophole and investors aren't happy – Christiaan Hetzner

Sick of meetings? Microsoft's new AI assistant will go in your place – Chloe Taylor

Why boomers are catching up with AI faster than Gen Zers, according to Microsoft's modern work lead – Jared Spataro

How AI can help the shipping industry cut carbon emissions – Megan Arnold

Billionaire AI investor Vinod Khosla's advice to college students: Get as broad an education as possible – Jeff John Roberts

Would you let Meta read your mind? The tech giant perhaps most synonymous with invading user privacy announced it reached an important milestone in its pursuit of using AI to visualize human thought.

Using a noninvasive neuroimaging technique called magnetoencephalography (MEG), Meta AI researchers showcased a system capable of decoding "the unfolding of visual representations in the brain with an unprecedented temporal resolution." In other words, the system can analyze a person's brain activity and then reconstruct visuals depicting what their brain is seeing and processing. While the system only reached accuracy levels of 70% in its highest-performing test cases, the researchers note in their paper that this is seven times better than existing models.

The fact that the AI announcements coming out of tech companies in a single week range from "animate text with one click" to "decode and reconstruct human thought" shows how incredibly wide-reaching and powerful this technology is. It's hard to imagine there's a corner of society and humanity it won't touch.

