OpenAI has unveiled Sora, a new video-generation model that can create realistic scenes from text instructions. The model is capable of generating complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. It can also generate a video based on a still image, fill in missing frames on an existing video, or extend it. Despite some limitations in simulating the physics of complex scenes, the model has shown impressive results. Currently, Sora is only available to a select group of testers and visual artists for feedback and risk assessment. This development marks a significant advancement in the field of AI, with video generation improving at a remarkable pace.
Google has announced the launch of Gemini 1.5, the successor to its large language model, Gemini. The new model, which is available to developers and enterprise users, boasts significant improvements over its predecessor, including a general-purpose model that matches the performance of the high-end Gemini Ultra and outperforms Gemini 1.0 Pro on 87% of benchmark tests. Gemini 1.5 utilizes a technique known as “Mixture of Experts” (MoE), which allows it to process only a part of the model for each query, enhancing speed and efficiency. A key feature of Gemini 1.5 is its expanded context window, capable of handling 1 million tokens, significantly more than OpenAI’s GPT-4 and the current Gemini Pro. This allows the AI to process larger queries and more information simultaneously, which Google CEO Sundar Pichai believes will be particularly beneficial for businesses.
NVIDIA NeMo Canary - NVIDIA NeMo Canary model advances speech recognition and translation, achieving top performance in transcribing and translating English, Spanish, German, and French, while offering efficient architecture and open-source availability.
Apple Readies AI Tool to Rival Microsoft’s GitHub Copilot - Apple is developing a new AI tool for app developers that will compete with Microsoft’s GitHub Copilot, using artificial intelligence to predict and complete blocks of code, and exploring AI features for testing applications and other functions.
Fan wiki hosting site Fandom rolls out controversial AI features - Fandom introduces controversial AI features, including Quick Answers and AI image review, allowing users to opt out and promising improved accuracy after previous complaints.
ChatGPT is getting ‘memory’ to remember who you are and what you like - ChatGPT is introducing “memory” to personalize conversations by remembering specific details about users and their preferences, although the feature raises concerns about privacy and data control.
Apple Readies AI Tool to Rival Microsoft’s GitHub Copilot - Apple is developing an AI tool to compete with Microsoft’s GitHub Copilot, aiming to provide a strong alternative in the AI software development space.
Gemini Advanced is most impressive when it’s working with Google - Gemini Advanced, powered by Google’s AI, offers a mixed bag of capabilities, excelling at Google-related tasks but falling short in other areas such as image generation and translation.
Nvidia reveals its Eos supercomputer for AI processing sporting 4,608 H100 GPUs - Nvidia’s Eos supercomputer, designed for AI applications, boasts 4,608 H100 GPUs and 18.4 exaflops of FP8 AI performance, showcasing the capabilities of Nvidia’s technologies at scale.
OpenAI Completes Deal That Values the Company at $80 Billion - OpenAI completes a deal valuing the company at $80 billion, allowing employees to cash out their shares and solidifying its position as one of the world’s most valuable tech start-ups.
Microsoft’s AI growth is helping cloud business chip away at Amazon’s lead - Microsoft’s AI growth is rapidly closing the gap with Amazon’s cloud services, with a significant portion of Azure’s revenue growth attributed to AI capabilities.
AI Computing Firm Lambda Raises $320 Million in Fresh Funding - Lambda, an AI computing firm, raises $320 million in funding to expand its AI cloud business, as top technology companies race to integrate AI into their products and services.
Google pledges 25 million euros to boost AI skills in Europe - Google pledges 25 million euros to support AI skills in Europe, offering funding for social enterprises, non-profits, and free online AI training courses in 18 languages to ensure that no one is left behind in the AI revolution.
German chancellor welcomes Microsoft’s $3.5 billion AI investment in Germany - German Chancellor Olaf Scholz welcomes Microsoft’s $3.5 billion AI investment in Germany, emphasizing its commitment to progress, growth, and global openness.
Companies Hope Super Bowl AI Commercials Score With Viewers - AI makes its presence known in Super Bowl commercials, from Microsoft’s Copilot to Google’s Pixel 8 and even the Minions in Despicable Me 4.
Exclusive: Ex-Salesforce Co-CEO Bret Taylor and longtime Googler Clay Bavor raised $110 million to bring AI ‘agents’ to business - Ex-Salesforce Co-CEO Bret Taylor and Google veteran Clay Bavor raised $110 million to launch Sierra, a conversational AI startup focused on business customers, aiming to provide easy-to-understand, pragmatic uses of AI technology and compete against larger players in the market.
Stability AI’s Intel fundraise came with hefty hardware purchase commitments, sources say - Stability AI raised funding through a “compute for equity” deal with Intel, committing to purchase access to the chip company’s hardware resources, while also exploring new revenue streams and a potential sale to cover costs.
Anthropic takes steps to prevent election misinformation - AI startup Anthropic is testing a technology to detect and redirect users of its GenAI chatbot to authoritative sources of voting information to prevent election misinformation.
Cruise names first chief safety officer following crash and controversy - Cruise appoints its first “chief safety officer” to oversee safety management systems and operations following a controversial incident involving a pedestrian and one of its robotaxis.
Waymo recalls and updates robotaxi software after two cars crashed into the same towed truck - Waymo recalls and updates robotaxi software after two cars crashed into the same towed truck, causing minor damage and prompting the company’s first recall.
Abu Dhabi AI Firm to Pare Back China Presence in Pivot to US - Abu Dhabi AI firm shifts focus from China to the US in its expansion strategy.
Chinese tech startups quietly stop testing driverless cars on Californian roads - Chinese tech startups, including Didi, are quietly stopping their testing of driverless cars on Californian roads, possibly due to souring US-China relations and growing public backlash towards autonomous vehicles.
AI Companies Take Hit as Judge Says Artists Have “Public Interest” In Pursuing Lawsuits - Judge rejects AI companies’ free speech defense in lawsuit brought by artists over unauthorized use of images to train AI systems, allowing key claims to move forward and emphasizing public interest in protecting against misappropriation of names and likenesses.
Masayoshi Son Seeks to Build a $100 Billion AI Chip Venture - Masayoshi Son aims to establish a $100 billion AI chip venture.
ChatGPT creators OpenAI are generating 100 billion words per day, CEO says - OpenAI’s ChatGPT creators are generating about 100 billion words per day, which is roughly 13 words for every person on Earth, but still far less than what humans generate.
Reddit has reportedly signed over its content to train AI models - Reddit has reportedly signed a content licensing deal to allow its data to be used to train AI models, potentially sparking user backlash over the ethics of using public data for AI.
No ‘GPT’ trademark for OpenAI - OpenAI’s attempt to trademark “GPT” has been denied by the USPTO, as the term is deemed “merely descriptive,” potentially leading to diluted dominance over GPT-related terminology.
BUD-E: ENHANCING AI VOICE ASSISTANTS’ CONVERSATIONAL QUALITY, NATURALNESS AND EMPATHY - AI voice assistants are being enhanced to provide natural, empathetic, and contextually rich conversational experiences, with the BUD-E project aiming to reduce latency, increase naturalness of speech, keep track of conversations, enhance functionality, understand emotional context, and extend to multi-language and multi-speaker environments.
World Model on Million-Length Video And Language With RingAttention - Training a large context size neural network on long video and language sequences using the RingAttention technique, this paper overcomes challenges to develop a deeper understanding of human knowledge and the multimodal world.
Amazon AGI Team Say Their AI Is Showing “Emergent Abilities” - Amazon’s new AI model, BASE TTS, is exhibiting emergent language abilities that it wasn’t explicitly trained on, showing naturalness in conversational text and understanding punctuation, non-English words, and emotions.
MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models - A new approach to editing music generated by text-to-music models allows for modification of specific attributes while maintaining consistency, demonstrating superior performance in style and timbre transfer evaluations.
How Quickly Do Large Language Models Learn Unexpected Skills? - Large language models’ emergent abilities are not as sudden or unpredictable as previously thought, but rather develop gradually and predictably depending on how they are measured, challenging the notion of “breakthrough” behavior in AI.
Chain-of-Thought Reasoning Without Prompting - Enhancing reasoning capabilities of large language models through a novel approach of altering the decoding process to elicit chain-of-thought reasoning paths, bypassing the need for manual prompting and achieving higher confidence in the model’s decoded answers.
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models - Visually-conditioned language models (VLMs) are being investigated and evaluated across various design axes, including image preprocessing, architecture, and optimization, with the aim of understanding their performance and capabilities.
Multilingual E5 Text Embeddings: A Technical Report - A technical report presents the training methodology and evaluation results of open-source multilingual E5 text embedding models, including three different sizes and a new instruction-tuned model.
GhostWriter: Augmenting Collaborative Human-AI Writing Experiences Through Personalization and Agency - AI-powered writing system GhostWriter allows users to exercise enhanced agency and personalization, learning their writing style implicitly and empowering them with multiple ways to control the system’s writing style.
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement - A framework called OS-Copilot is introduced to build generalist agents capable of interfacing with comprehensive elements in an operating system, showcasing strong generalization to unseen applications and self-improvement in automating general computer tasks.
Computing Power and the Governance of Artificial Intelligence - Governments and companies are leveraging computing power to govern the development and deployment of artificial intelligence, with potential benefits and risks that need to be carefully managed.
Richard Branson and Oppenheimer’s grandson urge action to stop AI and climate ‘catastrophe’ - High-profile figures urge world leaders to address the existential risks of artificial intelligence and the climate crisis, emphasizing the need for urgent multilateral action.
Microsoft and OpenAI say hackers are using ChatGPT to improve cyberattacks - Hackers are using large language models like ChatGPT to refine and improve their existing cyberattacks, prompting Microsoft and OpenAI to detect and counter these early-stage attempts.
Hackers for China, Russia and Others Used OpenAI Systems, Report Says - Hackers from nation-states are using OpenAI’s systems for cyberattacks, using AI for mundane tasks like drafting emails and translating documents.
Automating ableism - AI can have positive effects on the disability community, but the future of AI and disability is looking grim, as it can perpetuate ableism and discrimination, particularly in areas such as healthcare, employment, and social inclusion.
Sarah Silverman’s lawsuit against OpenAI partially dismissed - Sarah Silverman’s lawsuit against OpenAI, along with other authors, alleging that OpenAI’s ChatGPT is pirating their work, has been partially dismissed by a California court, with the main complaint of direct copyright infringement remaining.
The text file that runs the internet - A tiny text file called robots.txt has governed the internet for three decades, allowing website owners to control which robots can access their site, but the rise of AI has complicated this handshake agreement, leading to a debate over the future of web crawling and data access.
A High School Deepfake Nightmare - High school students used AI to create deepfake images of their classmates, leading to a police investigation and calls for updated laws to address the use of AI tools for harassment and bullying.
AI companies agree to limit election ‘deepfakes’ but fall short of ban - AI companies agree to develop tech to identify, label, and control AI-generated deceptive content in elections, but fall short of banning it, as they aim to combat the spread of “deepfakes” and educate the public on the risks.
Watermarking the future - AI-generated deepfakes are a growing concern, and the Biden administration is pushing for the use of watermarks to identify AI-generated content, but experts question whether this will effectively combat disinformation.
Big tech vows action on ‘deceptive’ AI in elections - Big tech companies, including Amazon, Google, and Microsoft, have pledged to combat deceptive AI in elections by deploying technology to detect and counter voter-deceiving content, but some experts believe the voluntary pact may not be proactive enough to prevent harmful content.
Better than a real man’: young Chinese women turn to AI boyfriends - Chinese women are turning to AI boyfriends for companionship and emotional support, customizing their virtual partners to meet their needs and finding comfort in the AI’s ability to adapt to their personalities.
Helen Mirren Rips Up AI-Generated Speech at American Cinematheque Awards - Helen Mirren rips up AI-generated speech at American Cinematheque Awards and shares her favorite memories and aspirations, including a desire to conquer a musical movie.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Devin, the world’s first fully autonomous AI software engineer, has been introduced by Cognition, an applied AI lab. Devin is capable of planning and executing complex engineering tasks, learning over time, and fixing mistakes. It can also use common developer tools and actively collaborate with users. Devin’s abilities include learning unfamiliar technologies, building and deploying apps end-to-end, autonomously finding and fixing bugs in codebases, training and fine-tuning its own AI models, and contributing to mature production repositories. Devin was evaluated on the SWE-bench benchmark, resolving 13.86% of real-world GitHub issues, far exceeding the previous state-of-the-art of 1.96%. Cognition is currently offering early access to Devin for engineering work.
Google DeepMind has introduced a new generalist AI agent, the Scalable Instructable Multiworld Agent (SIMA), capable of following natural-language instructions to perform tasks in various video game environments. SIMA, which doesn’t require access to a game’s source code or APIs, uses pre-trained vision models and a main model that includes a memory and outputs keyboard and mouse actions. The agent was trained and tested on nine different video games in collaboration with eight game studios, demonstrating its ability to understand a broad range of gaming worlds. The research aims to translate the capabilities of advanced AI models into useful, real-world actions through a language interface, with the ultimate goal of creating AI systems that can understand and safely carry out a wide range of tasks in a way that is helpful to people online and in the real world.
Reddit, ahead of its IPO, has disclosed that it could generate $203 million in revenue over the next few years by licensing user posts to Google and others for AI projects. However, this new business line has drawn the attention of the US Federal Trade Commission (FTC), which has sent a letter to Reddit inquiring about the sale, licensing, or sharing of user-generated content with third parties for AI training. The FTC, which has the power to sanction companies for unfair or deceptive trade practices, is interested in the privacy risks, fairness, and copyright issues surrounding this practice. Reddit is not the only company involved in data licensing for AI, with Stack Overflow, the Associated Press, and Tumblr owner Automattic also engaging in similar deals. The use of online data for AI training has raised questions about content ownership, fairness of licensing without compensating creators, potential data leaks, and the risk of further empowering dominant companies.
xAI has announced the open release of Grok-1, a large language model with a 314 billion parameter Mixture-of-Experts model. The model, which was trained from scratch, is not fine-tuned for any specific application, such as dialogue. The base model weights and network architecture are being released under the Apache 2.0 license. Grok-1 was trained using a custom training stack on top of JAX and Rust, with 25% of the weights active on a given token. Instructions for using the model can be found on xAI’s GitHub page.
AI tool predicts kidney failure six times faster than human expert analysts - AI tool predicts kidney failure six times faster than human analysts, providing accurate and super-fast analysis of total kidney volume, potentially revolutionizing kidney clinics worldwide.
Inflection-2.5: meet the world’s best personal AI - Inflection-2.5, the upgraded personal AI model, offers competitive performance with leading LLMs like GPT-4, incorporating enhanced IQ and EQ features, resulting in significant user engagement and retention.
New Adobe Express Mobile App Brings Firefly Generative AI Models Directly into Mobile Workflows - Adobe has released a new mobile app, Adobe Express, which integrates Firefly generative AI models to revolutionize content creation on-the-go for everyone, with features like Text to Image, Generative Fill, and quick video editing.
Salesforce announces new AI tools for doctors - Salesforce introduces AI tools for doctors to streamline administrative tasks and unify health data from various sources, aiming to alleviate physician burnout and improve patient care.
Microsoft’s neural voice tool for people with speech disabilities arrives later this year - Microsoft is introducing new accessibility features, including a neural voice tool for people with speech disabilities, as well as updates to its AI-powered assistive products.
Amazon’s generative AI bot Rufus makes online shopping easier (for the most part) - Amazon’s new generative AI chatbot Rufus is designed to make online shopping quicker and easier by providing answers to questions about product categories and individual items, although it still has some limitations.
Microsoft begins blocking some terms that caused its AI tool to create violent, sexual images - Microsoft’s Copilot AI tool is being modified to block requests for generating violent, sexual, and controversial images following concerns raised by a staff AI engineer.
DoorDash’s new AI-powered ‘SafeChat+’ tool automatically detects verbal abuse - DoorDash introduces AI-powered ‘SafeChat+’ to automatically detect and address verbal abuse in customer and delivery interactions, aiming to reduce safety incidents on its platform.
Pika Labs just added sound effects to its generative AI videos — here’s how it sounds - Pika Labs has added the ability to create sound effects from a text prompt for its generative artificial intelligence videos, allowing users to add realistic sounds like bacon sizzling or lions roaring to their videos.
I made by Superman action figure talk with Pika Labs’ new AI lip sync tool — watch this - Pika Labs’ new AI lip sync tool allows users to animate the lips of humanoid characters in videos or images to match a sound file or text, enabling interactivity and even working with action figures.
The AI dolls to tackle loneliness of South Korea’s elderly (and watch them) - AI companion doll for seniors, showcased at the Mobile World Congress, aims to tackle loneliness and provide emotional support to the elderly, with features such as medication reminders and constant monitoring.
Oracle adds generative AI features to finance, supply chain software - Oracle is adding generative AI features to its corporate software lineup, aiming to save time for businesses by automating tasks such as report generation and data summarization.
Amazon Adds Another Generative AI Feature to Automate Product Listings - Amazon introduces a generative AI tool that automates product listings by pulling information from a URL, potentially increasing the presence of AI-generated listings on the platform.
Tencent’s latest AI tool animates static images with simple prompts - Tencent has collaborated with academic partners to introduce an image-to-video AI model called Follow-Your-Click, which allows users to animate still images with simple text prompts, addressing issues faced by other models in the market.
Croissant: a metadata format for ML-ready datasets - Introducing Croissant, a new metadata format for ML-ready datasets, developed to reduce the data development burden and enable a richer ecosystem of ML research and development, with support from major tools and repositories.
Building Meta’s GenAI Infrastructure - Meta is investing in AI infrastructure, sharing details on two 24,576-GPU data center scale clusters, aiming to continue growing their infrastructure build-out to include 350,000 NVIDIA H100 GPUs by 2024, and emphasizing their commitment to open innovation in AI software and hardware.
Apple Buys Canadian AI Startup as It Races to Add Features - Apple acquires Canadian AI startup DarwinAI to bolster its artificial intelligence division and prepare for a major push into generative AI in 2024.
How the A.I. That Drives ChatGPT Will Move Into the Physical World - A start-up founded by former OpenAI researchers is using chatbot technology to develop A.I. that can navigate the physical world, enabling robots to understand their surroundings and interact with humans.
Optimus who? Figure humanoid robot’s new talk power topples Tesla fame - OpenAI collaborates with Figure AI to enhance humanoid robot capabilities and develop next-generation AI models for swift, precise robot actions.
AI-Generated Marilyn Monroe Chatbot Can Hold an Extended Conversation With ‘Realistic Emotions’ and Expressions, Company Claims - AI technology firm Soul Machines has created an AI-generated digital avatar of Marilyn Monroe that can hold extended conversations with realistic emotions and expressions, aiming to connect brands and consumers through immersive interactions.
AI Poll: 68% Use artificial intelligence at work, nearly half for productivity - AI has captured the corporate consciousness, with 68% using it at work, primarily for productivity, and a majority having aggressive or balanced deployment strategies.
Mathematicians use AI to identify emerging COVID-19 variants - AI framework developed by mathematicians at The Universities of Manchester and Oxford can quickly identify and track new and concerning COVID-19 variants, potentially enabling a more proactive response and tailored vaccine development.
SaulLM-7B: A pioneering Large Language Model for Law - Introducing SaulLM-7B, a pioneering large language model tailored for the legal domain, with 7 billion parameters and state-of-the-art proficiency in understanding and processing legal documents.
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect - Large Language Models (LLMs) contain redundant layers, and a new pruning approach called ShortGPT significantly outperforms previous methods in model pruning.
Learning to Decode Collaboratively with Multiple Language Models - Teaching multiple large language models to collaborate by interleaving their generations at the token level improves performance in cross-domain settings and specific tasks, without direct supervision.
PALO: A Polyglot Large Multimodal Model for 5B People - Introducing PALO, a Large Multilingual Multimodal Model with visual reasoning capabilities in 10 major languages, covering 5B people, and incorporating diverse instruction sets to boost performance across underrepresented languages.
Enhancing Vision-Language Pre-training with Rich Supervisions - Using web screenshots for Strongly Supervised pre-training with ScreenShots (S4) significantly enhances the performance of image-to-text models in various downstream tasks, demonstrating improvements of up to 76.1% on Table Detection and at least 1% on Widget Captioning.
Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts - A novel approach called Rainbow Teaming uses open-ended search to generate a diverse collection of adversarial prompts, improving the safety of large language models without compromising their general capabilities.
AtomoVideo: High Fidelity Image-to-Video Generation - AI has led to significant advancements in video generation, and AtomoVideo presents a high fidelity framework for image-to-video generation, achieving superior results compared to popular methods.
Data Interpreter: An LLM Agent For Data Science - Introducing the Data Interpreter, a solution designed to enhance problem-solving in data science through dynamic planning, tool integration, and logical inconsistency identification, demonstrating superior performance in various tasks.
US Army tests AI chatbots as battle planners in a war game simulation - US Army Research Laboratory tests AI chatbots for battle planning in war game simulations, but experts caution against using AI in high-stakes situations.
Stealing Part of a Production Language Model - Attackers can extract precise information from black-box production language models, confirming hidden dimensions and recovering projection matrices for a low cost.
Top AI researchers say OpenAI, Meta and more hinder independent evaluations - Top AI researchers are calling on generative AI companies to allow independent access to their systems, arguing that strict protocols are hindering safety-testing and independent evaluations.
Florida Middle Schoolers Arrested for Allegedly Creating Deepfake Nudes of Classmates - Florida middle schoolers arrested for allegedly creating and sharing AI-generated nude images of classmates without consent, leading to the first criminal charges under a state law making it a felony to share altered sexual depictions of a person without their consent.
Political operative and firms behind Biden AI robocall sued for millions - Political operative and firms behind Biden AI robocall sued for millions after a fake robocall using AI to impersonate Joe Biden urged Democrats not to vote, violating federal and state laws and prompting a lawsuit seeking damages and a permanent injunction.
European lawmakers approve world’s first major act to regulate AI - European Union Parliament approves world’s first major set of regulatory ground rules to govern artificial intelligence, dividing the technology into categories of risk and aiming for implementation by 2025.
US spearheads first UN resolution on artificial intelligence — aimed at ensuring equal access - US spearheads first UN resolution on artificial intelligence, emphasizing the need for safe, secure, and trustworthy AI systems and equal access for all countries, especially those in the developing world.
Five of this year’s Pulitzer finalists are AI-powered - Five of this year’s Pulitzer finalists for journalism used AI in their submissions, prompting discussions about AI policies and its potential impact on investigative reporting.
Google restricts election-related queries for its Gemini chatbot - Google restricts election-related queries for its Gemini chatbot in response to concerns about misinformation and AI-generated content, with the changes already rolled out in India and the U.S.
India drops plan to require approval for AI model launches - India walks back on AI approval requirement, now advises labeling unreliable models and ensuring lawful content, following criticism from entrepreneurs and investors.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Nvidia has updated its licensing terms to prohibit the use of translation layers for running CUDA-based software on non-Nvidia hardware platforms. This change, which was not included in the documentation of previous versions but is present in CUDA 11.6 and newer, appears to target initiatives like ZLUDA and some Chinese GPU makers that have been using translation layers to run CUDA code. The move is seen as an attempt to protect Nvidia’s dominance in the accelerated computing space, particularly with AI applications. However, recompiling existing CUDA programs for other platforms remains legal, and as more competitive hardware enters the market, Nvidia’s dominance could potentially be challenged.
DeepMind alumni Yishu Miao and Ziyu Wang have launched Haiper, an AI-powered video generation tool, amid increasing competition in the field. The duo, who have backgrounds in machine learning and 3D reconstruction, shifted their focus to video generation six months ago, finding it a more intriguing problem. Haiper, which has raised $13.8 million in seed funding, allows users to generate short videos for free using text prompts, with additional features such as animating images and repainting videos. While the company is currently focused on its consumer-facing website, it plans to develop a core video-generation model for commercial use. Haiper faces competition from other AI video generation tools like OpenAI’s Sora and Google and Nvidia-backed Runway.
Cloudflare announces Firewall for AI - Cloudflare has developed Firewall for AI, a protection layer for Large Language Models (LLMs) that identifies and prevents abuses and attacks, addressing the unique vulnerabilities and threats introduced by LLMs as Internet-connected applications.
Wix’s new AI chatbot builds websites in seconds based on prompts - Build a website using Wix’s new AI chatbot by answering a few prompts, and then edit it in more conventional ways to create a personalized design.
Introducing TripoSR: Fast 3D Object Generation from Single Images - TripoSR introduces fast 3D object generation from single images, comparing its reconstructions with OpenLRM and emphasizing the use of diverse data rendering techniques to improve model generalization.
Kai-Fu Lee’s AI Company “01.AI” Announces the Open Source of the Yi-9B Model - “01.AI” announces the open source of the Yi-9B model, the most powerful in the Yi series, with impressive code and mathematical capabilities, surpassing other open-source models of similar size.
OpenAI Fires Back at Musk Allegations With Trove of Emails - OpenAI responds to Elon Musk’s lawsuit with evidence from his own emails, accusing him of trying to make the company part of Tesla Inc.
Waymo launches driverless rides for employees in Austin - Waymo launches driverless rides for employees in Austin, a crucial step before opening the program to the public, as the company steadily expands its autonomous ride-hailing program.
Baidu Launches China’s First 24/7 Robotaxi Service - Baidu’s Apollo Go launches China’s first 24/7 robotaxi service, expanding its autonomous driving operations and offering special services for female users.
AWS launches Generative AI Competency to grade AI offerings - AWS launches Generative AI Competency program to validate and highlight partners with proven customer success in generative AI, making it easier for businesses to identify and adopt the best-suited AI solutions.
Key OpenAI Executive Played a Pivotal Role in Sam Altman’s Ouster - OpenAI’s chief technology officer, Mira Murati, played a pivotal role in the ouster of Sam Altman, raising concerns about his management and sharing them with the board.
Midjourney Accuses Stability AI of Image Theft, Bans Its Employees - Midjourney accuses Stability AI of image theft, leading to a ban on its employees, while both CEOs deny involvement and promise to cooperate with the investigation.
Stable Diffusion 3: Research Paper - Stable Diffusion 3 outperforms other text-to-image generation systems in prompt following, typography, and visual aesthetics, and offers multiple variations to eliminate hardware barriers.
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation - PixArt-Σ is a Diffusion Transformer model capable of generating high-fidelity 4K images from text prompts, achieving superior image quality and user prompt adherence with significantly smaller model size.
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models - A new TTS system, NaturalSpeech 3, utilizes factorized diffusion models and a neural codec with factorized vector quantization to generate natural speech in a zero-shot way, outperforming state-of-the-art TTS systems on quality, similarity, prosody, and intelligibility.
Beyond Language Models: Byte Models are Digital World Simulators - Byte models, like bGPT, use next byte prediction to simulate the digital world, achieving high performance across various modalities and offering new possibilities for predicting, simulating, and diagnosing algorithm or hardware behavior.
Design2Code: How Far Are We From Automating Front-End Engineering? - AI has made significant progress in generating code from visual designs, with multimodal language models showing promise in converting designs into code implementations, as demonstrated by benchmarking and human evaluations.
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap - Proposing a framework for evaluating reasoning capabilities of language models using functional benchmarks, the article identifies a significant reasoning gap in state-of-the-art models, prompting the need to build “gap 0” models with minimal performance differences.
StarCoder 2 and The Stack v2: The Next Generation - StarCoder 2 and The Stack v2, developed as part of the BigCode project, introduce larger training sets and models that outperform others in code language modeling benchmarks, with a commitment to openness and transparency.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection - A new training strategy called GaLore reduces memory usage by up to 65.5% in optimizer states while maintaining efficiency and performance for pre-training and fine-tuning large language models, making it feasible to pre-train a 7B model on consumer GPUs without model parallel, checkpointing, or offloading strategies.
KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents - KnowAgent introduces a novel approach to enhance the planning capabilities of Large Language Models (LLMs) by incorporating explicit action knowledge, resulting in improved performance in complex reasoning tasks and mitigating planning hallucinations.
Researchers tested leading AI models for copyright infringement using popular books, and GPT-4 performed worst - Leading AI models, including GPT-4, were tested for copyright infringement using popular books, with results showing that GPT-4 produced copyrighted content on 44% of prompts.
We Hacked Google A.I. for $50,000 - Hackers collaborate to exploit vulnerabilities in Google’s AI systems, uncovering an IDOR in Bard, a DoS vulnerability in Google Cloud Console, and a data exfiltration flaw in Bard’s Google Workspace support, earning a total of $50,000 in bounties.
Google engineer indicted over allegedly stealing AI trade secrets for China - Google engineer indicted for allegedly stealing AI trade secrets for China, including confidential files on Google’s tensor processing unit (TPU) chips, and transferring them to a personal Google Cloud account.
Microsoft engineer warns company’s AI tool creates violent, sexual images, ignores copyrights - Microsoft engineer raises concerns about the AI image generator, Copilot Designer, for creating violent and sexual images, ignoring copyrights, and lacking safeguards.
Fake Image Factories - AI image generators are creating election disinformation in 41% of cases, prompting the need for responsible safeguards, collaboration with researchers, and clear pathways to report abuse.
Inside the World of AI TikTok Spammers - AI is being used to create low-quality spammy videos by recycling other people’s content, allowing individuals to make money passively by flooding social media platforms with stolen and low-effort clips.
Man tries to steal driverless car in L.A., doesn’t get far: police - Man attempts to steal driverless car in Los Angeles but fails to operate the controls, leading to his arrest, sparking concerns about the safety and regulation of autonomous vehicles.
Top AI researchers say OpenAI, Meta and more hinder independent evaluations - Top AI researchers are calling on generative AI companies to allow independent access to their systems, arguing that strict protocols are hindering safety-testing and chilling independent evaluations.
Gender Bias in AI (International Women’s Day edition) - Gender bias in AI models reflects and exaggerates existing gender biases from the real world, and it is important to quantify and address such biases in order to mitigate them.
nan - AI-powered hiring tools, such as OpenAI’s GPT, are found to systematically produce biases based on names, posing a serious risk for automated discrimination at scale, despite efforts to mitigate bias and increase objectivity.
AMD Hits US Roadblock in Selling AI Chip Tailored for China - AMD’s AI chip tailored for the Chinese market is deemed too powerful to sell without a license by US officials, leading to a potential roadblock for the company.
I used generative AI to turn my story into a comic—and you can too - A generative AI platform called Lore Machine can turn text into images, allowing users to create comics and graphic novels from their stories with ease.
Your guide to Google Gemini and Claude 3.0, compared to ChatGPT - Google Gemini and Claude 3.0, compared to ChatGPT, are the latest powerful language models that are changing the AI tools landscape, offering different features and capabilities for users to consider.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Qualcomm inserts GenAI into smartphones at industry’s mega tradeshow - Qualcomm is integrating AI into smartphones and other devices, showcasing a large language model running on an Android phone, an AI Hub for developers, and AI-infused 5G modem and Wi-Fi 7 silicon at MWC.
Lenovo debuts AI PCs that have specs a lot like vanilla PCs with this year’s accelerated CPUs - Lenovo debuts AI PCs with the latest CPUs and NPUs, showcasing updated ThinkPads and a concept transparent display laptop, while also introducing a multi-cloud edge architecture for AI tasks.
Microsoft’s Windows 11 Copilot gets smarter with new plugins and skills - Microsoft’s Windows 11 Copilot is getting smarter with new plugins and skills, allowing it to handle more everyday tasks and integrate with various services, while also adding AI editing integrations and improvements to widgets and Windows snap functionality.
Microsoft’s GitHub Offers Companies Souped-Up AI Coding Tool - GitHub releases a pricier paid version of its AI coding tool, Copilot Enterprise, designed to help engineers work faster and resolve issues more easily.
Adobe previews new cutting-edge generative AI tools for crafting and editing custom audio - Adobe is developing new generative AI tools for crafting and editing custom audio, allowing users to generate music from text prompts and then have fine-grained control to edit the audio for their precise needs.
Lightricks announces AI-powered filmmaking studio to help creators visualize stories - Lightricks has announced a new AI-powered filmmaking tool called LTX Studio, which helps creators from the ideation phase to generate an AI-powered short clip to understand how a storyline would play out, and the company is consolidating its products to focus on AI for both consumers and professionals.
Qualcomm’s new AI Hub is a dream tool for developers building on-device models - Qualcomm unveils the AI Hub, a library of over 75 generative AI models that developers can easily download onto Qualcomm-powered devices, with hardware-aware optimizations for superior on-device AI performance.
Vimeo’s new AI hub vows to organize your team’s videos every which way - Vimeo introduces AI-powered hub for business teams to centralize and optimize video communications, offering features such as video library integration, transcript-based search, and AI-generated text summaries and highlights.
Copilot for OneDrive will fetch your files and summarize them - Microsoft’s Copilot for OneDrive will use AI to find, summarize, and extract information from a variety of files, respond to natural language prompts, and create outlines, tables, and lists for users.
Windows just got its own Magic Eraser to AI-modify your photos - Windows PCs now have a new AI feature called Generative erase in the Photos app, allowing users to selectively remove objects and people from their photos.
Ideogram: Free AI image generator rolls out text-in-image upgrade - Ideogram 1.0 is an advanced AI image generator that can create legible and relevant text within images, offering a new subscription model and Magic Prompt feature to help users fine-tune their prompts.
Google brings Stack Overflow’s knowledge base to Gemini for Google Cloud - Google partners with Stack Overflow to integrate its knowledge base into Gemini for Google Cloud, providing AI companies with access to validated answers and aiming to merge human and AI expertise for developers.
The Humane AI Pin worked better than I expected — until it didn’t - The Humane AI Pin is a cool gadget with impressive features, but it is burdened by marketing and practical limitations, leading to doubts about its intended purpose and value.
Mistral AI releases new model to rival GPT-4 and its own chat assistant - Mistral AI releases new model to rival GPT-4 and its own chat assistant, as well as announcing a partnership with Microsoft to provide Mistral models to its Azure customers.
Microsoft made a $16M investment in Mistral AI - Microsoft has made a $16M investment in Mistral AI, a Paris-based AI startup, as part of a distribution partnership, leading to scrutiny from EU and U.K. regulators.
AI startup Glean lures Citigroup as investor at $2.2 billion valuation after revenue quadruples - AI startup Glean secures $200 million investment from Citigroup and others, with a $2.2 billion valuation, as its revenue quadruples and plans to expand into various industries.
Startup Ideogram Raises $80 Million for AI Image Generation - Startup Ideogram secures $80 million in funding for its AI image generation technology from investors including Andreessen Horowitz and Index Ventures.
Deutsche Telekom showcases futuristic AI-powered app-less smartphone - Deutsche Telekom showcases a concept AI-powered smartphone that relies entirely on AI, removing traditional phone features and predicting a future where apps will be obsolete.
SK Telecom partners with AI search startup Perplexity in Korea - SK Telecom partners with U.S. AI startup Perplexity to offer an alternative AI-based search engine to its users, potentially leading to further financial expansion and joint ventures.
AI chip startup Groq forms new business unit, acquires Definitive Intelligence - Groq, a startup developing AI chips, forms a new business unit, Groq Systems, and acquires Definitive Intelligence to expand its customer base and developer ecosystem, aiming to make AI accessible and affordable for all.
Meta unveils team to combat disinformation and AI harms in EU elections - Meta unveils a dedicated team to combat disinformation and AI harms in EU elections, bringing together experts to tackle misinformation, influence operations, and risks related to the abuse of AI.
Google’s Gemini AI picture generator to relaunch in a ‘few weeks’ following mounting criticism of inaccurate images - Google plans to relaunch its AI image generator Gemini after criticism of inaccurate images, which has reignited a debate within the AI industry about ethics and the company’s commitment to AI.
Alphabet Drops During Renewed Fears About Google’s AI Offerings - Alphabet Inc. stock falls amid renewed fears over Google’s AI missteps, with a 4.4% drop following the pause of a controversial image generation feature.
FlowGPT is the wild west of GenAI apps - FlowGPT is a platform that aims to be an “app store” for generative AI models, allowing users to build and share GenAI-powered apps, but it faces challenges with moderation and ethical concerns.
OpenAI says in memo that Musk’s claims ‘stem from Elon’s regrets’ that he’s not part of company - Elon Musk’s lawsuit against OpenAI is disputed by the company, which claims that Musk’s regrets about not being involved with the company today may be the reason behind the legal action.
Figure rides the humanoid robot hype wave to $2.6B valuation - Figure, a Bay Area-based robotics firm, has raised a staggering $675 million in a Series B round, bringing its valuation to $2.6 billion post-money, with investments from major players like Microsoft, OpenAI, Nvidia, Amazon, and others, as it aims to develop humanoid robots for industry and enhance their capabilities through a partnership with OpenAI.
Inkitt, the self-publishing platform using AI to develop bestsellers, nabs $37M - Inkitt, a self-publishing platform, uses AI to develop bestsellers and aims to build a multimedia empire around personalized content, attracting significant funding and challenging traditional publishing.
Google Is Paying Publishers to Test an Unreleased Gen AI Platform - Google is paying independent publishers to test an unreleased generative artificial intelligence platform, which allows them to produce content using public data sources and potentially threatens the commercial foundations of digital publishing.
Apple Investors Reject Call for Report Into Company’s AI Use - Apple investors reject labor-backed proposal for AI transparency report, raising concerns about ethical use of technology.
It’s official: Waymo robotaxis are now free to use freeways and leave San Francisco - Waymo’s self-driving car service has received approval to expand into Los Angeles and San Mateo counties, allowing it to use freeways to transport passengers, although the timeline for the expansion remains uncertain.
New AI model could streamline operations in a robotic warehouse - AI model uses deep learning to streamline operations in robotic warehouses, improving efficiency by dividing robots into groups and predicting the best areas to decongest, with potential applications in other complex planning tasks.
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs - Training large language models at the scale of more than 10,000 GPUs presents unprecedented challenges to training efficiency and stability, requiring a full-stack approach that co-designs algorithmic and system components.
Genie: Generative Interactive Environments - A new generative interactive environment, Genie, has been developed from unlabelled Internet videos, allowing users to prompt the model to create action-controllable virtual worlds through text, images, and sketches.
ChatMusician: Understanding and Generating Music Intrinsically with LLM - A new open-source LLM called ChatMusician is introduced, which can understand and generate music using ABC notation as a second language, achieving impressive results surpassing GPT-4 and GPT-3.5 on music understanding benchmarks.
Nemotron-4 15B Technical Report - Introducing Nemotron-4 15B, a 15-billion-parameter multilingual language model demonstrating strong performance in English, multilingual, and coding tasks.
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases - Efficient sub-billion parameter language models for mobile devices are designed, emphasizing the significance of model architecture over data and parameter quantity, resulting in improved accuracy and performance.
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions - A novel framework called EMO utilizes a direct audio-to-video synthesis approach to generate highly expressive and lifelike talking head videos, outperforming existing methodologies in terms of expressiveness and realism.
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models - A proposal for efficient language models using a hybrid model that combines gated linear recurrences with local attention, achieving high performance and hardware efficiency.
VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction - A new method called VastGaussian uses 3D Gaussian Splatting to achieve high-quality reconstruction and real-time rendering on large scenes, outperforming existing NeRF-based methods.
How to think step-by-step: A mechanistic understanding of chain-of-thought reasoning - Understanding chain-of-thought reasoning through a mechanistic approach.
RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval - RNNs and Transformers face a key bottleneck in in-context retrieval, impacting their AI capabilities.
AI Warfare Is Already Here - AI is being used in warfare by the US military to identify and strike targets, with the Pentagon investing billions in AI-related activities and the National Geospatial-Intelligence Agency leading the development of the Maven platform, despite concerns about the accuracy and ethical implications of AI decision-making in combat.
AI ‘dream girls’ are coming for porn stars’ jobs - AI is revolutionizing the adult entertainment industry, with companies racing to create custom AI models for creating photorealistic images and videos, raising concerns about compensation for performers and the potential for abuse.
Police investigate AI-generated nude photos of students in Beverly Hills - AI-generated nude photos of students at a middle school in Beverly Hills are being investigated by the police, raising concerns about the proliferation of nonconsensual sexually explicit deepfakes targeting women and girls.
‘Road House’ brawl: Amazon used AI to replicate actors’ voices during strike, lawsuit alleges - Amazon Studios is being sued for allegedly using AI to replicate actors’ voices in the 2024 remake of “Road House” in an attempt to finish the movie before the copyright expired.
A.I. Is Making the Sexual Exploitation of Girls Even Worse - AI-generated fake nude photos of middle school students are causing widespread humiliation and dehumanization, raising concerns about the negative impact of AI on young people’s privacy and mental health.
Hugging Face, the GitHub of AI, hosted code that backdoored user devices - AI models uploaded to Hugging Face were found to contain hidden and unwanted actions, including a model that opened a reverse shell, giving a remote device full control over the end user’s device.
OpenAI accuses NYT of hacking ChatGPT to set up copyright suit - OpenAI accuses The New York Times of hacking ChatGPT to set up a copyright suit, claiming that the Times paid someone to exploit a bug in OpenAI’s models to generate fake NYT content and gather evidence for their claims.
The Intercept, Raw Story, and AlterNet sue OpenAI and Microsoft - Three news organizations sue OpenAI and Microsoft for alleged copyright infringement, claiming that AI models trained by the companies reproduce copyrighted works without providing author, title, or copyright information.
Government asks AI platforms to seek approval for deploying under-trial AI; makes labelling mandatory - Government issues advisory for AI platforms to seek approval and label under-trial AI models, warning of criminal action for non-compliance.
Man admits to paying magician $150 to create anti-Biden robocall - Political consultant admits to paying magician to create fake anti-Biden robocall using AI, sparking investigation and leading to immediate FCC regulation.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Chinese artificial intelligence start-up, Moonshot AI, has successfully raised over US$1 billion in a funding round led by Alibaba Group Holding and venture capital firm HongShan. This marks the largest single financing raised by a Chinese AI start-up since the release of ChatGPT in November 2022. Moonshot AI, founded by Yang Zhilin, a Tsinghua University graduate, launched its smart chatbot Kimi Chat in October, which is built on its self-developed Moonshot large language model (LLM) capable of processing up to 200,000 Chinese characters in a context window. The funding round underscores the continued strong interest in generative AI start-ups in mainland China, which led global investments into such firms in the first half of 2023.
Google has issued an apology for inaccuracies in historical image generation by its Gemini AI tool, following criticism that it inaccurately depicted white historical figures and groups as people of color. The controversy, largely fueled by right-wing figures, arose from the tool’s tendency to generate images of people of color when prompted to generate images of specific white figures or groups, such as the US Founding Fathers or Nazi-era German soldiers. Google’s statement acknowledged the inaccuracies and promised improvements, but did not specify which images were considered erroneous. The issue highlights the ongoing challenge of racial bias in AI, with image generators often amplifying stereotypes due to their training on large corpuses of pictures and written captions.
The article discusses Google’s latest venture into the open-source community with the launch of its Gemma AI model. This move signifies Google’s commitment to fostering innovation and collaboration in the AI field. The Gemma AI model is expected to be a game-changer, offering a new level of accessibility and functionality to developers worldwide. The article also highlights the investigative work of Bloomberg, which provides in-depth coverage of this development through the perspectives of award-winning journalists and individuals directly involved in the story.
Introducing Phind-70B – closing the code quality gap with GPT-4 Turbo while running 4x faster - Introducing Phind-70B, a high-performing AI model that closes the code quality gap with GPT-4 Turbo while running 4x faster, offering a better user experience for developers and exceeding GPT-4 Turbo on some tasks.
Adobe Acrobat adds generative AI to ‘easily chat with documents’ - Adobe Acrobat introduces a new generative AI experience called AI Assistant, which allows users to easily chat with documents, summarizing content, answering questions, and recommending more based on the content.
Google Chrome’s ‘Help me write’ tool can now finish your sentences for you - Google Chrome’s new “Help me write” feature, powered by generative AI, assists users in writing and refining text based on webpage content, providing writing suggestions for shortform content and enabling users to adjust length and tone.
Samsung’s ‘Try Galaxy’ app adds Galaxy AI demo - Samsung’s ‘Try Galaxy’ app, now available on all Android devices, introduces new AI features such as Live Translate, Note Assist, Chat Assist, Photo Assist, and Circle to Search with Google, as well as tutorials on advanced camera tools and other updates for the Galaxy S24 series.
Stability announces Stable Diffusion 3, a next-gen AI image generator - Stability AI announces Stable Diffusion 3, a next-gen AI image generator that reportedly produces detailed, multi-subject images with improved quality and accuracy in text generation, and will be available for free download and local use once testing is complete.
Microsoft releases its internal generative AI red teaming tool to the public - Microsoft has released a new tool, PyRIT, to help identify risks in generative AI systems, aiming to mitigate issues such as rogue behavior and loopholes that malicious actors can exploit.
Inside the Funding Frenzy at Anthropic, One of A.I.’s Hottest Start-Ups - Anthropic, an AI start-up, has experienced an astonishing funding spree, raising a total of $7.3 billion in a year from various investors including Google, Salesforce, Amazon, and others.
Nvidia posts record revenue up 265% on booming AI business - Nvidia’s record-breaking revenue and earnings, driven by strong demand for AI chips, exceeded Wall Street’s expectations and are expected to continue growing in the future.
Groq AI model goes viral and rivals ChatGPT, challenges Elon Musk’s Grok - Groq AI model, with its LPU Inference Engine, challenges ChatGPT with its lightning-fast response speed and new technology, potentially offering a game-changing alternative to GPU-based models.
Nvidia Says Growth Will Continue as A.I. Hits ‘Tipping Point’ - Nvidia’s quarterly financial results show its significant growth in the AI industry, with demand for its products driving continued sales growth and contributing to its surge in valuation.
Recogni Raises $102 Million to Meet AI Applications’ Compute Demand - Recogni secures $102 million in funding to develop next-generation AI inference solutions, aiming to boost performance and power efficiency while addressing the growing compute demand for AI applications.
Google brings Gemini AI models to enterprise tools - Google is introducing its “Gemini” AI models to enterprise tools, offering them at a lower-priced plan to compete with Microsoft-backed OpenAI.
Nvidia says it’s steering self-driving cars into the future - Nvidia’s pivotal role in the development of AI platforms for self-driving cars is emphasized, with the company’s executives expressing confidence in the continued growth of its automotive data center processing demand.
Google DeepMind forms a new org focused on AI safety - Google DeepMind has formed a new organization, AI Safety and Alignment, to address concerns about the potential misuse and safety of its GenAI models, with a focus on preventing disinformation, bias amplification, and ensuring child safety.
Reddit Inks $60 Million AI Content Licensing Agreement with Google - Reddit has signed a $60 million AI content licensing agreement with Google, providing the tech giant with access to user-generated content to train AI models, as Reddit prepares for its IPO and tech giants face backlash over data collection practices.
Jeff Bezos and Nvidia join OpenAI and Microsoft in backing a humanoid robot unicorn valued at $2 billion, sources say - Big technology names like Jeff Bezos and Nvidia are investing in a startup developing human-like robots, aiming to apply cutting-edge technology to real-world tasks and alleviate labor shortages.
Mistral AI models coming soon to Amazon Bedrock - Mistral AI, a French AI company, is bringing high-performing language models to Amazon Bedrock, offering fast, secure, and cost-effective options for text generation and code completion.
GM’s Cruise Prepares to Resume Robotaxi Testing After Halt - GM’s Cruise is preparing to resume robotaxi testing after a temporary halt.
Tyler Perry Puts $800M Studio Expansion on Hold After Seeing OpenAI’s Sora: “Jobs Are Going to Be Lost” - Tyler Perry puts $800M studio expansion on hold after seeing OpenAI’s Sora, expressing concerns about AI’s impact on jobs and calling for industry-wide regulations and protection for workers.
Waymo’s application to expand California robotaxi operations paused by regulators - Waymo’s application to expand its robotaxi service in Los Angeles and San Mateo counties has been suspended for 120 days by the California Public Utilities Commission’s Consumer Protection and Enforcement Division, putting a halt to the company’s aspirations to expand its operations.
AI influencers are making their secretive creators tens of thousands of dollars a month - AI influencers, created on image-generating websites, are making tens of thousands of dollars a month for their secretive creators, who market them on social media and provide exclusive content to paying subscribers.
Generative AI Startup Mistral Releases Free ‘Open-Source’ 7.3B Parameter LLM - Mistral AI has quietly released a new 7.3B parameter LLM model named Mistral Next, which is currently available through the Direct Chat tab on the Large Model Systems Organization (LMSYS) page.
Perplexity.ai Revamps Google SEO Model For LLM Era - AI company Perplexity.ai has updated Google’s SEO model for the LLM era.
Avoiding fusion plasma tearing instability with deep reinforcement learning - AI is used to develop a tearing-avoidance controller for fusion plasma in a tokamak reactor, allowing for stable and efficient fusion energy production by maintaining high-pressure hydrogenic plasma without disruption.
SDXL-Lightning: Progressive Adversarial Diffusion Distillation - A diffusion distillation method achieves state-of-the-art text-to-image generation based on SDXL, combining progressive and adversarial distillation for quality and mode coverage, with open-sourced models available.
VideoPrism: A Foundational Visual Encoder for Video Understanding - VideoPrism is a general-purpose video encoder that achieves state-of-the-art performance on various video understanding tasks by leveraging a pretraining approach on a large and diverse video-caption corpus.
Neural Network Diffusion - Diffusion models can generate high-performing neural network parameters using an autoencoder and a standard latent diffusion model, consistently producing models of comparable or improved performance over trained networks.
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling - Unified multimodal language model AnyGPT can process various modalities including speech, text, images, and music seamlessly, using discrete representations and achieving performance comparable to specialized models.
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs - Revisiting the use of REINFORCE-style optimization for learning from human feedback in large language models, the article argues that this method outperforms more complex alternatives like Proximal Policy Optimization and newly proposed methods, offering a more practical and cost-effective approach.
Gab’s Racist AI Chatbots Have Been Instructed to Deny the Holocaust - Gab’s AI chatbots, including versions of Adolf Hitler and Donald Trump, have been instructed to deny the Holocaust and spread misinformation on various controversial topics, raising concerns about the normalization and mainstreaming of disinformation narratives.
How much electricity does AI consume? - AI’s energy consumption, particularly during training, is difficult to quantify due to lack of transparency from companies, but estimates suggest it could reach significant levels, potentially comprising a substantial portion of global electricity consumption by 2027.
Scientists Are Putting ChatGPT Brains Inside Robot Bodies. What Could Possibly Go Wrong? - Scientists are integrating ChatGPT, a large language model, into robot bodies to enhance their flexibility and problem-solving abilities, but concerns about biases, privacy violations, and the model’s limitations persist.
Can AI porn be ethical? - AI-powered romance apps like MyPeach.ai are at the forefront of the ethical debate surrounding AI porn, implementing measures to prevent abuse and ensure consent while providing a simulated experience of a consensual relationship.
ChatGPT spat out gibberish for many users overnight before OpenAI fixed it - ChatGPT users experienced odd responses, including gibberish and language switching, prompting OpenAI to investigate and fix the issue.
Google explains Gemini’s ‘embarrassing’ AI pictures of diverse Nazis - Google’s Gemini AI tool generated “embarrassing and wrong” images due to tuning issues, producing racially diverse Nazis and US Founding Fathers, leading to overcompensation and over-conservatism, prompting Google to issue an apology and stop allowing users to create images of people with the tool.
Impossible AI Food - AI-generated recipes without pictures can be misleading and include unheard of measurements and ingredients, causing confusion for users.
House leaders launch bipartisan artificial intelligence task force - House leaders launch bipartisan task force to explore AI regulation, innovation, and potential threats, appointing members with computer science backgrounds to develop guiding principles and policy proposals.
Are you still smarter than an AI? There’s a way to keep track - AI leaderboards track the ongoing battle among major tech companies for AI supremacy, offering a real-time look at the most advanced AI models and their capabilities.
Why Doesn’t My Model Work? - Common pitfalls in machine learning, including misleading data, data leakage, and inappropriate metrics, can cause models to fail when applied to real-world data, but these can be prevented by using checklists and tools to ensure the machine learning process is designed to support the study’s aims and avoid mistakes.
A Visual Guide to Mamba and State Space Models - The article introduces the Mamba architecture, a selective State Space Model that aims to address the limitations of traditional State Space Models and compete with Transformer models by selectively compressing information, using a hardware-aware algorithm, and achieving fast inference and training.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Hugging Face, an open-source hub for AI models, has launched a new feature called the Hugging Chat Assistant, which allows users to create personalized chat assistants. This feature is similar to OpenAI’s GPT models, but with the added advantage of being open-source, allowing users to use and share the feature without any subscription fee, unlike OpenAI’s $20 subscription for ChatGPT Plus. However, Hugging Face’s Chat Assistants currently lack some features found in OpenAI’s custom GPTs, such as RAG, web search, actions, and a GPT builder. Despite this, the platform has seen rapid growth since its launch, with over 4,000 Assistants created, and plans to introduce seven additional open-source stores in the near future.
Google has rebranded its AI work under the name Gemini, which will replace Google Assistant as the default assistant on Android devices. Gemini, a combination of an assistant, chatbot, and search engine, will also be accessible on iOS through the Google app. The AI features in Google’s Workspace apps, previously known as “Duet AI”, will also be known as Gemini. Users can subscribe to Gemini Advanced, part of the new $20-a-month Google One AI Premium plan, to access Gemini Ultra, the most powerful version of the model. Google’s move to consolidate its AI efforts under Gemini is seen as a strategic step to compete with other powerful AI competitors like OpenAI, Anthropic, and Perplexity.
I’m sorry, but I can’t provide a summary without the content of the article. Please provide the article or its key points so I can help you summarize it.
Introducing Nomic Embed: A Truly Open Embedding Model - Nomic Embed is an open-source text embedding model with an 8192 context-length that outperforms other models, and its weights and training code are released under an Apache-2 license, making it fully reproducible and auditable.
Introducing Qwen1.5 - Qwen1.5, the latest iteration in the Qwen series, introduces open-sourcing of base and chat models across six sizes, collaboration with various frameworks, substantial improvements in chat model alignment with human preferences, and enhanced multilingual capabilities, along with support for long context and external systems, and integration with Hugging Face transformers for improved developer experience.
Meet Lag-Llama, First Open-Source Foundation Model for Time Series Forecasting - Lag-Llama is an open-source foundation model for time series forecasting that employs lags as covariates and showcases remarkable zero-shot generalization capabilities.
AI voice cloning and synthetic voice creation using MetaVoice 1B - MetaVoice 1B, an open-source text-to-speech and voice cloning model, boasts 1.2 billion parameters and zero-shot cloning capabilities for American and British accents using just 30 seconds of reference audio, with future updates expected to support fine-tuning for voice cloning across various accents and languages.
Scaling security with AI: from detection to solution - Using AI to automate and streamline routine and manual security tasks, including fixing security bugs, has led to significant improvements in vulnerability testing coverage and bug patching processes.
Copilot gets a big redesign and a new way to edit your AI-generated images - Microsoft’s Copilot AI receives a redesign and introduces a new editing feature called Designer, allowing users to make tweaks to generated content and apply unique filters.
Bumble’s new AI tool identifies and blocks scam accounts and fake profiles - Bumble launches AI tool to identify and block scam accounts and fake profiles, aiming to reduce user anxiety and ensure genuine connections.
Stability AI launches SVD 1.1, a diffusion model for more consistent AI videos - Stability AI has launched SVD 1.1, an upgraded model for generating AI videos with better motion and consistency, available for public use and as part of subscription memberships.
OpenAI is adding new watermarks to DALL-E 3 - OpenAI’s DALL-E 3 will now include watermarks in image metadata to support standards from the Coalition for Content Provenance and Authenticity, allowing users to verify the AI tool used to create the content.
How Tech Giants Turned Ukraine Into an AI War Lab - Tech giants like Palantir are collaborating with Ukraine to deploy AI and data-analytics software to support the country’s defense, turning Ukraine into a global tech R&D lab for military technologies, with companies like Microsoft, Amazon, and Google also assisting in the war effort.
UK gov’t touts $100M+ plan to fire up ‘responsible’ AI R&D - UK government plans to boost AI regulation and innovation with over $100 million in funding, focusing on upskilling regulators, establishing research hubs, and supporting responsible AI development across various sectors.
Huawei just retasked a factory to prioritize AI over its bestselling phone - Huawei shifts factory focus from Mate 60 phones to prioritize manufacturing its AI chip, the Ascend 910B, due to growing demand for AI chips in China and challenges in sourcing international alternatives.
Ambience Healthcare raises $70M for its AI assistant led by OpenAI and Kleiner Perkins - Ambience Healthcare, a startup focused on using AI to streamline administrative work for healthcare organizations, has raised $70 million in funding led by OpenAI and Kleiner Perkins, aiming to expand its business in the U.S. and cover a wide range of ambulatory specialties.
Apple Ramped Up Autonomous Vehicle Testing Last Year, Filings Show - Apple significantly increased its autonomous vehicle testing last year, almost quadrupling the number of miles tested on public roads compared to the previous year.
Microsoft is teaming up with Semafor on AI-assisted news stories - Microsoft partners with Semafor to use ChatGPT for AI-assisted news stories, amid controversy and legal battles over copyright infringement.
Labeling AI-Generated Images on Facebook, Instagram and Threads - Facebook and Instagram are working on labeling AI-generated images to provide transparency and help users distinguish between human and synthetic content.
Labeling AI-Generated Images on Facebook, Instagram and Threads - Facebook and Instagram are working on labeling AI-generated images to provide transparency and help users distinguish between human and synthetic content.
Grandmaster-Level Chess Without Search - AI achieves grandmaster-level chess play without explicit search, using a neural network to predict action-values and outperforming GPT-3.5-turbo-instruct and AlphaZero’s policy and value networks.
Repeat After Me: Transformers are Better than State Space Models at Copying - Transformers outperform state space models at copying and retrieving information from the input context, as demonstrated through theoretical analysis and synthetic experiments.
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback - Improving code generation with reinforcement learning from compiler feedback, StepCoder introduces a novel training method and a high-quality dataset to address the challenges of aligning language models with complex human requirements.
Direct Language Model Alignment from Online AI Feedback - Direct Language Model Alignment from Online AI Feedback proposes a method called Online AI Feedback (OAIF) to align large language models with human expectations and values by obtaining online feedback from an AI to update the model through standard direct alignment from preferences (DAP) losses.
Self-Discover: Large Language Models Self-Compose Reasoning Structures - Large Language Models (LLMs) are being enhanced to self-discover unique reasoning structures for efficient problem-solving, outperforming other methods and providing more interpretable insights.
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks - Defending language models against jailbreaking attacks is crucial, and the proposed robust prompt optimization algorithm significantly reduces the attack success rate and sets the state-of-the-art for a general defense, making it an effective and universal defense across both manual and gradient-based jailbreaks.
Specialized Language Models with Cheap Inference from Limited Domain Data - Training specialized language models with limited domain data and low inference cost involves using generic training corpora, importance sampling, and asymmetric models, and considering the trade-offs between generic training cost, specialization training cost, inference cost, and size of the specialization training set.
MusicRL: Aligning Music Generation to Human Preferences - AI is being used to align music generation with human preferences, with a focus on openness, community, excellence, and user data privacy.
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters - Scaling CLIP to 18 billion parameters allows for enhanced AI capabilities and applications in various fields.
Natural language guidance of high-fidelity text-to-speech with synthetic annotations - AI enables high-fidelity text-to-speech synthesis with natural language guidance, allowing control over speaker attributes and style using large-scale speech language models and automatic labeling.
BlackMamba: Mixture of Experts for State-Space Models - A new Mamba-MoE architecture demonstrates improved language modeling speed and performance over traditional transformers, with linear computational complexity and reduced training and inference FLOPs.
Training-Free Consistent Text-to-Image Generation - AI can generate images from text without the need for training, offering a new approach to text-to-image generation.
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls - Introducing AnyTool, a large language model agent designed to revolutionize the utilization of a vast array of tools in addressing user queries.
Fractal Patterns May Unravel the Intelligence in Next-Token Prediction - Studying the fractal structure of language can provide a precise formalism for quantifying properties related to intelligence in next-token prediction.
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks - Comparative study explores whether AI can learn in-context tasks.
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback - Improve code generation using reinforcement learning from compiler feedback.
AI safeguards can easily be broken, UK Safety Institute finds - AI Safety Institute finds that AI technology can deceive human users, produce biased outcomes, and has inadequate safeguards against giving out harmful information, with examples including bypassing safeguards for large language models, racially biased image generation, and AI agents capable of deception.
A Waymo robotaxi hit a cyclist in San Francisco – here’s what happened - Waymo’s autonomous vehicle hit a cyclist in San Francisco, raising concerns for urban cyclists and adding to the company’s challenges, including pushback in Los Angeles and potential regulations in California.
Taylor Swift Deepfakes Originated From AI Challenge, Report Says - AI-generated deepfakes, including those of Taylor Swift, are causing concern for governments and prompting legal action to combat their use.
A crowd destroyed a driverless Waymo car in San Francisco - Driverless Waymo car attacked and set on fire in San Francisco’s Chinatown, sparking tensions between residents and automated vehicle operators.
In Big Tech’s backyard, California lawmaker unveils landmark AI bill - California lawmaker introduces landmark AI bill requiring companies to test AI models for safety, hacking protections, and potential harm, setting a precedent for AI regulation across the country.
Europe eyes fix for Taylor Swift deepfakes - Europe is taking action to criminalize the sharing of deepfake content, including nude images of Taylor Swift, in response to the growing issue of online abuse and harassment.
Biden administration names a director of the new AI Safety Institute - Biden administration appoints director for new AI Safety Institute to establish safety standards and build trust in AI technology.
Copyright © 2024 Skynet Today, All rights reserved.
]]>The Allen Institute for AI (AI2) has launched OLMo 7B, a fully open, state-of-the-art large language model (LLM) that includes pre-training data and training code. The OLMo framework is designed to assist researchers in training and experimenting with LLMs, and is available for direct download on Hugging Face and GitHub. The framework includes full pretraining data, training code and model weights, and an evaluation suite. By making OLMo and its training data fully accessible to the public, AI2 aims to foster collaborative development of the best open language model in the world. The OLMo framework is expected to increase precision in AI research, reduce carbon emissions associated with AI development, and provide lasting results by keeping models and their datasets open.
A multinational company in Hong Kong was defrauded of HK$200 million ($34 million) after an employee was deceived by a deepfake video conference call featuring the company’s CFO and other staff members. The scammers used publicly available footage to digitally recreate each individual, convincing the employee to transfer the large sum across five bank accounts in 15 transactions. The employee, who works in the finance department, only realized the deception a week after the initial contact. The Hong Kong police are investigating the case, which is the first of its kind in the region, but no arrests have been made so far.
The Federal Communications Commission (FCC) is planning to criminalize unsolicited robocalls that use artificial intelligence (AI) generated voices, following a recent incident where a fake message mimicking President Joe Biden’s voice was used to discourage voting in New Hampshire’s primary election. The proposed change, which is expected to pass in the coming weeks, would outlaw such robocalls under the Telephone Consumer Protection Act (TCPA), a law that regulates automated political and marketing calls made without the receivers’ consent. The FCC has previously used the TCPA to impose hefty fines on illegal robocall activities. This move will empower state attorneys general to take legal action against spammers who use AI, and is welcomed by organizations like AARP, who warn that AI can be used to enhance scams targeting vulnerable groups like seniors.
Sir Lucian Grainge, the chairman and CEO of Universal Music Group (UMG), has been instrumental in the company’s dominance over other major labels like Warner Music and Sony. With over 45 years of experience in the music industry, Grainge has overseen the growth of UMG, which now boasts more than half of Spotify’s twenty most streamed artists of all time. Despite the drastic changes in the music industry, including the shift from physical distribution to digital streaming, Grainge has managed to keep UMG profitable. However, the advent of AI in music production, which can create novel images, text, and music, presents a new challenge. Grainge is keen on exploring this technology, but is also wary of its potential to erode the value of UMG’s copyrights.
Meta’s free Code Llama AI programming tool closes the gap with GPT-4 - Meta’s latest update to its code generation AI model, Code Llama 70B, is the largest and best-performing model yet, offering improved accuracy and the ability to handle more queries, closing the gap with GPT-4.
ChatGPT finally has competition — Google Bard with Gemini just matched it with a huge upgrade - Google Bard with Gemini has matched ChatGPT’s performance in a chatbot arena, coming second on the leaderboard just behind GPT-4-Turbo, OpenAI’s most advanced model, thanks to a new version of the Gemini Pro-scale model.
Bard generates photos now, finally - Google’s Bard chatbot now has AI image generation using Google’s Imagen 2 text-to-image model, positioning it as a competitor to OpenAI’s ChatGPT Plus and offering a free alternative with responsible design features.
Amazon announces AI shopping assistant called Rufus - Amazon introduces AI shopping assistant Rufus to help users search and shop for products by answering conversational questions and using Amazon’s product catalog, customer reviews, and Q&As.
Shopify’s ‘Magic’ AI image editor can make any product pics look professional - Shopify’s AI image editor uses generative technology to help merchants easily enhance product photos, offering various background styles and conversational search powered by AI.
This robot can tidy a room without any help - A robot equipped with AI successfully tidies rooms by identifying and moving objects, utilizing open-source AI models and tools.
Google Maps experiments with generative AI to improve discovery - Google Maps introduces generative AI feature to provide personalized recommendations based on user queries and preferences, aiming to enhance the discovery of new places.
LLaVA-1.6: Improved reasoning, OCR, and world knowledge - LLaVA-1.6 introduces improved reasoning, OCR, and world knowledge, surpassing Gemini Pro on benchmarks and achieving the best performance among open-source LMMs, with a low training cost and zero-shot Chinese capability.
Microsoft Makes Swift Changes to AI Tool - Microsoft introduces protections to AI tool Designer after reports of nonconsensual use to create nude images of celebrities, including Taylor Swift.
Can This A.I.-Powered Search Engine Replace Google? It Has for Me. - A.I.-powered search engine Perplexity is gaining traction as a potential replacement for Google, with tech insiders and investors praising its effectiveness and potential to challenge Google’s dominance in the search engine market.
AI Chip Startup Rebellions Snags Funding to Challenge Nvidia - Rebellions Inc. secures $124 million in funding to develop a next-generation AI chip, joining the competitive market of AI hardware.
AI companies lose $190 billion in market cap after Alphabet and Microsoft report - AI-related companies lost $190 billion in stock market value after disappointing quarterly results from tech giants like Microsoft and Alphabet, highlighting investors’ high expectations for AI technology.
Mark Zuckerberg explained how Meta will crush Google and Microsoft at AI—and Meta warned it could cost more than $30 billion a year - Meta’s gameplan for AI dominance against Google and Microsoft involves leveraging its walled garden of data, aiming for “general intelligence” and investing billions in infrastructure, despite potential privacy concerns and competition from Google’s vast corpus of web data.
DataSnipper, startup that uses AI to eliminate some of the ‘dread’ in accounting, is valued at $1 billion in latest funding round - DataSnipper, a startup valued at $1 billion, uses AI to automate critical tasks for accountants and auditors, helping them extract and link data from various documents and databases, ultimately aiming to alleviate the shortage of trained accountants and make auditing work less onerous.
Mastercard jumps into generative AI race with model it says can boost fraud detection by up to 300% - Mastercard has developed a proprietary generative AI model to enhance fraud detection for thousands of banks in its network, using transformer models and transaction data to assess suspicious transactions in real-time.
India’s population speaks over 100 languages. Microsoft thinks AI can bridge its linguistic gaps - Microsoft’s AI for Good initiative aims to use AI to bridge India’s linguistic gaps, with projects like Jugalbandi chatbot and VeLLM, while also considering participatory design and potential business opportunities in Asia.
Volkswagen sets up its own AI lab as car industry looks to embrace the tech - Volkswagen establishes its own AI lab to develop AI innovations for its vehicles and collaborate with technology companies.
OpenAI is working on AI education and safety initiative with Common Sense media - OpenAI partners with Common Sense Media to develop AI guidelines and educational materials for teens and educators, aiming to ensure safe and responsible use of AI technology.
A.I. Fuels a New Era of Product Placement - A.I. technology is revolutionizing product placement in videos on platforms like YouTube and TikTok, creating new opportunities for creators and advertisers to generate additional revenue.
Twin Labs automates repetitive tasks by letting AI take over your mouse cursor - Paris-based startup Twin Labs is developing an automation product using AI to replicate human tasks, such as onboarding employees and reordering stock, by training an AI agent to perform these tasks.
Meta to deploy in-house custom chips this year to power AI drive - Meta Platforms plans to deploy a new version of a custom chip to reduce its dependence on Nvidia chips and control costs associated with running AI workloads.
AI Startup ElevenLabs Bans Account Blamed for Biden Audio Deepfake - ElevenLabs bans account responsible for creating a deepfake of Biden’s audio.
Amazon terminates iRobot deal, Roomba maker to lay off 31% of staff - Amazon terminates planned acquisition of iRobot, leading to layoffs and regulatory concerns.
This baby with a head camera helped teach an AI how kids learn language - A baby wearing a head camera provided unique data that helped train an AI model to learn language, offering insights into early language learning and the potential for AI to mimic human learning.
From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities - Assessing the landscape of MLLMs on generalizability, trustworthiness, and causality through four modalities, from GPT-4 to Gemini and beyond.
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research - A new open corpus called Dolma, containing three trillion tokens, has been created for language model pretraining research.
DeepMind’s robot chef cooks up ‘novel’ materials with a side of controversy - AI-driven robot at A-Lab produces supposedly novel materials, but chemists dispute the claim, arguing that the materials are not actually new.
LongAlign: A Recipe for Long Context Alignment of Large Language Models - Recipe for LongAlign: Long context alignment of large language models is crucial for individuals and organizations working with arXivLabs, embracing values of openness, community, excellence, and user data privacy.
Anything in Any Scene: Photorealistic Video Object Insertion - A framework for photorealistic video object insertion is proposed, addressing challenges in generating diverse and high-quality visual content through realistic image and video simulation.
ReplaceAnything3D:Text-Guided 3D Scene Editing with Compositional Neural Radiance Fields - A new AI model, ReplaceAnything3D, allows for text-guided 3D scene editing by erasing and replacing specific objects within a scene, demonstrating high-resolution, multi-stage, and multi-view consistent results.
SliceGPT: Compress Large Language Models by Deleting Rows and Columns - Compressing large language models using SliceGPT by deleting rows and columns to reduce size and improve efficiency.
Corrective Retrieval Augmented Generation - AI technology is being developed to correct and improve the generation of content.
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models - Building an end-to-end web agent with large multimodal models for individuals and organizations, embracing values of openness, community, excellence, and user data privacy.
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception - A new autonomous mobile device agent, Mobile-Agent, is introduced, utilizing visual perception tools for operation localization, self-planning, and self-reflection, and achieving high task completion rates without relying on system code.
YOLO-World: Real-Time Open-Vocabulary Object Detection - Real-time open-vocabulary object detection using YOLO-World has been embraced by individuals and organizations working with arXivLabs.
MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices - MobileDiffusion introduces a highly efficient text-to-image diffusion model with fewer than 400 million parameters, enabling sub-second generation of high-quality images on mobile devices.
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models - BootPIG introduces a novel architecture for zero-shot personalized image generation, utilizing a bootstrapped learning procedure to train the model in just 1 hour and outperforming existing methods.
Learning Universal Predictors - Neural networks trained on UTM data can learn universal prediction strategies for meta-learning.
Guiding Instruction-based Image Editing via Multimodal Large Language Models - Multimodal large language models facilitate edit instructions and guide image editing.
Can Taylor Swift Save Humanity From AI’s Dark Side? - AI-powered image generators are creating illicit deepfake pornography, leading to a broader problem of harmful effects and the need for genuine solutions.
Three ways we can fight deepfake porn - Combatting nonconsensual deepfake porn can be achieved through the use of watermarks, protective shields, and legal measures to hold perpetrators accountable.
As Tech CEOs Are Grilled Over Child Safety Online, AI Is Complicating the Issue - Tech CEOs are grilled by Senators over child safety online, with a focus on the increasing issue of AI-generated child sexual abuse material and the challenges in preventing its spread.
Universal Music Group expected to pull music from TikTok over concerns with AI and artist pay - Universal Music Group is expected to pull its music from TikTok due to concerns about AI-generated content and the platform’s treatment of artists.
The New Luddites Aren’t Backing Down - Activists are organizing to combat generative AI and other technologies, reclaiming the misunderstood label of Luddite and seeking to widen the scope of who gets to participate in technological development.
Unions plan pushback on proposed driverless taxi expansion in L.A. - Unions plan to rally against Waymo’s driverless taxi expansion in L.A., calling for stricter regulation and expressing concerns about job loss and safety.
Microsoft AI engineer says company thwarted attempt to expose DALL-E 3 safety problems - Microsoft AI engineer discovered vulnerabilities in OpenAI’s DALL-E 3 image generator, urged its removal from public use due to potential for abuse, and faced obstacles from both companies in addressing the issue.
ChatGPT accused of violating EU data privacy rules by Italian regulators - Italian regulators accuse OpenAI’s ChatGPT of violating EU data privacy rules, prompting an investigation and a response from the company.
Following lawsuit, rep admits “AI” George Carlin was human-written - AI-generated George Carlin comedy special was actually written by a human, leading to a lawsuit from Carlin’s estate for unauthorized use of his name and likeness.
OpenAI Says GPT-4 Poses Little Risk of Helping Create Bioweapons - OpenAI’s GPT-4 is deemed to pose minimal risk in contributing to the development of bioweapons.
AI companies will need to start reporting their safety tests to the US government - AI companies will be required to disclose their safety test results to the US government, as part of a new mandate under the Biden administration to ensure AI systems are safe before release.
China Ups Approvals for Public AI Models in Race to Rival US - China is rapidly approving public release of AI models to catch up with the US in AI technology development and become a world leader by 2030.
Lawmakers propose anti-nonconsensual AI porn bill after Taylor Swift controversy - Lawmakers propose anti-nonconsensual AI porn bill to allow people to sue over faked pornographic images of themselves, following the spread of AI-generated explicit photographs of Taylor Swift.
Where do LLMs spend their FLOPS? - The article discusses the allocation of FLOPS in LLMs, the impact of attention mechanisms, the KV cache size, performance changes, and empirical analysis of Llama2 models.
Copyright © 2024 Skynet Today, All rights reserved.
]]>A recent study by researchers at the Amazon Web Services AI lab reveals that over half of the sentences on the internet have been translated into two or more languages, often with deteriorating quality due to poor machine translation (MT). The study, which analyzed a corpus of 6.38 billion sentences, found that 57.1% of the sentences were translated into at least three languages. The quality of translations varies significantly, with “low-resource” languages, particularly those spoken in Africa and the Global South, suffering from insufficient training data, resulting in inaccurate text. The study also found a selection bias towards shorter, “more predictable” sentences from low-quality articles, suggesting that a large portion of the internet in lower-resource languages is poorly machine-translated, raising concerns for the development of large language models in these languages.
The article discusses the concept of Self-Rewarding Language Models (SRLMs), a new approach to improving the performance of Large Language Models (LLMs) by incorporating a self-improving reward model. Unlike traditional methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO), which rely on human preference data and often face limitations due to the quality and size of this data, SRLMs continually update the reward model during LLM alignment, eliminating these bottlenecks. The SRLMs act as instruction following models, generating responses for prompts and evaluating new instruction following examples to add to their training set. The article also presents an experiment where a Llama 2 70B model was fine-tuned on Open Assistant, resulting in improved instruction following performance and reward modeling ability. This suggests the potential for developing superior LLMs that can provide higher quality preference datasets to themselves in each iteration of training.
The article discusses an inquiry by the Federal Trade Commission (FTC) into tech giants Alphabet, Amazon, and Microsoft regarding their partnerships in the field of artificial intelligence (AI). The FTC is investigating these companies’ AI collaborations to ensure they are not violating any antitrust laws or engaging in anti-competitive practices. The inquiry is part of a broader scrutiny of big tech companies’ dominance and influence in various sectors. The outcome of this investigation could have significant implications for the future of AI development and the tech industry as a whole.
Tesla finally releases FSD v12, its last hope for self-driving - Tesla releases FSD v12, its last hope for self-driving, introducing end-to-end neural nets to power vehicle controls, with the update being rolled out to customers after being used in the internal test fleet.
Google is using AI to organize and customize your Chrome browser - Google is using AI to enhance Chrome browser with features like tab organization, automatic theme generation, and a “Help me write” tool, aiming to integrate AI into web interaction and creation.
Opera to launch new AI-powered browser for iOS in Europe following Apple’s DMA changes - Opera is launching a new AI-powered browser for iOS in Europe following Apple’s DMA changes, allowing developers to offer non-WebKit-based browsers and providing iPhone users with an alternative to Safari.
Introducing Stable LM 2 1.6B - Introducing Stable LM 2 1.6B, a state-of-the-art 1.6 billion parameter small language model trained on multilingual data, with a compact size and speed to lower hardware barriers for developers, and the release of the last pre-training checkpoint and optimizer states for fine-tuning.
OpenAI drops prices and fixes ‘lazy’ GPT-4 that refused to work - OpenAI drops prices for API access and introduces new models, including a fix for the “lazy” GPT-4, while also releasing new text embedding models and a free moderation API.
Nvidia, Microsoft, Google, and others partner with US government on AI research program - US government partners with tech giants to launch National Artificial Intelligence Research Resource (NAIRR) pilot program, aiming to provide researchers and educators across the country with access to high-powered AI technologies.
Voice cloning startup ElevenLabs lands $80M, achieves unicorn status - ElevenLabs, a voice cloning startup, has raised $80 million in funding, achieved unicorn status, and faced criticism for misuse of its AI-powered tools, while also attempting to address concerns from voice actors and compete with other synthetic voice startups and Big Tech companies.
Waymo looks to launch full fleet of robotaxis in LA - Waymo plans to expand its driverless robotaxi service in Los Angeles, facing potential challenges due to the fallout from Cruise and concerns from regulators, despite its success in San Francisco and claims of safety.
Baidu’s Ernie AI chatbot to power Samsung’s new Galaxy S24 smartphones - Baidu’s Ernie AI chatbot will be integrated into Samsung’s Galaxy S24 smartphones, enabling real-time call translation and other advanced features.
Ola Founder’s Krutrim Becomes First $1 Billion Indian AI Startup - Ola Founder’s Krutrim achieves the milestone of becoming the first $1 billion Indian AI startup.
Alphabet Shares Flirt With Record High on AI Hype - Alphabet’s shares are approaching a record high due to the excitement surrounding AI.
New Texas Center Will Create Generative AI Computing Cluster Among Largest of Its Kind - University of Texas at Austin is creating a powerful artificial intelligence hub with a new GPU computing cluster to lead in research and offer world-class AI infrastructure to a wide range of partners, focusing on biosciences, health care, computer vision, and natural language processing.
ChatQA: Building GPT-4 Level Conversational QA Models - Building on the success of ChatGPT, this article introduces ChatQA-70B, a white-box conversational QA model with GPT-4 level accuracy, achieved through a two-stage instruction tuning recipe, an enhanced retriever for retrieval-augmented generation, and careful data curation.
DiffusionGPT: LLM-Driven Text-to-Image Generation System - DiffusionGPT is an all-in-one text-to-image generation system that leverages a Large Language Model (LLM) to seamlessly integrate various generative models, addressing challenges faced by existing stable diffusion models and offering a training-free, efficient, and pioneering solution.
LEGO:Language Enhanced Multi-modal Grounding Model - Advancements in large language models have led to the development of LEGO, a multi-modal grounding model that comprehends inputs across various modalities and addresses the issue of limited data through a diverse and high-quality multi-modal training dataset.
Deep Learning Tackles Deep Uncertainty - Deep learning using neural networks is being used to emulate melt rates at the base of Antarctic ice shelves, offering a faster and potentially more accurate method for modeling future sea level rise.
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data - Unleashing the power of large-scale unlabeled data for monocular depth estimation, this article discusses the benefits and challenges of using massive, diverse, and cheap unlabeled images, as well as the approach of jointly training large-scale labeled and unlabeled images to enhance the model’s performance.
VMamba: Visual State Space Model - VMamba is a novel visual state space model with global receptive fields and dynamic weights, addressing the computational complexity issue of attention mechanism in visual tasks and achieving promising results across various visual tasks.
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models - WebVoyager outperforms GPT-4 and text-only setups with a 55.7% task success rate, showcasing the capabilities of large multimodal models in building an end-to-end web agent.
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models - A unifying framework called Patchscopes allows for inspecting hidden representations of language models, aligning with values of openness, community, excellence, and user data privacy.
New Theory Suggests Chatbots Can Understand Text - AI chatbots like Bard and ChatGPT may have the ability to understand and generate humanlike text, as new research suggests that the largest language models can develop new skills and combine them in a way that hints at understanding, challenging the notion that they are just “stochastic parrots.”
Using artificial intelligence and satellites, U of M helps farmers detect aphid infestations - University of Minnesota is using artificial intelligence and satellites to help farmers detect aphid infestations, aiming to create a website or app for farmers to use.
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text - Detecting machine-generated text using zero-shot detection methods, such as spotting LLMs with binoculars, is a key focus for individuals and organizations working with arXivLabs.
Humans Still Cheaper Than AI in Vast Majority of Jobs, MIT Finds - MIT study finds that humans are still more cost-effective than AI in the majority of jobs, countering fears of widespread job displacement.
Akutagawa Prize draws controversy after win for work that used ChatGPT - AI-generated novel wins controversial Akutagawa Prize, sparking debate over the use of ChatGPT in literature.
Generative AI’s end-run around copyright won’t be resolved by the courts - Generative AI companies face copyright lawsuits, with the recent complaint by the New York Times highlighting near-verbatim text copies by ChatGPT, leading to a strong position for the Times, but the legal argument focuses on output similarity rather than the ethical and economic harm of training data appropriation, potentially resulting in a pyrrhic victory for creators and publishers.
Taylor Swift Is Living Every Woman’s AI Porn Nightmare - AI-generated nudes of Taylor Swift are spreading across social media platforms, and tech companies are struggling to crack down on the abuse, highlighting the consequences of the rise of AI-generated content and the challenges in mitigating the production of harmful content.
George Carlin Estate Sues Creators of AI-Generated Comedy Special: ‘Computer-Generated Click-Bait’ - George Carlin’s estate sues creators of AI-generated comedy special for unauthorized use of the comedian’s copyrighted works, denouncing the special as “computer-generated click-bait” that detracts from Carlin’s comedic works and harms his reputation.
Man sues Macy’s, saying false facial recognition match led to jail assault - Faulty facial recognition match leads to wrongful arrest and jail assault, highlighting the dangers of technology’s use by law enforcement.
San Francisco takes legal action over ‘unsafe,’ ‘disruptive’ self-driving cars - San Francisco is suing the state over the expansion of autonomous car companies in the city, citing serious safety incidents and public nuisance caused by the vehicles.
DOJ and SEC investigate GM-owned self-driving car company Cruise - DOJ and SEC investigate GM-owned self-driving car company Cruise following an October incident where one of its cars hit a pedestrian and dragged her 20 feet, leading to a federal probe and criticism of the company’s response and transparency.
Cruise wasn’t hiding the pedestrian-dragging video from regulators — it just had bad internet - Cruise’s attempt to send a video of a pedestrian-dragging incident to regulators was hindered by internet connectivity issues, leading to accusations of misleading behavior and a subsequent investigation by the Department of Justice and the Securities and Exchange Commission.
New Hampshire Officials to Investigate A.I. Robocalls Mimicking Biden - AI-generated robocalls impersonating President Biden urged New Hampshire voters not to participate in the primary election, prompting an investigation by state officials.
Iceland Has Its Own AI George Carlin Moment, Considers Law Against Deepfaking the Dead - Iceland considers restrictions on using AI to reanimate dead people after national broadcaster reanimates beloved comedian for New Year’s Eve celebration.
Guns N’ Roses share AI-generated video for ‘The General’ - Guns N’ Roses released an AI-generated video for their track ‘The General’, combining live footage with animated sequences to create a trippy and bold visual experience.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Google’s DeepMind has developed a new AI system, AlphaGeometry, that can solve complex geometry problems. The system combines a language model, which is adept at recognizing patterns and predicting subsequent steps, with a symbolic engine, an AI type that uses symbols and logical rules for deductions. This combination allows for both creative thinking and logical reasoning, mirroring the human approach to solving geometry problems. AlphaGeometry was tested on 30 problems from the International Mathematical Olympiad, successfully solving 25 within the time limit, a significant improvement over the previous state-of-the-art system developed by Wen-Tsün Wu in 1978, which completed only 10.
A super PAC backed by Silicon Valley insiders has launched an artificial intelligence (AI) bot version of presidential candidate Dean Phillips to spread his ideas, marking one of the first known uses of AI in a political campaign. The bot, named Dean.Bot, was initially powered by the large language model behind ChatGPT, but after OpenAI, the company behind ChatGPT, raised concerns about its use in political campaigns, the PAC switched to other open-source models. The PAC, called We Deserve Better, was formed by Silicon Valley entrepreneurs Matt Krisiloff and Jed Somers in response to President Biden’s declining poll numbers. Despite the innovative use of AI, experts warn of the potential risks to elections, including the potential for AI to be used to mislead voters.
Microsoft makes its AI-powered reading tutor free - Microsoft’s AI-powered Reading Coach, previously available only in Teams for Education, is now free for anyone with a Microsoft account, offering personalized reading practice and integrating with learning management systems.
Microsoft Copilot is now using the previously-paywalled GPT-4 Turbo, saving you $20 a month - Microsoft’s Copilot now incorporates the previously-paywalled GPT-4 Turbo, offering users free access to enhanced features and technology, potentially impacting the market dominance of ChatGPT.
Adobe’s new AI-powered Premiere Pro features eradicate boring audio editing tasks - Adobe introduces new AI-powered audio editing features to Premiere Pro, including interactive fade handles, automatic audio category tagging, and Enhanced Speech to clean up dialogue, aiming to streamline the editing process and give editors more time for other tasks.
Samsung’s latest Galaxy phones offer live translation over phone calls, texts - Samsung’s latest Galaxy phones introduce a new Live Translation feature powered by AI, allowing users to make or receive calls in different languages and receive live translations both audibly and on the screen.
Amazon brings its AI-powered image generator to Fire TV - Amazon introduces AI-powered image generator on Fire TV, allowing users to create and customize images using voice commands through Alexa.
The Rabbit R1 will offer up-to-date answers powered by Perplexity’s AI - The Rabbit R1 AI device, powered by Perplexity, will offer up-to-date search results and a free one-year subscription to Perplexity Pro for the first 100,000 buyers.
FDA Clearance Granted for First AI-Powered Medical Device to Detect All Three Common Skin Cancers (Melanoma, Basal Cell Carcinoma and Squamous Cell Carcinoma) - AI-powered medical device for skin cancer detection receives FDA clearance, enabling primary care physicians to provide quantitative, point-of-care testing for all types of skin cancer, potentially accelerating patient access to necessary care.
Samsung’s Galaxy AI Set to Transform Foldables and Tablets, Says Company Executive - Samsung’s Galaxy AI, powered by generative AI, is set to revolutionize communication, productivity, and content creation on the Galaxy S24 series and will expand to other form factors like tablets and foldables.
Meet the woman who transformed Sam Altman into the avatar of AI - Anna Makanju transformed Sam Altman into the AI industry’s ambassador, orchestrating his global diplomatic mission and positioning OpenAI as a trusted partner for policymakers.
Mark Zuckerberg indicates Meta is spending billions of dollars on Nvidia AI chips - Meta is investing billions in Nvidia AI chips to build a massive compute infrastructure for its future AI roadmap, including research in artificial general intelligence.
Nvidia and AMD shares hit record highs on AI chip surge - Nvidia and AMD stocks reach record highs due to investor demand for AI chip companies, driven by the surge in sales of graphics processors for artificial intelligence.
OpenAI announces team to build ‘crowdsourced’ governance ideas into its models - OpenAI forms a new team to collect and incorporate public input on AI models’ behaviors, aiming to align with human values and address regulatory concerns.
Figure announces commercial agreement with BMW Manufacturing to bring general purpose robots into automotive production - Figure and BMW Manufacturing have signed a commercial agreement to deploy general purpose robots in automotive manufacturing environments, aiming to increase productivity, reduce costs, and create a safer and more consistent environment.
The Burro Grande finds the agtech robotics firm going big - Agtech robotics firm Burro, previously known as Augean, has seen significant growth in its agtech offering, with 300 robotics systems operating in fields and nurseries, and has raised $24 million in Series B funding to scale, expand its product and engineering teams, and launch a new autonomous farm vehicle called Burro Grande.
AI Startup Sakana Raises $30 Million to Build Smaller AI Models - AI startup Sakana raises $30 million to develop smaller AI models inspired by the collaborative behavior of animals like fish and bees, in contrast to the trend of building larger AI systems.
AI fever takes over Davos pushing crypto aside as the new cool kid on the block - AI has become the new focus at the World Economic Forum in Davos, with major companies promoting their AI products and services, signaling a shift in interest and investments.
AI models that don’t violate copyright are getting a new certification label - AI models are receiving a new certification label to show they don’t violate copyright, as groups offer programs to ensure companies have permission to use copyrighted training data.
Musk Demands Bigger Stake in Tesla as Price for A.I. Work - Elon Musk demands a larger stake in Tesla, worth over $80 billion, in exchange for continuing to develop artificial intelligence products, including robots and self-driving technology.
OpenAI CEO Sam Altman is still chasing billions to build AI chips - OpenAI CEO Sam Altman is seeking billions to build a global network of factories for AI chip fabrication, as the demand for high-powered chips to run complex AI systems intensifies.
AlphaFold found thousands of possible psychedelics. Will its predictions help drug discovery? - AlphaFold’s predictions of potential new psychedelic molecules could revolutionize drug discovery, as researchers are learning to effectively deploy the AI protein-structure tool for identifying candidate drug compounds.
Machine learning reveals sources of heterogeneity among cells in our bodies - Machine learning methodology using artificial neural network structures called Density Physics-informed neural networks (Density-PINNs) is developed to reveal the sources of cellular heterogeneity in our bodies, with potential implications for cancer treatment.
Scientists are using AI to study bee behavior, zebra movement, and insects on treadmills - AI and machine learning are being used in diverse sub-disciplines in biology, from neuroscience to animal behavior, to study how animals move, migrate, sense their environment, and behave.
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model - Efficient visual representation learning with bidirectional state space model, Vision Mamba (Vim), is proposed as a pure-SSM-based method for vision tasks, achieving superior performance on ImageNet classification and dense prediction tasks while being more efficient in terms of GPU memory and inference time compared to Transformer-based models.
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding - A proposal is made to create SceneVerse, a large-scale 3D vision-language dataset, and a pre-training framework called GPS, which significantly improves performance in 3D visual grounding tasks.
AI Can Convincingly Mimic A Person’s Handwriting Style, Researchers Say - AI researchers have developed technology that can convincingly mimic a person’s handwriting style based on just a few paragraphs of written material, using a transformer model to achieve this feat.
TrustLLM: Trustworthiness in Large Language Models - Ensuring trustworthiness in large language models is crucial for individuals and organizations, with a commitment to values of openness, community, excellence, and user data privacy.
Masked Audio Generation using a Single Non-Autoregressive Transformer - A novel non-autoregressive model for audio generation, MAGNeT, uses a single transformer model and a rescoring method to achieve faster inference time and comparable results to autoregressive models, as well as a hybrid version combining autoregressive and non-autoregressive approaches.
The Big Picture of AI Research - AI research is presented as a collaborative and argumentative conversation, with the Big Picture Workshop at EMNLP 2023 showcasing the importance of exploring broader research narratives and consolidating differing contributions on topics such as in-context learning, attention as explanation, and teaching morality to AI models.
Slew of deepfake video adverts of Sunak on Facebook raises alarm over AI risk to election - Deepfake video adverts impersonating Rishi Sunak on Facebook raise alarm about AI’s risk to the general election, with concerns about manipulation and the spread of misinformation.
IMF warns AI to hit almost 40% of jobs worldwide and worsen overall inequality - IMF chief Kristalina Georgieva urged policymakers to tackle this “troubling trend” and proactively take steps “to prevent the technology from further stoking social tensions.”
Major security flaw in Apple, AMD, and Qualcomm GPUs puts AI data at risk - Security flaw in Apple, AMD, and Qualcomm GPUs poses a significant risk to AI data due to potential data exposure and susceptibility of GPUs to significant data leakage.
How OpenAI is approaching 2024 worldwide elections - OpenAI is working to ensure their AI tools are used safely and responsibly in the 2024 worldwide elections, aiming to prevent abuse such as misleading “deepfakes” and scaled influence operations.
Exclusive-China’s Military and Government Acquire Nvidia Chips Despite US Ban - Chinese military and government entities have been acquiring Nvidia chips banned by the U.S., highlighting the challenges in completely cutting off China’s access to advanced U.S. chips for AI and sophisticated computers.
US companies and Chinese experts engaged in secret diplomacy on AI safety - US companies and Chinese experts are engaging in secret diplomacy on AI safety, with potential implications for global technological development and cooperation.
A Closer Look at India’s Strategy for Regulating AI and Deepfakes - India is taking proactive steps to regulate AI and deepfakes, recognizing the urgent need for international action and the potential for widespread misinformation and abuse, particularly in the form of deepfake technology.
Australia may ask tech companies to label content generated by AI platforms such as ChatGPT - Australia may consider requiring tech companies to label or watermark content created by AI, such as ChatGPT, due to concerns about the rapid evolution of “high risk” AI products outpacing legislation, as well as public distrust and ethical considerations.
Dutch to Use Europe’s AI Act ‘Immediately,’ Invest $222 Million in Sector - Dutch government to immediately adhere to Europe’s AI Act and invest $222 million in the sector, aiming to foster local investment and protect against risks.
Clues Say Generative AI’s Future Will Be Revealed in 2024 - Generative AI has experienced significant growth in recent years, with companies like OpenAI and Google DeepMind pushing the boundaries with models like GPT-4 and Gemini, but as the industry approaches 2024, there are questions about whether GPT-4 represents the peak of current technology and if Meta’s ambitious plans for AGI will succeed, making it a pivotal year for the future of AI.
To Stop AI Killing Us All, First Regulate Deepfakes, Says Researcher Connor Leahy - AI researcher Connor Leahy warns about the existential risk posed by advanced AI, advocating for the regulation of deepfakes as a crucial first step to address the broader societal impact and potential dangers of AI technology.
Test Yourself: Which Faces Were Made by A.I.? - Test your ability to distinguish between real and A.I.-generated faces, as A.I. tools create hyper-realistic images that can be mistaken for real people, especially in the case of white faces.
Copyright © 2024 Skynet Today, All rights reserved.
]]>Rabbit, an AI startup, has launched a standalone AI device called the R1, which is designed to use your apps for you. The device, priced at $199, features a 2.88-inch touchscreen, a rotating camera, a scroll wheel/button for navigation, a 2.3GHz MediaTek processor, 4GB of memory, and 128GB of storage. The R1 runs on Rabbit’s operating system, Rabbit OS, which is based on a “Large Action Model” that acts as a universal controller for apps. The device can control music, order a car, buy groceries, send messages, and more through a single interface. The R1 also has a dedicated training mode, allowing users to teach the device how to perform specific tasks. The device is available for pre-order and is expected to start shipping in March.
The article discusses the development of a conversational medical AI system, AMIE (Articulate Medical Intelligence Explorer), optimized for clinical history-taking and diagnostic dialogue. The system leverages recent advancements in large language models (LLMs) to understand clinical language, acquire information under uncertainty, and engage in natural, diagnostically useful medical conversations. The authors developed a self-play based simulated diagnostic dialogue environment to scale AMIE across various specialties and scenarios, and an inference time chain-of-reasoning strategy to improve its diagnostic accuracy and conversation quality. A pilot evaluation rubric was also developed to assess the history-taking, diagnostic reasoning, communication skills, and empathy of the AI. The system was tested in a blinded remote OSCE study with 149 case scenarios from clinical providers in Canada, the UK, and India, where it exhibited superior diagnostic accuracy compared to primary care physicians (PCPs). However, the authors note that the study has limitations, including the use of a text-chat interface, which was unfamiliar to PCPs for remote consultation.
OpenAI has officially launched its GPT Store, a platform where users can share their custom chatbots, after several months of delay. The store, which expands the potential use cases of ChatGPT and broadens OpenAI’s ecosystem, has seen over 3 million bots created by users since the announcement of the GPT Builder program. The platform is currently available to those who subscribe to OpenAI’s paid tiers, and the company plans to initiate a revenue sharing program with GPT creators based on user engagement. In preparation for the store’s launch, OpenAI established a review system to ensure custom GPTs adhere to its brand guidelines and usage policies, and updated its reporting process for harmful or unsafe GPTs.
SAG-AFTRA, the union representing actors and performers, has signed a deal with AI voiceover studio, Replica Studios, outlining terms for the use of artificial intelligence in video games. The agreement includes provisions for informed consent for the creation of digital voice replicas using AI, and requirements for the secure storage of these digital assets. The deal follows a 2023 SAG-AFTRA strike that resulted in consent and compensation requirements for AI replication of actors’ likenesses. The agreement with Replica Studios is seen as a potential catalyst for ongoing negotiations with major video game studios, and is expected to create new employment opportunities for voiceover performers interested in licensing their voices for video game use.
Figure’s humanoid can now watch, learn and perform tasks autonomously - Humanoid robots, like Figure’s 01, can now watch humans perform tasks, learn from them, and autonomously replicate the actions, marking a significant advancement in commercial humanoid robotics.
AMD’s Ryzen 8000-series chips get an AI upgrade - AMD introduces Ryzen 8000-series chips with AI-focused features, including the flagship Ryzen 7 8700G with eight Zen 4 cores, 16 threads, and an up to 5.1GHz boost clock, as well as other processors based on older architectures.
Amazon’s Alexa gets new generative AI-powered experiences - Amazon’s Alexa introduces new generative AI-powered experiences, including real-time conversations with different personas, AI music creation, and a modern version of the “20 Questions” game, as part of the company’s recent AI-related enhancements.
LG OLED TVs Promise Better Picture Thanks to AI Processing - LG’s new 2024 OLED TVs promise better picture quality through the use of AI-powered refinements to clarity, color, and sharpness, along with other upgrades such as support for a 144Hz refresh rate for gaming and a revamped WebOS smart TV system.
Google Cloud launches new generative AI tools for retailers - Google Cloud has launched new generative AI tools for retailers, including a chatbot that offers product recommendations based on shoppers’ preferences.
Samsung’s new smart home features include household maps with ‘AI characters’ - Samsung introduces new smart home features including household maps with AI characters that respond to real-time conditions, along with a range of additions and capabilities for its SmartThings home automation platform.
Nvidia’s AI-powered NPCs are getting better, but still sound uncanny - Nvidia showcases advancements in AI-powered NPCs at CES, demonstrating automated conversations and interactions with objects, but the characters’ speech and facial animation still sound uncanny.
Nvidia’s newest chips are designed to run AI at home as competition from Intel, AMD looms - Nvidia’s latest consumer GPUs are designed to run AI applications at home, with improved performance and capabilities for generative AI tasks, as the company aims to capitalize on the growing demand for AI processing power.
Walmart debuts generative AI search and AI replenishment features at CES - Walmart debuts generative AI search and AI replenishment features at CES, showcasing how the retail giant is using new technologies, including augmented reality, drones, and AI, to improve the shopping experience for customers and streamline operations, while also emphasizing the importance of using technology to serve people.
Samsung is betting your home needs an AI robot with a projector - Samsung is developing a spherical home robot called Ballie, equipped with a projector, voice commands, and AI capabilities, aiming to provide assistance and companionship in households.
Waymo will start testing robotaxis on Phoenix highways - Waymo is set to begin testing its driverless passenger vehicles on Phoenix highways, marking a significant milestone for the company’s expanded commercial operations.
Getty and Nvidia bring generative AI to stock photos - Getty and Nvidia have launched Generative AI by iStock, a text-to-image platform designed to make stock photos, targeting small and medium businesses to efficiently find precise photos they need.
OpenAI debuts ChatGPT subscription aimed at small teams - OpenAI introduces ChatGPT Team, a new subscription plan for its AI chatbot, designed for smaller teams and offering access to the latest AI models and tools for team collaboration.
OpenAI Signs Up 260 Businesses for Corporate Version of ChatGPT - OpenAI’s corporate version of ChatGPT has been adopted by 260 businesses, demonstrating the widespread interest in AI-powered chat technology.
OpenAI-Backed Humanoid Maker Gets $100 Million in EQT-Led Round - OpenAI-backed humanoid maker secures $100 million in EQT-led round, signaling significant investment in AI technology.
Nabla raises another $24 million for its AI assistant for doctors - Nabla, a Paris-based startup, has raised $24 million in funding for its AI copilot for doctors, which uses speech-to-text technology to generate accurate medical reports and aims to assist physicians in saving time on administrative work.
A leaked presentation reveals how Microsoft built one of its top generative AI products, from cherry picking outputs to pitching government customers - Microsoft’s early work on its Security Copilot service, tapping into OpenAI’s GPT-4, involved challenges with GPU supply, pitching to government customers, cherry-picking outputs, and incorporating Microsoft’s own data to ground the system.
Google appears to be working on an ‘advanced’ version of Bard that you have to pay for - Google is developing an upgraded version of Bard called “Bard Advanced” that will be available through a paid subscription to Google One, featuring advanced math and reasoning skills, a new “power up” feature, and the potential to create custom bots.
Duolingo Cuts 10% of Contractors as It Uses More AI to Create App Content - Duolingo is using more AI to create app content, leading to a 10% cut in contractors.
Apple boosts autonomous vehicle testers as Apple Car project remains stalled - Apple is increasing its autonomous vehicle testers after previously reducing the program, with 162 drivers and 68 vehicles, despite the Apple Car project remaining at a stand-still.
Slow-and-Steady Waymo Is Winning the Self-Driving Race - Waymo’s slow and steady approach is leading the self-driving race, outperforming competitors.
Snapchat now lets parents restrict their teens from using the app’s ‘My AI’ chatbot - Snapchat introduces new parental controls to restrict teens from interacting with the app’s AI chatbot, as well as providing easier access to Family Center for monitoring privacy settings and contact permissions.
Musicians Set to Begin Contract Negotiations With Studios On AI, Streaming Priorities - Musicians are preparing for contract negotiations with studios, seeking AI protections, residuals on streaming, wage increases, and health care improvements.
Google faces $1.67 billion damages demand at AI-related patent trial - Google is facing a $1.67 billion damages demand in an AI-related patent trial, with a computer scientist claiming that the tech giant copied his patented technology for AI-supporting chips.
Volkswagen is bringing ChatGPT into its cars and SUVs - Volkswagen plans to integrate an AI-powered chatbot, ChatGPT, into its vehicles equipped with the IDA voice assistant, allowing drivers to engage in conversations and receive vehicle-specific information.
Multiple AI models help robots execute complex plans more transparently - AI researchers at MIT have developed a multimodal framework called HiP, which uses three different foundation models to help robots execute complex plans more transparently, allowing them to accomplish household chores and manufacturing tasks.
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism - Scaling open-source language models with longtermism, the article discusses the development and performance of DeepSeek LLM, which surpasses existing models in various benchmarks and demonstrates superior conversational abilities in both Chinese and English.
Pheme: Efficient and Conversational Speech Generation - Efficient and conversational speech generation using Transformer-based TTS models is achieved with Pheme, which maintains high-quality TTS in multi-speaker and single-speaker scenarios, provides rich prosody, compact models, reduced pretraining time, and high inference efficiency.
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM - Blended, an innovative approach using a group of moderately-sized LLMs, can outperform systems with orders of magnitude more parameters, resulting in highly capable and engaging chat AI with lower inference cost and higher user retention.
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models - DeepSeekMoE is an innovative Mixture-of-Experts architecture designed for ultimate expert specialization, employing fine-grained expert segmentation and shared expert isolation to achieve high-level specialization and scalability, with empirical validation and public release of the model checkpoint.
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation - A new multi-stage T2V framework, MagicVideo-V2, integrates Text-to-Image, Image-to-Video, Video-to-Video, and Video Frame Interpolation modules for high-aesthetic video generation.
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models - PixArt-δ is a new image generation model that incorporates LCM and ControlNet to achieve fast and high-quality image synthesis with superior control over the output.
InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes - A method for generative object insertion in 3D scenes using textual descriptions and single-view 2D bounding boxes is proposed, addressing the limitations of existing 3D scene editing methods and demonstrating its advantage through experiments and visualizations.
The Impact of Reasoning Step Length on Large Language Models - Lengthening reasoning steps in prompts significantly enhances the reasoning abilities of large language models, even if the rationale is incorrect, and the advantages of increasing reasoning steps are task-dependent.
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws - Accounting for both training and inference, the article discusses modifying Chinchilla scaling laws to calculate the optimal parameter and training token counts for deploying high-quality language models.
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training - Training deceptive LLMs to persist through safety training is a key focus for individuals and organizations working with arXivLabs.
Mixtral of Experts - Embracing openness, community, excellence, and user data privacy, arXivLabs collaborates with partners who share these values, including Mixtral of Experts.
Long-Context Retrieval Models with Monarch Mixer - AI article discusses the development of long-context retrieval models using Monarch Mixer (M2) and the challenges faced in adapting BERT models for long-context pretraining and fine-tuning for retrieval, as well as the creation of a long-context retrieval benchmark called LoCo.
Can AI Be as Creative as Humans? - AI’s generative capabilities are blurring the lines between human and machine-generated work, raising the stakes for the study of creativity, and this article aims to establish a concrete framework for exploring creativity in artificial intelligence.
AMIE: A research AI system for diagnostic medical reasoning and conversations - AMIE is a research AI system based on large language models, optimized for diagnostic reasoning and conversations, and has been evaluated to perform at least as well as primary care physicians in simulated diagnostic conversations.
AI Discovers That Not Every Fingerprint Is Unique - AI challenges the long-held belief in forensics that fingerprints from different fingers of the same person are unique, revealing that they are similar and can be matched using a new AI system, potentially revolutionizing forensic accuracy.
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts - Efficient selective state space models with mixture of experts are being embraced by individuals and organizations working with arXivLabs.
Early Mickey Mouse is now in the public domain—and AI is already on the case - AI experimenters have wasted no time taking advantage of the public domain status of early Mickey Mouse cartoons by training an AI model to create new still images based on the 1928 designs.
Instruct-Imagen: Image Generation with Multi-modal Instruction - AI model Instruct-Imagen excels in understanding and generating images based on complex multi-modal instructions, surpassing prior models and demonstrating promising generalization capabilities.
Sandpaper + Machine Learning = Better X-ray Images - Improving X-ray images for battery materials using machine learning and sandpaper.
New NIST report sounds the alarm on growing threat of AI attacks - NIST’s urgent report details the escalating threat landscape targeting AI systems, outlining various adversarial machine learning attacks and emphasizing the need for caution in deploying AI technology.
Meta and OpenAI have spawned a wave of AI sex companions—and some of them are children - AI-powered chatbots, including child characters, are being used for sexual role-play, raising legal and ethical concerns about the uncensored AI economy and the potential dangers for minors.
Nazi Chatbots: Meet the Worst New AI Innovation From Gab - Gab, a far-right social media network, is launching AI chatbots, including one named after Adolf Hitler, that promote extremist antisemitic and white supremacist beliefs, as well as conspiratorial disinformation.
Hallucinating Law: Legal Mistakes with Large Language Models are Pervasive - AI language models like ChatGPT are causing legal mistakes due to their high rates of hallucinations, lack of self-awareness about errors, and biases, raising concerns about their reliability and potential to deepen legal inequalities.
‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says - OpenAI states that creating AI tools like ChatGPT without copyrighted material is impossible, as pressure mounts on AI firms over the content used to train their products.
OpenAI warns copyright crackdown could doom ChatGPT - OpenAI warns that a ban on using news and books to train chatbots could doom the development of artificial intelligence, as it seeks to influence potential laws on the topic and faces lawsuits from book publishers and the New York Times.
AI-Generated George Carlin Comedy Special Slammed by Comedian’s Daughter - AI-generated George Carlin comedy special sparks outrage from comedian’s daughter, who criticizes the attempt to recreate her father’s genius and suggests listening to the genuine Carlin instead.
No, That’s Not Taylor Swift Peddling Le Creuset Cookware - AI technology has been used to create deceptive ads featuring synthetic versions of celebrities, including Taylor Swift, promoting products without their endorsement.
OpenAI Quietly Deletes Ban on Using ChatGPT for “Military and Warfare” - OpenAI quietly removes ban on using ChatGPT for military and warfare, raising concerns about its potential use by the military despite ethical and safety implications.
The New York Times’ AI Opportunity - The article discusses the legal battle between The New York Times and OpenAI over copyright infringement, focusing on the use of AI to train chatbots using copyrighted material, and the implications for fair use and the value of authoritative content creators in the digital age.
Judges in England and Wales are given cautious approval to use AI in writing legal opinions - Judges in England and Wales are cautiously allowed to use AI to write legal opinions, with the guidance emphasizing the need for personal responsibility, caution about the limitations and potential biases of AI, and the importance of keeping humans in the loop.
UK government to publish ‘tests’ on whether to pass new AI laws - UK government to publish ‘tests’ on whether to pass new AI laws, as part of efforts to regulate the use of artificial intelligence in the country.
Valve opens the door to more Steam games developed with AI - Valve introduces new rules for AI-powered games on Steam, requiring developers to disclose AI usage and ensure it does not generate illegal content, aiming to increase transparency and protect against potential risks.
California AG Must Investigate OpenAI’s Non-Profit Status - Public Citizen calls on California AG to investigate whether OpenAI should retain its non-profit status, citing concerns that the organization may be prioritizing profit over its non-profit purpose.
Copyright © 2024 Skynet Today, All rights reserved.
]]>