Asymmetric Certified Robustness via Feature-Convex Neural Networks

TLDR: We propose the asymmetric certified robustness problem, which requires certified robustness for only one class and reflects real-world adversarial scenarios. This focused setting allows us to introduce feature-convex classifiers, which produce closed-form and deterministic certified radii on the order of milliseconds.

Figure 1. Illustration of feature-convex classifiers and their certification for sensitive-class inputs.

This architecture composes a Lipschitz-continuous feature map $\varphi$ with a learned convex function $g$. Since $g$ is convex, it is globally underapproximated by its tangent plane at $\varphi(x)$, yielding certified norm balls in the feature space. Lipschitzness of $\varphi$…
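As a rough sketch of why convexity yields a closed-form certificate, suppose the sensitive class is predicted whenever $g(\varphi(x)) > 0$ and $\varphi$ is $L$-Lipschitz (an illustrative convention, not necessarily the paper's exact formulation). The tangent-plane bound gives, for any feature vector $y$,

$$g(y) \;\ge\; g(\varphi(x)) + \nabla g(\varphi(x))^\top (y - \varphi(x)) \;\ge\; g(\varphi(x)) - \|\nabla g(\varphi(x))\|_* \, \|y - \varphi(x)\|,$$

so the prediction cannot flip while $\|y - \varphi(x)\| < g(\varphi(x)) / \|\nabla g(\varphi(x))\|_*$ (with $\|\cdot\|_*$ the dual norm), and Lipschitzness converts this feature-space ball into a certified input-space radius of $g(\varphi(x)) / (L \, \|\nabla g(\varphi(x))\|_*)$, computable from a single gradient evaluation.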
Ethical, trust, and skill barriers slow generative AI progress in EMEA

76% of consumers in EMEA think AI will have a significant impact over the next five years, yet 47% question the value AI will bring and 41% are worried about its applications, according to research from enterprise AI analytics firm Alteryx. Since OpenAI released ChatGPT in November 2022, there has been significant buzz about the transformative potential of generative AI, with many considering it one of the most revolutionary technologies of our time. With 79% of organisations reporting that generative AI contributes positively to business, it is evident that a gap needs to be addressed to demonstrate AI’s value to consumers both in…
Pocket-Sized AI Models Could Unlock a New Era of Computing

When ChatGPT was released in November 2022, it could only be accessed through the cloud because the model behind it was downright enormous. Today I am running a similarly capable AI program on a MacBook Air, and it isn’t even warm. The shrinkage shows how rapidly researchers are refining AI models to make them leaner and more efficient. It also shows that going to ever larger scales isn’t the only way to make machines significantly smarter. The model now infusing my laptop with ChatGPT-like wit and wisdom is called Phi-3-mini. It’s part of a family of smaller AI models recently released by…
Most US TikTok Creators Don’t Think a Ban Will Happen

A majority of US TikTok creators don’t believe the platform will be banned within a year, and most haven’t seen brands they work for shift their marketing budgets away from the app, according to a new survey of people who earn money from posting content on TikTok, shared exclusively with WIRED. The findings suggest that TikTok’s influencer economy largely isn’t experiencing existential dread after Congress passed a law last month that put the future of the app’s US operations in jeopardy. The bill demands that TikTok separate from its Chinese parent company within a year or face a nationwide ban; TikTok…
Instruction Tuning OPT-125M

Large language models are pretrained on terabytes of text data. However, the pretraining objective only teaches the model to predict the next token or word, which is of limited use on its own: in practice, we want to accomplish tasks with the LLM, whether through chat or instructions. We achieve this by fine-tuning the model, a process generally called instruction tuning. To this end, in this article, we will use the OPT-125M model for instruction tuning. Figure 1. Output sample after instruction tuning OPT-125M on the Open Assistant Guanaco…
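As a rough illustration, here is a minimal sketch of instruction tuning facebook/opt-125m with Hugging Face transformers; the dataset ID, sequence length, and hyperparameters below are illustrative assumptions, not necessarily the article’s exact recipe.

```python
# Minimal sketch: instruction tuning facebook/opt-125m with Hugging Face
# transformers. Dataset ID and hyperparameters are illustrative assumptions.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "facebook/opt-125m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Open Assistant Guanaco stores each conversation as a single "text" field
# in the "### Human: ... ### Assistant: ..." format.
dataset = load_dataset("timdettmers/openassistant-guanaco", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="opt125m-instruct",
        per_device_train_batch_size=8,
        num_train_epochs=1,
        learning_rate=2e-5,
        logging_steps=50,
    ),
    train_dataset=tokenized,
    # Causal LM objective: the collator copies input ids into labels,
    # and the model shifts them internally for next-token prediction.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Note that this collator trains on the full “### Human: … ### Assistant: …” string; masking prompt tokens so only assistant responses contribute to the loss is a common refinement.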
Data Moats in Generative AI

The deep learning wave of the early 2010s led to a surge of data-hungry products. These products needed so much data that gathering it required significant investment. So the business community started honing the idea of data as a strategic asset and a business moat. As the Economist put it in a 2017 issue, “The world’s most valuable resource is no longer oil, but data.” This essay discusses data moats in today’s context of generative AI, which is driven by models that are exponentially more data-hungry. But first, what is a data moat? What is even an “AI product”? A data…
Accelerate Mixtral 8x7B pre-training with expert parallelism on Amazon SageMaker | Amazon Web Services

Mixture of Experts (MoE) architectures for large language models (LLMs) have recently gained popularity due to their ability to increase model capacity and computational efficiency compared to fully dense models. By utilizing sparse expert subnetworks that process different subsets of tokens, MoE models can effectively increase the number of parameters while requiring less computation per token during training and inference. This enables more cost-effective training of larger models within fixed compute budgets compared to dense architectures. Despite their computational benefits, training and fine-tuning large MoE models efficiently presents some challenges. MoE models can struggle with load balancing if the tokens…
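To make the routing idea concrete, here is a hedged PyTorch sketch of a sparse MoE layer with top-2 token routing; the module names and shapes are illustrative, not SageMaker’s or Mixtral’s actual implementation.

```python
# Illustrative sparse MoE layer with top-k token routing (not Mixtral's
# or SageMaker's implementation). Each token is processed by only top_k
# of n_experts expert MLPs, so parameter count grows with the number of
# experts while per-token compute stays roughly constant.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseMoE(nn.Module):
    def __init__(self, d_model: int, d_ff: int, n_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (tokens, d_model)
        logits = self.router(x)                         # (tokens, n_experts)
        weights, idx = logits.topk(self.top_k, dim=-1)  # per-token expert choice
        weights = F.softmax(weights, dim=-1)            # normalize over chosen experts
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            # Find which tokens routed any of their top_k slots to expert e.
            token_pos, slot = (idx == e).nonzero(as_tuple=True)
            if token_pos.numel():
                out[token_pos] += weights[token_pos, slot, None] * expert(x[token_pos])
        return out

# Example: 16 tokens of width 64 routed across 8 experts, 2 active per token.
y = SparseMoE(d_model=64, d_ff=256)(torch.randn(16, 64))
```

The load-balancing problem mentioned above arises when the router concentrates tokens on a few experts, leaving the rest idle; MoE training therefore typically adds an auxiliary balancing loss to keep expert utilization even.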
Study Finds That 52 Percent of ChatGPT Answers to Programming Questions Are Wrong

In recent years, computer programmers have flocked to chatbots like OpenAI’s ChatGPT to help them code, dealing a blow to places like Stack Overflow, which had to lay off nearly 30 percent of its staff last year. The only problem? A team of researchers from Purdue University presented research this month at the Computer-Human Interaction conference showing that 52 percent of programming answers generated by ChatGPT are incorrect. That’s a staggeringly large proportion for a program that people are relying on to be accurate and precise, underlining what other end users like writers and teachers are experiencing:…
Least-Squares Concept Erasure with Oracle Concept Labels

This post assumes some familiarity with the idea of concept erasure and our LEACE concept erasure method. We encourage the reader to consult our arXiv paper for background. For a PyTorch implementation of this method, see the OracleFitter class in our GitHub repository. WARNING: Because this erasure transformation depends on the ground truth concept label, it can increase the nonlinearly-extractable information about the target concept inside a representation, even though it eliminates the linearly available information. For this reason, optimizing deep neural networks on top of O-LEACE'd representations is not recommended; for those use cases we recommend vanilla LEACE. In…
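For intuition only, here is a generic NumPy sketch of least-squares erasure with oracle labels, implemented as residualizing the representation on the ground-truth concept label; it illustrates the idea of zeroing the feature–label cross-covariance and is not the OracleFitter API (consult the repository for the method’s actual implementation).

```python
# Generic sketch of oracle least-squares concept erasure: residualize the
# representation X on the ground-truth concept labels Z, which zeroes the
# cross-covariance Cov(X', Z) and hence all linearly extractable signal.
# This is an illustration of the idea, not the OracleFitter API.
import numpy as np

def oracle_erase(X: np.ndarray, Z: np.ndarray) -> np.ndarray:
    """X: (n, d) features; Z: (n, k) one-hot or real-valued concept labels."""
    Xc = X - X.mean(axis=0)
    Zc = Z - Z.mean(axis=0)
    # Least-squares coefficients of X on Z: argmin_W ||Xc - Zc W||_F^2.
    W, *_ = np.linalg.lstsq(Zc, Xc, rcond=None)
    return X - Zc @ W  # residual carries no linear information about Z

rng = np.random.default_rng(0)
Z = np.eye(2)[rng.integers(0, 2, size=500)]          # binary concept, one-hot
X = Z @ rng.normal(size=(2, 8)) + rng.normal(size=(500, 8))
Xe = oracle_erase(X, Z)
# Cross-covariance of erased features with the concept is ~0.
print(np.abs((Xe - Xe.mean(0)).T @ (Z - Z.mean(0))).max())
```

The residual has zero cross-covariance with Z, so no linear probe on the erased features can recover the concept; as the warning above notes, nonlinear probes may still succeed.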
Sigma Secures $200M Round to Advance Its BI and Analytics Solutions

Sigma Computing, a cloud-based analytics solutions provider, has raised $200 million in Series D funding to further its efforts to broaden BI use within organizations by enabling users to query and analyze data without writing code. The latest round of funding takes the vendor’s total funding to $581.3 million, with a valuation estimated at around $1.5 billion, a staggering rise of 60% since the last funding round in 2021. The steep rise in valuation is partially a result of rising demand for greater productivity and monetization in the era of cloud data transition. Spark Capital and Avenir…