One approach we have used with good results is Retrieval-Augmented Generation (RAG) together with the language model; that way you can contextualize the model and use your own data as the source for its output.
Yes, as mentioned, we have tried that here as well, but were not very impressed with the result.
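For illustration, here is a minimal sketch of the RAG pattern described above: retrieve the snippets from your own documents that best match the question and put them into the prompt before calling the model. The toy keyword retriever and the `call_llm` placeholder are assumptions for the example, not a specific library or the setup used in the posts above.

```python
# Minimal RAG sketch: retrieve the most relevant snippets from your own
# documents and prepend them to the prompt so the model answers from that
# context instead of from memory alone.
from collections import Counter
import math


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words term-count vectors."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Return the k documents most similar to the query (toy keyword retriever)."""
    q_vec = Counter(query.lower().split())
    ranked = sorted(
        documents,
        key=lambda d: cosine_similarity(q_vec, Counter(d.lower().split())),
        reverse=True,
    )
    return ranked[:k]


def build_rag_prompt(query: str, documents: list[str]) -> str:
    """Augment the user question with the retrieved context."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents))
    return (
        "Answer the question using only the context below. "
        "If the context does not contain the answer, say so.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


# Usage: pass the prompt to whatever LLM client you use, e.g.
# answer = call_llm(build_rag_prompt("What is our refund policy?", company_docs))
```

In practice you would swap the keyword retriever for embeddings and a vector store, but the structure (retrieve, build prompt, generate) stays the same.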
The use of RAG does not completely eliminate the general challenges faced by LLMs, including hallucination.
For example, a chatbot powered by large language models (LLMs), such as ChatGPT, may embed plausible-sounding random falsehoods within its generated content. Researchers have recognized this issue, and by 2023 analysts estimated that chatbots hallucinate as much as 27% of the time,[8] with factual errors present in 46% of generated texts.[9] Detecting and mitigating these hallucinations poses significant challenges for the practical deployment and reliability of LLMs in real-world scenarios.[10][8][9]
Further reading: The hallucination phenomenon is still not completely understood. Researchers have also proposed that hallucinations are inevitable and an innate limitation of large language models.[73] Therefore, there is still ongoing research to try to mitigate their occurrence.[74] In particular, it has been shown that language models not only hallucinate but also amplify hallucinations, even models that were designed to alleviate this issue.[75]
Ji et al.[76] divide common mitigation methods into two categories: data-related methods, and modeling and inference methods. Data-related methods include building a faithful dataset, cleaning data automatically, and information augmentation, i.e. augmenting the inputs with external information. Modeling and inference methods include changes to the architecture (modifying the encoder, the attention mechanism, or the decoder in various ways), changes to the training process, such as using reinforcement learning, and post-processing methods that can correct hallucinations in the output.
Researchers have proposed a variety of mitigation measures, including getting different chatbots to debate one another until they reach consensus on an answer.[77] Another approach actively validates low-confidence generations of the model against web search results: it has been shown that a sentence is hallucinated more often when the model has already hallucinated in its previously generated sentences for the same input, and the model is instructed to create a validation question that checks the correctness of the information about the selected concept via the Bing Search API.[78] An extra layer of logic-based rules has been proposed on top of this web-search mitigation, using different ranks of web pages as a hierarchical knowledge base.[79] When no external data sources are available to validate LLM-generated responses (or the responses are already based on external data, as in RAG), model uncertainty estimation techniques from machine learning may be applied to detect hallucinations.[80]
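As a concrete illustration of the uncertainty-estimation idea at the end, one simple sketch is self-consistency sampling: ask the model the same question several times at a higher temperature and flag the answer when the samples disagree. The `generate` callable, the sample count, and the threshold below are illustrative assumptions, not the specific method from [80].

```python
# Rough self-consistency check: sample the same question several times and
# flag likely hallucination when the answers disagree. The `generate` argument
# stands in for any LLM call that accepts a prompt and a temperature.
from collections import Counter
from typing import Callable


def agreement_score(answers: list[str]) -> float:
    """Fraction of samples that match the most common (normalized) answer."""
    normalized = [a.strip().lower() for a in answers]
    most_common_count = Counter(normalized).most_common(1)[0][1]
    return most_common_count / len(normalized)


def flag_if_uncertain(
    prompt: str,
    generate: Callable[[str, float], str],
    n_samples: int = 5,
    threshold: float = 0.6,
) -> tuple[str, bool]:
    """Return (answer, flagged). flagged=True means low agreement between
    samples, so the answer should be verified against an external source."""
    samples = [generate(prompt, 0.8) for _ in range(n_samples)]
    flagged = agreement_score(samples) < threshold
    # Return the most common sample as the answer.
    best = Counter(s.strip() for s in samples).most_common(1)[0][0]
    return best, flagged
```

This only measures how consistent the model is with itself, so it catches unstable answers rather than confidently repeated errors; for those, external validation such as the web-search approach above is still needed.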