why reddit? · acec@lemmy.world · English · 11 months ago · 2 comments · 2 upvotes, 0 downvotes
Phind V7 subjectively performing at GPT4 levels for coding (news.ycombinator.com) · noneabove1182@sh.itjust.works (mod) · English · 11 months ago · 0 comments · 1 upvote, 0 downvotes
Min P sampler (an alternative to Top K/Top P) has been merged into llama.cpp (github.com) · noneabove1182@sh.itjust.works (mod) · English · 11 months ago · 0 comments · 1 upvote, 0 downvotes
HUGE dataset released for open source use (together.ai) · noneabove1182@sh.itjust.works (mod) · English · 11 months ago · 0 comments · 1 upvote, 0 downvotes
I've started uploading quants of exllama v2 models, taking requests (huggingface.co) · noneabove1182@sh.itjust.works (mod) · English · 11 months ago (edited) · 0 comments · 1 upvote, 0 downvotes
Nearly 10% of people ask AI chatbots for explicit content. Will it lead LLMs astray? [Article from October 3] (www.zdnet.com) · rufus@discuss.tchncs.de · English · 11 months ago (edited) · 0 comments · 1 upvote, 0 downvotes
Text Generation Web-UI has been updated to CUDA 12.1, and with it new docker images are needed · noneabove1182@sh.itjust.works (mod) · English · 11 months ago · 0 comments · 1 upvote, 0 downvotes
Single Digit tokenization improves LLM math abilities by up to 70x (twitter.com) · noneabove1182@sh.itjust.works (mod) · English · 11 months ago · 0 comments · 1 upvote, 0 downvotes
Musical notation · SnokenKeekaGuard@lemmy.dbzer0.com · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Are Local LLMs Useful in Incident Response? - SANS Internet Storm Center (isc.sans.edu) · ylai@lemmy.ml · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Dolphin 2.0 based on mistral-7b released by Eric Hartford (huggingface.co) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago (edited) · 0 comments · 1 upvote, 0 downvotes
Beginner questions thread · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Mistral 7B model (mistral.ai) · rufus@discuss.tchncs.de · English · 11 months ago (edited) · 0 comments · 1 upvote, 0 downvotes
Microsoft's latest LLM agent: autogen (microsoft.github.io) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models (arxiv.org) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Effective Long-Context Scaling of Foundation Models | Research - AI at Meta (ai.meta.com) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Jeremy Howard: A Hackers' Guide to Language Models (youtu.be) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Amazon investing in Anthropic - Expanding access to safer AI with Amazon (www.anthropic.com) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Very interesting thread about reversal knowledge (twitter.com) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes
Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding (arxiv.org) · noneabove1182@sh.itjust.works (mod) · English · 1 year ago · 0 comments · 1 upvote, 0 downvotes