noneabove1182@sh.itjust.worksMEnglish · 1 year agoBeginner questions threadplus-squarepinmessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareBeginner questions threadplus-squarepinnoneabove1182@sh.itjust.worksMEnglish · 1 year agomessage-square0fedilink
llama@lemmy.dbzer0.comEnglish · edit-210 days ago How to run LLaMA (and other LLMs) on Android.plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-square How to run LLaMA (and other LLMs) on Android.plus-squarellama@lemmy.dbzer0.comEnglish · edit-210 days agomessage-square0fedilink
OmegaLemmy@discuss.onlineEnglish · 13 days agoWhat is a good model that runs on 6GB Vram?plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareWhat is a good model that runs on 6GB Vram?plus-squareOmegaLemmy@discuss.onlineEnglish · 13 days agomessage-square0fedilink
artificialfish@programming.devEnglish · 14 days agoHas anyone applied tree of thought prompting to r1 yet?plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareHas anyone applied tree of thought prompting to r1 yet?plus-squareartificialfish@programming.devEnglish · 14 days agomessage-square0fedilink
ikt@aussie.zoneEnglish · 14 days agoMistral Small 3 (24B) releasedplus-squaremistral.aiexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkMistral Small 3 (24B) releasedplus-squaremistral.aiikt@aussie.zoneEnglish · 14 days agomessage-square0fedilink
ikt@aussie.zoneEnglish · 16 days agoDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkDid DeepSeek R1 just pop nvidias bubble?plus-squarewww.youtube.comikt@aussie.zoneEnglish · 16 days agomessage-square0fedilink
SmokeyDope@lemmy.worldEnglish · edit-218 days agoWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldimagemessage-square0fedilinkarrow-up11
arrow-up11imageWhy llms are suprisingly good at math, and what it means to process language.plus-squarelemmy.worldSmokeyDope@lemmy.worldEnglish · edit-218 days agomessage-square0fedilink
SmokeyDope@lemmy.worldEnglish · edit-220 days agoThoughts on new deepseek R1 distill modelsplus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareThoughts on new deepseek R1 distill modelsplus-squareSmokeyDope@lemmy.worldEnglish · edit-220 days agomessage-square0fedilink
brokenlcd@feddit.itEnglish · 1 month agounsure on how to quantize modelplus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareunsure on how to quantize modelplus-squarebrokenlcd@feddit.itEnglish · 1 month agomessage-square0fedilink
🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.eeEnglish · 1 month agoHow much gpu do i need to run a 90b modelplus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareHow much gpu do i need to run a 90b modelplus-square🇦🇺𝕄𝕦𝕟𝕥𝕖𝕕𝕔𝕣𝕠𝕔𝕕𝕚𝕝𝕖@lemm.eeEnglish · 1 month agomessage-square0fedilink
SmokeyDope@lemmy.worldEnglish · 1 month agoNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldimagemessage-square0fedilinkarrow-up11
arrow-up11imageNvidia Digits AI Supercomputer just announcedplus-squarelemmy.worldSmokeyDope@lemmy.worldEnglish · 1 month agomessage-square0fedilink
Halo@lemmy.worldEnglish · 1 month agoGo toolchain error - Does anyone know what's going on here? lemmy.worldimagemessage-square0fedilinkarrow-up11
arrow-up11imageGo toolchain error - Does anyone know what's going on here? lemmy.worldHalo@lemmy.worldEnglish · 1 month agomessage-square0fedilink
hendrik@palaver.p3x.deEnglish · edit-22 months ago(New) papers by Meta: Large Concept Models and BLTplus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-square(New) papers by Meta: Large Concept Models and BLTplus-squarehendrik@palaver.p3x.deEnglish · edit-22 months agomessage-square0fedilink
BB84@mander.xyzEnglish · edit-22 months agoNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkNew open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmarkplus-squarehuggingface.coBB84@mander.xyzEnglish · edit-22 months agomessage-square0fedilink
hok@lemmy.dbzer0.comEnglish · edit-22 months agoCan you fine-tune on localized steering of an LLM?plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareCan you fine-tune on localized steering of an LLM?plus-squarehok@lemmy.dbzer0.comEnglish · edit-22 months agomessage-square0fedilink
sith@lemmy.zipEnglish · 2 months agoQuestions about HW for local LLM.plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareQuestions about HW for local LLM.plus-squaresith@lemmy.zipEnglish · 2 months agomessage-square0fedilink
HumanPerson@sh.itjust.worksEnglish · edit-22 months agoFixed itplus-squaresh.itjust.worksimagemessage-square0fedilinkarrow-up11
arrow-up11imageFixed itplus-squaresh.itjust.worksHumanPerson@sh.itjust.worksEnglish · edit-22 months agomessage-square0fedilink
hok@lemmy.dbzer0.comEnglish · 2 months agoLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareLlama 3.3 70b - End of open-weight pretrained models from Meta or just a better Llama 3.1 405b finetune?plus-squarehok@lemmy.dbzer0.comEnglish · 2 months agomessage-square0fedilink
projectmoon@lemm.eeEnglish · edit-22 months agoOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comexternal-linkmessage-square0fedilinkarrow-up11
arrow-up11external-linkOpenWebUI OpenStreetMap Tool 2.1.0plus-squareopenwebui.comprojectmoon@lemm.eeEnglish · edit-22 months agomessage-square0fedilink
lynx@sh.itjust.worksEnglish · 3 months agoQwen2.5-Coder-7Bplus-squaremessage-squaremessage-square0fedilinkarrow-up11
arrow-up11message-squareQwen2.5-Coder-7Bplus-squarelynx@sh.itjust.worksEnglish · 3 months agomessage-square0fedilink