
Posts: 23
Comments: 229
Joined: 2 yr. ago

  • I see. Mistral was the favorite in self-hosted LLM circles back in 2023-2024 but general opinion is that they have since been far surpassed by Chinese and American models, hence my question.

    Good to know they've found a market with their online offering.

  • Is there a use case where Mistral still beats Qwen or Gemma? If you're using Mistral, which model and what do you use it for?

  • Reinforcement learning makes the model better over time, so why should there be fewer and fewer good results?

    If you're talking about the rate of improvement going down, then yes, of course. That's bound to happen (unless you have an actual intelligence explosion, but in that case you won't know what "good results" even mean anyway).

  • Yes :)

  • No one feeds random LLM output straight back though. The whole idea of reinforcement learning is you take some ML model output, check if it is good, and push the model in that direction if it is good.

    As long as you believe that e.g. it's easier to verify a mathematical result than to come up with one, then RL should work.

  • You took those quotes wildly out of context. Of course there is a hard limit on how much information can be extracted from data. Clever processing won't break that limit. But only in basic cases have we seen proofs that certain statistical inference methods make optimal use of the data. In complicated systems like neural nets it is basically impossible to prove such optimality. In fact the models are almost definitely not using the data optimally. Processing can help. A lot.

  • How does the crosspost listing on Lemmy work?

    The original post (just 10 minutes earlier) is here but is not shown in the list of crossposts for this post. Why?

  • On a separate note, I hope the author of this post and his colleagues are safe. His university got bombed by America and Israel earlier this month.

  • This is great! The value of having your computational pipeline nicely debuggable and visualizable cannot be overstated.

    The float32 limitation sounds a bit arbitrary. Would be cool if Blender allowed float64 in geo nodes in the future.

  • My not very confident guess is that it's just to label what kind of galaxies are observed by the instrument at that range. Really not sure about the less dense ring in the outer part though.

  • what. there definitely are differences between the universe today and the universe billions of years ago.

  • the latter. the map looks different further away from the center of the circle because further away = earlier time. if they attempted to compensate for how things far away have changed since the light was emitted, the map would look uniform.

  • hmm brb imma go invent a refrigeration cycle that runs 15C <-> 250C

  • startup idea: fridge with warm little nook for dog

  • i want that too. lots of houses already have hot and cold water lines so it shouldn't be too hard. problem is getting appliances to adopt a standard on how to connect to this network

  • that's the joke. i tried to imply it in the title but i didn't realize that in english you call it 2nd law of thermodynamics rather than 2nd rule

  • a heat pump oven sounds like an actually cool idea. why is it not a thing yet?

  • if you're already heating your home, then what does it hurt to have the fridge do a bit more of that?

    in fact, the fridge is a tiny heat pump using your food as the reservoir. so unless your house is heat pump equipped, it is beneficial energy wise to keep the fridge inside.

    if your house is heat pump equipped, then it depends on how the efficiencies compare. if you put lots of hot food into your fridge then you should definitely probably keep it inside.

  • 196 @lemmy.blahaj.zone

    2nd rule

  • Memes @lemmy.ml

    "content curation"

  • Science @mander.xyz

    Private donors pledge 860M EUR for CERN's Future Circular Collider

    home.cern/news/press-release/cern/private-donors-pledge-860-million-euros-cerns-future-circular-collider
  • Physics @mander.xyz

    Private donors pledge 860M EUR for CERN's Future Circular Collider

    home.cern/news/press-release/cern/private-donors-pledge-860-million-euros-cerns-future-circular-collider
  • Mander @mander.xyz

    Performance issues?

  • Technology @lemmy.zip

    So You Think You've Awoken ChatGPT...

    www.lesswrong.com/posts/2pkNCvBtK6G6FKoNn/so-you-think-you-ve-awoken-chatgpt
  • Math Memes @lemmy.blahaj.zone

    Are you in the 95%?

  • Memes @lemmy.ml

    someone teach them LaTeX

  • Science Memes @mander.xyz

    fruit flies are not ergodic

  • [CLOSED] FediLore + Fedidrama @lemmy.ca

    Hexbear is back? What's the lore?

  • Astronomy @mander.xyz

    Euclid discovers a stunning Einstein ring

    www.esa.int/Science_Exploration/Space_Science/Euclid/Euclid_discovers_a_stunning_Einstein_ring
  • Memes @lemmy.ml

    I love open-weight models, especially when they steal from proprietary models (OC)

  • Science Memes @mander.xyz

    vibes-based astrophysics

  • LocalLLaMA @sh.itjust.works

    New open-weight 🐋 DeepSeek V3. 685B MoE. Beats Claude 3.5 Sonnet on Aider coding benchmark

    huggingface.co/deepseek-ai/DeepSeek-V3
  • 196 @lemmy.blahaj.zone

    "blahaj" is pronounced "blo-hai" rule

  • Science Memes @mander.xyz

    ur dada so buff he falls significantly faster than g

  • Science Memes @mander.xyz

    they tricked us

  • Science Memes @mander.xyz

    👁️ 🌹 💨 💨

  • Astronomy @mander.xyz

    25 Images for Chandra's 25th

    chandra.harvard.edu/photo/2024/25th/
  • Science Memes @mander.xyz

    the final boss after you clear Donald Knuth