• 0 Posts
  • 16 Comments
Joined 1 year ago
cake
Cake day: July 3rd, 2023

help-circle





  • The original paper itself, for those who are interested.

    Overall, this is really interesting research and a really good “first step.” I will be interested to see if this can be replicated on other models. One thing that really stood out, though, was that certain details are obfuscated because of Sonnet being proprietary. Hopefully follow-on work is done on one of the open source models to confirm the method.

    One of the notable limitations is quantifying activation’s correlation to text meaning, which will make any sort of controls difficult. Sure, you can just massively increase or decrease a weight, and for some things that will be fine, but for real manual fine tuning, that will prove to be a difficulty.

    I suspect this method is likely generalizable (maybe with some tweaks?), and I’d really be interested to see how this type of analysis could be done on other neural networks.





  • It’s not just convenient for them to do it; it’s how they are able to evade anti-trust action (not that the U.S. is great at it anyway but still). I also run my own mail server. It’s not impossible, and I wouldn’t even say it’s even hard. It’s just time consuming to set up (if it’s the first time), and there are a lot of hurdles to make it so impractical that it’s virtually impossible to the average person. Only the most patient or those who have a real desire to run their own mail server will even attempt it. Anyone can set up their own mail server, but most won’t because it’s not worth it compared to using something that just works from Google.



  • I think what they’re saying is that Americans don’t pay attention and forgot how terrible the Trump presidency was because it’s been a few years. Most people think that “we’re better now” and any major issues have abated without understanding that nothing has fundamentally changed. Because of all that, Trump will win the election. The DnD portion of the post is just what got OP to think about this.

    Sad thing is that there’s merit to the argument. It’s the old trope of “Americans have short memories.”


  • I’m all about this. When I made my personal webpage, this is how I do it. I’m surprised it’s not more popular (at least for certain things) because it looks nice and clean, is fast, and crucially, is easy to put together. Most webpages don’t need a ton of JS to “accomplish the mission.” I get that not everything can do this, but there are soooooo many sites that can strip down to a more minimal site and have better functionality and a better experience. This is a case of less-is-more.


  • This is a much better article. OP’s article just shows the author’s surface understanding of how coding works and how well an LLM can actually code. There’s way more that goes into a programming task than just coding.

    I see LLMs as having the potential of being almost like a super library. I can prompt GPT, Claude, etc. to write me a custom function that I copy, paste, test, scrutinize, and almost certainly change. It’s a tool that will make someone a more productive programmer. It won’t completely subsume a human’s ability to be creative and put the pieces together.

    At the absolute worst over the next decade, I could see programming changing from writing and debugging code to prompting, stitching together, and debugging.


  • Yeah, you’re right on a lot of chatbots just being paraphrased responses from the support database, but for a lot of people, that’s all they want or need. There are a great number of people who just don’t want to read the entire article to find their answer. For that, I don’t really mind chatbots because I get the use case. What I hate is when there isn’t an option to go to the next tier of support without going in circles forever with the stupid bot.