I was trying to have some insightful discussion on the actual capability of LLM which is difficult when the involvement of the human element is played down amd the role of the LLM is played up to feed the hype machine. It’s hard to acknowledge the real capabilities and weaknesses when the capabilities are always over reported and the weaknesses down played or denied.
It’s great that so many bugs are getting discovered but as I say there is no reporting on what effort was needed to sift and review the LLM output or how functional or understandable any PoC were… The article doesn’t directly even state the PoC were directly produced by the LLM and reads very ambigously.
I think some of that is because the reporting is focused on the new stuff, that was previously not possible. That human work is involved and some of the weaknesses are not really new. But also because the information in this case comes from a company that wants to sell their AI. I agree that the reporting is probably biased and not really sharp and therefore limited in usefulness.
Also, my (second) comment was not specifically about your comment but generally about the “vibe” of this community
I was trying to have some insightful discussion on the actual capability of LLM which is difficult when the involvement of the human element is played down amd the role of the LLM is played up to feed the hype machine. It’s hard to acknowledge the real capabilities and weaknesses when the capabilities are always over reported and the weaknesses down played or denied.
It’s great that so many bugs are getting discovered but as I say there is no reporting on what effort was needed to sift and review the LLM output or how functional or understandable any PoC were… The article doesn’t directly even state the PoC were directly produced by the LLM and reads very ambigously.
I think some of that is because the reporting is focused on the new stuff, that was previously not possible. That human work is involved and some of the weaknesses are not really new. But also because the information in this case comes from a company that wants to sell their AI. I agree that the reporting is probably biased and not really sharp and therefore limited in usefulness.
Also, my (second) comment was not specifically about your comment but generally about the “vibe” of this community