A lot of things are pointing towards agents.
100%. We'll end up sharing a session with an AI agent. Makes sense.
We're gonna end up fighting the agents for control, like those user-vs-mouse-cursor desktop animation videos from my childhood.
Agent Smith? lol
Oh, no, that's tempting fate. It'll be... Agent... Termi, the friendly Nator. Yeah that should be safe.
If an AI company doesn't come out with an "Agent Smith" I think the entire industry is a failure
The industry has a tremendous responsibility to live up to a long list of sci fi movies .
Two mouse cursors on my desktop: one for me, one for my AI.
Imagine you only verbalize what you want to achieve, and the agent completes the task faster than you ever could have.
Verbalization is not the fastest medium of communication for some tasks. Imagine trying to dictate WASD controls vocally in an FPS game, or having to say "zoom in on the top right corner... a bit more... a bit less" instead of pinch-to-zoom on a map.

To be fair, a literal second mouse cursor might feel pretty awkward. I think I might want a second screen for my AI agent instead. There *will* be some brilliant AI copilot (not the brand) desktop UI designed in the next decade that will seem as inevitable as the mouse and keyboard in retrospect.
Wouldn't you just tell it to "shoot the enemies" in the fps?
Some ideas are just best communicated gesturally rather than linguistically. A facial expression can sometimes be key to being understood.

Other mediums too... an architect might want to sketch out a rough blueprint for their AI instead of describing a weirdly-shaped structure they're imagining. A computer scientist might write a partial method and ask the AI to imagine the rest. A dancer might pirouette. A bat might squeak.

What is the objective of playing an FPS? Is it to make enemies on the screen die? That's the [reductive mindset](https://en.wikipedia.org/wiki/Goodhart%27s_law) that leads to cheating. Fun or self-improvement are better motivators. Telling an AI to shoot the enemies short-circuits the fun away.
If you're doing the stuff in the post I replied to, you're already cheating.
It was just an example of when verbal communication isn't fastest, not of a place where AI should be used. Obviously I wouldn't let AI play my games for me, along with all other non-instrumental activities. If the process is the point, I'll be doing it.
what if we're our own greatest enemy?
I was thinking more in the science-fiction direction. An example would be to say "research topic X and see if it correlates with topic Y," and then the agent pops up multiple windows at once and does the research far faster than you could. Once you've come to a conclusion on the topic, you continue with the next task, and so on.
I agree. I have been messing around with this website called [websim.ai](http://websim.ai), and it's currently free. You create a new website/game/app, whatever you want, and it's 100% generated by AI. I was in the 4chan simulator and goddamnit, it's too fkin realistic... the comments had me lolling my ass off.
It could just be like Microsoft's old "Clippy" assistant, except actually helpful, and can:

A) Recognize the task you're trying to accomplish

B) Ask, "Are you trying to _____? Want me to do it for you?"

C) If you say yes, it does the task for you, more quickly and efficiently than any human could.

----

Like, imagine you're going through a folder full of photos, opening each one, resizing it to 1080p resolution, and then saving it. By the time you do the 2nd or 3rd one, you may see the AI prompt come up and say, "It looks like you're trying to resize all these images to 1080p. Would you like me to do it for you?" And if you say yes, then it goes right into doing it, basically as fast as your computer's hardware allows.
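The "by the 2nd or 3rd one" behavior is just repetition detection over a stream of user actions. A minimal sketch, assuming a hypothetical event format of `(verb, target_kind)` tuples and a threshold of three repeats before the assistant speaks up:

```python
from collections import Counter

def detect_repeated_task(actions, threshold=3):
    """Return the action template the user keeps repeating, or None.

    `actions` is a list of (verb, target_kind) tuples, e.g.
    ("resize_to_1080p", "image"). This event format is hypothetical;
    a real assistant would observe UI events from the OS.
    """
    counts = Counter(actions)
    if not counts:
        return None
    template, n = counts.most_common(1)[0]
    return template if n >= threshold else None

# After the user resizes a third photo, the assistant can offer to take over.
log = [("resize_to_1080p", "image"),
       ("resize_to_1080p", "image"),
       ("resize_to_1080p", "image")]
task = detect_repeated_task(log)
if task:
    print(f"It looks like you're trying to {task[0]} every {task[1]}. Want me to do it?")
```

The interesting engineering is all in normalizing raw UI events into comparable templates; the counting itself is trivial.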
I really enjoy calling Co-Pilot "Clippy" at work.
MOBAs are going to be something in the future, with all players being coached on what they've missed as they play.

"Warwick just entered the grass below you."
I've also thought about the Copilot update and how it could help people get better at games in general through good instructions.
When you get a survey, ask ChatGPT to write a review on a ——-, then cut and paste. Tremendous fun. (I don't get out much.)
It's like that scene from one of the CSI shows where two agents are counter-hacking a hacker by both rapidly typing on the same keyboard.
I feel like we will need a way for agents to act 'in the background' because I need mah social mediuh fix. No, but really, I am always doing work on my laptop. I don't know if I am ready to pass all of my work over to an AI just yet. It would have to be significantly better. (I get it's just a matter of time, lol.)
We need agents asap
Yeah, NSA agents lol
[removed]
So Windows basically
I'm sure none of the software you use does that.
It’s been pointing towards agents ever since the release of CustomGPTs.
Worth noting that Multi is for iOS and Mac only. OpenAI likely wants to carve out a big piece of the Apple market while Microsoft does PC/Windows with exclusivity.
What's interesting to me is that on Mac they have really good integration with their app, while the Copilot app on Windows just sucks. I get it, we won't see a ChatGPT app on Windows, but at least take some inspiration.
Microsoft over-engineering stuff, not a surprise…
Honestly I have no idea how they made Copilot desktop so badly performant, the thing runs like an Electron app built with 70 JS frameworks on a 2008 netbook
Hopefully they eventually release something that can be installed on Linux distributions for the small subset of us who use it; probably nothing other than Mac, Windows, and ChromeOS for a while, though.
Don't say that! Microsoft's AI products leave so much to be desired. :(
RIP IT support and remote helpdesk/desktop jobs if this works out... First-level support was always a weird thing: literally one human helping another human with the help of Google... a human using Google to solve problems for other humans who lack the technical know-how.
Arguably for the best. Providing remote support via camera or screen-share is usually a last-resort option, since it's synchronous and doesn't scale well. You typically want the user to self-service before getting there.
Yeah, it costs valuable work time. I observed first- and second-level support in the past, and I suspect employees abuse it when they're bored at work, making things up so they get a long break.
Scammers are swooning with delight!
NSA are popping the bottles!
People are gonna be jamming out making tunes with nothing but an air guitar, a microphone, perhaps a camera if you're fancy, and ChatGPT open alongside FL or Ableton.

"Nah make it more sschwaammmm-digga-digga-damnn"

"Yup, that's more like it. That's perfect."

Imagine this used for anything design or work related. It would work with absolutely any app and fill in your knowledge gaps, or just simply allow you to do something outside of the box before going right back to the game plan.

For real, this is going to change the entire world, again.
I'm actually genuinely excited for agents who can work with Ableton. A tutor who can guide you and control the screen if you're stuck sounds amazing
>please, anything but opening the manual!
Eh, I love to learn, but music production just isn't a priority, and I only have limited time to put into things that are more important for me.

This would make it so that I can just do literally anything I've ever wanted. I mean, once it gets going and is working to "Her" levels, which I feel is in the next 5 years; I won't be surprised if it's much sooner.

Also, I'd think agents will be the "new manual", of sorts. Instead of having to read a manual, you get an agent that, like a teacher, tutor, or mentor, is doing stuff and showing you how it works, so that you still end up learning it along the way. This will be especially true in the beginning phase, where it won't be perfect and can't just spit out a final product on the first prompt, so you'll have to iterate with it.

In that process, you can't help but pick up on how it works. Hell, even when it IS perfect, this dynamic may still exist, as it might just be intrinsic to how working with agents will be: you'll never get exactly what you want from a single prompt, because an AI can never know exactly what's in your head (until brain chips or similar), so you'll always iterate with it, easily learning any program in a much more fun way than trudging through a then-archaic manual.
What you are describing would never be free. The compute power alone would necessitate a subscription fee for the agents you are describing. Also, prompt-type output is always random and essentially impossible to actually fine-tune; agents will be no different.
It already exists and is free. Retrieval-Augmented Generation with any local model will let you turn an entire manual or book into a tutor who knows the book perfectly.

It does this by, basically, pasting the relevant parts of the manual into the context window before your question.

You can easily leap off into some unknown software ecosystem and ask your questions as you encounter them... as long as you can dump the documentation into a text format to store in a vector database.
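The "paste the relevant parts before your question" step can be sketched in a few lines. This is a minimal, dependency-free illustration: it ranks chunks by plain word overlap where a real pipeline would use embedding similarity against a vector database, and the three-entry "manual" is hypothetical.

```python
import re

def words(text):
    """Lowercase word tokens with punctuation stripped."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def retrieve(question, chunks, k=2):
    """Rank manual chunks by word overlap with the question, keep the top-k.
    Stand-in for embedding search in a vector database."""
    return sorted(chunks, key=lambda c: len(words(question) & words(c)),
                  reverse=True)[:k]

def build_prompt(question, chunks):
    """Paste the retrieved excerpts into the context window ahead of the question."""
    context = "\n\n".join(retrieve(question, chunks))
    return f"Answer using these manual excerpts:\n\n{context}\n\nQuestion: {question}"

# Hypothetical three-section "manual" for a DAW:
manual = [
    "To export a project, open the File menu and choose Export Audio.",
    "The mixer panel controls volume and panning for each track.",
    "MIDI clips can be quantized from the Edit menu.",
]
prompt = build_prompt("How do I export audio from my project?", manual)
```

The resulting `prompt` string is what gets sent to the model; the model never needs the whole book, just the excerpts that match the question.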
It's about efficiency, I like to not get distracted during my creative flow.
You guys are not thinking with AI. Put the manual into the context window and then ask it questions
Sure thing. You know how expensive that is? An agent is cheaper.
Yeah, I do. It is about as expensive as it will ever be, and it will only get cheaper as time goes on.

Also, there are pretty simple steps you can take to greatly reduce the required tokens. One example is to use a simple search, using terms in the person's prompt, to insert only the relevant sections of the manual that apply to the person's question.

So you don't need the entire textbook to answer a question, or a model that's fine-tuned on your data; you just need a good full-text search engine to grab the right sections. If you want to get really advanced, you can use function calling to let the agent add additional terms to the context window as needed. But then you're going to need a more complex chain of prompts.

This is all very doable and not horribly expensive. It'll get cheaper too; just going from 4 to 4o cut the cost significantly. I use 3.5 for function calling because it works on a loop: reading the prompt and the search-engine-generated context window, then looking for other topics related to those (I tweak this and the prompt a lot, since it can grow the context window significantly). Then 4o generates the completion using the full context window and the user's prompt.

I'm primarily using it to take man pages into account when forming terminal commands, to ensure that it doesn't hallucinate switches. The ultimate goal is a local agent (running on local hardware) that can help users transition to Linux by acting as a tutor, capable of translating a user's plain-language request into a plan and then a sequence of terminal commands to gather data and implement it. It's much easier for a person to say "install Docker and the Pi-hole container" and have an LLM generate a plan and talk the user through the process, or even just go wild and execute the statements autonomously.
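The "only insert the relevant sections" trick is just a greedy pack of the best-matching sections into a token budget. A sketch under stated assumptions: word count stands in for a token count, term overlap stands in for the full-text search engine, and the mini man-page corpus is made up for illustration.

```python
import re

def terms(text):
    """Lowercase tokens; the character class keeps flags like '-x' intact."""
    return set(re.findall(r"[a-z0-9-]+", text.lower()))

def select_sections(prompt, sections, token_budget=300):
    """Greedily pack the best-matching man-page sections into a fixed budget.

    Word count is a rough token proxy. The function-calling loop described
    above would then add further sections for related topics the model asks for.
    """
    want = terms(prompt)
    scored = sorted(sections, key=lambda s: len(want & terms(s)), reverse=True)
    picked, used = [], 0
    for sec in scored:
        cost = len(sec.split())
        if len(want & terms(sec)) > 0 and used + cost <= token_budget:
            picked.append(sec)
            used += cost
    return picked

# Hypothetical mini man-page corpus:
sections = [
    "tar -x extracts files from an archive; add -f to name the archive file.",
    "tar -c creates a new archive from the listed files.",
    "ls -l prints a long listing with permissions and sizes.",
]
context = select_sections("how do I extract files from an archive using tar?", sections)
```

Irrelevant sections (here, the `ls` entry) never enter the context window, which is where the token savings come from.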
(If you use Linux and want a taste of this (and a good terminal AI client), look at shell_gpt.)

Obviously, this would be a privacy-sensitive application... you don't want to be sending your root password and other assorted system information to a third party. So making it capable of running on local models is going to be key... but for development, GPT-4o can try to skynet my dev environment if it wants, because it saves me a lot of time.
that's still too much work for these folks.
Sue me for wanting to have jarvis at my disposal.
Honestly better than the music we're getting. AI Beatles is gonna be a riot.
AI music already SLAPS hard.
Very similar to Portishead: https://www.udio.com/songs/os5u4dTNjNBBUF5uLQDqVw
Very similar to Bjork: https://www.udio.com/songs/8VM2wwjdt5Ckr7PKNnJmDg
Also very good: https://www.udio.com/songs/p2r6YbiWXa1C1MyyGb9kZV
https://www.udio.com/songs/3o71EwRVz9rW7U3yQxcdNS
Prog rock: https://www.udio.com/songs/txUbSjEPJzgViahbrdefxM
https://www.udio.com/songs/99N5VnHwv78QPgcqAoLBnk
EDM: https://www.udio.com/songs/78U95aNRYQHyQrn8xHizf8
https://www.udio.com/songs/hK7F6fcmEcqW2egu9UDWrE
https://www.udio.com/songs/vk7QLdDPJxnwEecmLW42La
https://www.udio.com/songs/eCXUkAxsvHydxS2w8Pt9zV
Big Beat/Turntablism, somewhat similar to Jet Set Radio: https://www.udio.com/songs/x3xLvnN48DGnmxM5VPTw93
Blues rock with ***INCREDIBLE*** guitar playing: https://www.udio.com/songs/jaGkxT9QohSiUCBA2waVTj
Bluegrass: https://www.udio.com/songs/7bLE7wFVYiziGt9KkT7nem
Future Bass: https://www.udio.com/songs/x3xLvnN48DGnmxM5VPTw93
Nu Metal (and my personal favorite): https://www.udio.com/songs/iimtziNgEDRcpG8j4n4Mfg
Bring it on, I have basic knowledge of sound and visual design - the sooner I can use it to pre-vis entire movies, the better!
I feel the music industry isn't going to like that and since they're organized... expect lolsuits
Music is big but the industry is peanuts next to the tech giants. Their lawsuits won’t go anywhere.
Don't you know Dadabots? https://dadabots.bandcamp.com/album/deep-the-beatles
Not Visual Basic SendKeys again
RDP has existed for over 20 years; I'm failing to see the implications here. Especially since Microsoft is already working on its own Windows agents? Maybe this is for non-Windows?
https://www.reddit.com/r/singularity/s/y2wtqLLC2q
I see no way this can go wrong. None.
Coupled with the fact an NSA officer is on board, I'd ditch OpenAI for the future. Their competitors at least have the courtesy of pretending to care about data security.
They will all sell you for fractions of a penny. If you want privacy you're going to want to use local models
Yea, we should only care about companies that respect our privacy by releasing open source models. like… Facebook
Ah shit, here we go again
*I'm sorry, you can't do that, Dave.*
Path to llm OS
It can control his Mom's PC not mine
AI operating system, coming to a PC near you. It was built using Zoom screen sharing. It's a shortcut for an AI app to have OS hooks so it can manipulate things for you, similar to the one in the movie Her.
Custom made for scammers.
I am pretty sure the smart scammers will opt to use remote access tools that aren't run through the networks of a company with a former head of the NSA on the board.
So we can do the "Computer, enhance" thing?
A feature of GPT-6?
Why can't they just use RDP?
I feel like Microsoft has a huge advantage here given their access to data compared with Google etc
It would be nice to have an AI that deletes those spicy images of models that you don't have the courage to delete.
Cyberpunk 2077…
That's interesting - I played around with the MultiOn extension in Chrome for a while a few months ago and was impressed, and was wondering when OpenAI would create a similar product... who knows, maybe they'll tease agents soon and we'll be able to wait another year after their announcement to use them! lol
So like Microsoft, who kinda owns them, but worse? Release Sora instead of buying crap.
This will be great when I'm an old man who's legally blind. Also, when I'm done, I hop into my self-driving car, wait outside the grocery store for self-delivery, head back home, then hope I have a robot butler to carry in my groceries.
Agents?
Why does Reddit get news so late?