Home IT Hardware Assets Vision is About

[Hearing from an AI Expert – 3] Vision is About

277


Sven Dickinson, Head of Samsung’s Toronto AI Center

 

Can you think about a world the place the private AI assistant in your smartphone is ready to perceive as a lot concerning the world as you do? What a couple of state of affairs the place speaking with that AI assistant is as pure and straightforward as interacting with one other human? Developing these sorts of capabilities is precisely what the workforce at Samsung’s AI Center in Toronto are placing their minds to.

 

Samsung Newsroom sat down with Sven Dickinson, Head of Samsung’s Toronto AI Center to be taught extra about these thrilling fields, and what they may imply for the long run.

 

 

The Vision for Vision

The second Samsung AI middle established in North America, Samsung’s Toronto AI Center is led by Dr. Sven Dickinson, an professional in laptop imaginative and prescient and former chair of the Department of Computer Science on the University of Toronto.

 

At the epicenter of AI analysis and growth, Samsung’s Toronto AI Center is especially targeted on growing the visible understanding capabilities that permit a Samsung machine to grasp the world through which it’s located. In addition, the workforce is engaged on multi-modal interactions, that are user-machine interactions that encapsulate imaginative and prescient, language and information.

 

“Allowing Samsung devices to ‘see the world’ through computer vision enables them to ‘visually ground’ their dialog with the user, providing an integrated, multimodal experience that’s far more natural than one that’s solely vision or dialog-based” says Dickinson, whose experience contains exploring issues surrounding form notion and object recognition.

 

Touching on the advantages of multimodal expertise, Dickinson claims that, “I should not have to read manuals to figure out which buttons to push on my device and in which order. Rather, I should be able to show my device what I want, and tell it what I want, in natural language that is understandable, and situated in the world that I live in.”

 

Extrapolating on the interaction between laptop imaginative and prescient and multimodal inputs, he goes on to say that, “To achieve this breadth of comprehension, the device has to have a model of my understanding of the world, the capacity to communicate robustly and naturally with me, and the ability to see and understand the same world that I see.”

 

 

Remarking on functions for this expertise, Dickinson identifies probably the most compelling as being “a personal assistant that you not only speak to, but that sees the world the same way that you do.” Speaking to the significance of multi-modal machine interactions, Dickinson factors out how a lot cancelling out one of many modes of communication (audio, speech, sight and so on.) would hamper communication between two folks, and says that additionally applies to non-public gadgets.

 

 

A Truly Enhanced User Experience is Key

At the 2019 Consumer Electronics Show (CES), Samsung unveiled its imaginative and prescient for Connected Living, which includes connecting the 500 million gadgets the corporate sells yearly, and making them clever. Dickinson highlights that Samsung’s broad product portfolio will probably be instrumental in fulfilling this imaginative and prescient, saying that, “What differentiates Samsung is that it makes a multitude of devices in the home, including digital appliances, TVs, and mobile phones. Samsung has a unique opportunity to leverage these devices to yield a multi-device experience which follows the user from one device to another, and one room to another. This will help realize the full potential of each device to effectively communicate, to help the user execute device-specific tasks, and to learn the user’s habits and preferences so that subsequent communication is not intrusive but instead ‘always helpful.’”

 

Speaking about what his middle might want to do to really notice laptop imaginative and prescient and multimodal interplay, Dickinson feedback that, “Vision shouldn’t be about understanding photos; imaginative and prescient is…



Source hyperlink

LEAVE A REPLY

Please enter your comment!
Please enter your name here