AIDL Weekly Issue #66 – Uber Lays Off 100 SDC operators

Editorial

Thoughts From Your Humble Curators

We have a BIG announcement this week. We launched Expertify, our messaging web app, for our 155,000 A.I. community members to engage and discuss in real-time.

Hope you can join and try it out.

In other news, we look at two technical stories this week:

How DeepMind is measuring abstract reasoning, and actually what are the abstract reasoning we are talking about?
The new Facebook Talk the Walk task – how would AI guide a tourist through the New York City?

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 155,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65. Join our community for real-time discussions here – Expertify

Artificial Intelligence and Deep Learning Weekly

News

Uber Lays Off 100 SDC operators

We should ask : what does this mean to Uber SDC? And what does it mean to SDC in general?

cnbc.com

Deals

Artificial Intelligence and Deep Learning Weekly

Blog Posts

DeepMind papers at ICML 2018

Several interesting papers in our mind:

Machine Theory of Mind
Parallel WaveNet: Fast High-Fidelity Speech Synthesis
Efficient Neural Audio Synthesis
Measuring abstract reasoning in neural networks – see the next item.

deepmind.com

DeepMind’s Measuring Abstract Reasoning

This recent work from DeepMind probably catch many of our eyes. What are DeepMind doing? And how are they measuring reasoning from AI?

First of all, what is the abstract reasoning the blog is talking about. In fact, we are really talking about a particular type of intelligence test called “Raven’s Progressive Matrices” (RPMs). See the Wikipedia Page for some examples. Basically you are given panels of figures each have geometric shapes or patterns and your goal is to reason what should the next panel look like. It is mostly a visual reasoning test for humans. They were first presented with sequences of shapes, colors and patterns in several panels and then asked to derive the las.

With this setup, it’s not hard to imagine that you can come up with a task which machines can be used for guessing answers for RPMs. As you might read from the blog, the best architecture is what the authors called wild relationship network (WRAN). What is WRAN? Actually it’s rather simple. Say you are given K panels. You just run your favorite network on the task, then the answer would be summed and feed into another MLP.

At this point you might ask, “Hmm. So how much of this really measure abstract reasoning that I am thinking about?” The truth is, not much, even the authors admit that visual reasoning is only part of general-purpose abstract reasoning. Still it’s interesting to see if machines can be used to solve a popular IQ test of humans.

If you view it this way, experimental results of the paper are quite fascinating, e.g. the DeepMind researchers have tried to add distractor into panels to see if performance drop. For more details, you might want to take a look of the paper version of the work. You may also find the original data set at here

deepmind.com

Open Source

Andrew Trask’s New Book

We are always fans of Andrew Liam Trask. He is almost finishing a deep learning book. Follow the link to download it!

twitter.com

Navigating New York City by AI Talking With Each Other

Here’s an exciting new task introduced by FB – can we use an AI agent to guide tourist through New York City verbally? In the task “Talk the Walk”, two agents, “a guide” and “a tourist” has the common goal to reach a target location through dialogue. The “tourist” only access the street view, and can communicate with the “guide” through natural language dialogue. Whereas the guide can only guess the tourist location through a map. This is definitely an open problem. Can AI resolve this task?

First of all, the street view actually comes with landmarks. So the first tasks is to come up with a guess of where the “tourist” is, or tourist localization in the paper. This doesn’t make the task easy – the ground truth suggest that landmark observation doesn’t always localize a tourist. The team then tries to come up with a method which also account for other information from tourist and map them into a coordinate in the map.

It sounds like a difficult but an interesting open problem to us. Here we are talking about two agents which NLP dialogue is the only means of communication, and thus you can imagine information will be lost in the process. The task sounds really cool and can have major applications in real life. So check it out!

arxiv.org

About Us

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 156,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65. Join our community for real-time discussions here: Expertify

Artificial Intelligence and Deep Learning Weekly

Editorial

Thoughts From Your Humble Curators

Sponsor

News

Deals

Blog Posts

Open Source

About Us

Leave a Reply Cancel reply