Author: grandjanitor

AIDL Issue 80 – Alright….. We are Going to Talk About the AI News Anchor…..

Post author By grandjanitor
Post date July 25, 2019
Issue 80 November 12th 2018

Editorial

Thoughts From Your Humble Curators

This week we cover two topics. First, why has Facebook failed to build its own speech recognizer? It is surprising to see an organization like Facebook have so much trouble building and growing a single machine learning technology. What are the reasons behind it? We quote the Forbes article and offer some opinions of our own.

Also…. since we got around 10 post submissions on the much-hyped AI news anchor, we are going to talk about it too. What is it really? Is it “intelligent”? And what is the true impact of such technology?

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 178,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

Join our community for real-time discussions here: Expertify

Artificial Intelligence and Deep Learning Weekly

News

Deals

Correction: An early version of this item stated that Precision got a $20M Series B. The correct company name is Taranis.

Factchecking

The “First” AI News Anchor

Last week, we learned that the Chinese state-run news agency, Xinhua, is deploying an AI news anchor. In our experience, sensational news like this can easily mislead the public, e.g. on our own forum, AIDL, we saw 10 post submissions on the same news. We also saw this well-timed piece from MIT Technology Review, which argues that “the anchor isn’t intelligent in the slightest. It’s essentially just a digital puppet that reads a script”.

In this piece, we focus on three aspects of the news. First, is it really that shocking to see an AI news anchor like this, or have we already seen similar research work before? Second, do programs like this think like we do? Can they really replace human news anchors? Last, what is the impact of this kind of automatic news delivery machine on our future?

On the first count, what is this AI news anchor actually? In essence, it is just an audio-to-video synthesizer. Given audio, the machine generates a video monologue that looks as if a human news anchor is speaking. Such highly realistic synthesized monologues have been researched for a while. For example, just last year, researchers from the University of Washington published similar work that creates a video of President Obama from audio alone; all you need is a large amount of video and a clever frame-picking algorithm. There are many nuances to such work. For example, making the lips look realistic has always been a tough technical problem. We recommend taking a look at the original paper.

There were also similar commercial efforts such as Lyrebird. In a nutshell, if you have followed deep learning development over the last few years, this piece of news shouldn’t surprise you at all. You may be able to replicate this work if you happen to have a lot of video footage of a news anchor. Since Xinhua is state-run, it is no surprise that a large amount of footage is available for a specific anchor; in this case, his name is Zhang Zhao.

If an AI news anchor is technologically feasible, then the next question is whether such a machine is “intelligent” in the common sense of the word. The answer is: of course not. As we just mentioned, the anchor is more like an audio-to-video converter. Since it is just a converter, you should not expect it to exhibit the other behaviors of human anchors. For example, it would not have friendly banter with its co-anchors, nor would it improvise when something unexpected happens during a broadcast. This means the machine is really not what we humans think of as an intelligent news anchor. All it is, as MIT Technology Review suggests, is “a digital puppet that reads a script”.

The final point is the implication of such technology. The strength of an automatic news machine, of course, is that unlike humans it never gets exhausted. Such technology could have many interesting applications. For example, in poor areas where setting up physical news stations is a problem, machine news anchors can come in and fill the void. Of course, the downside is that they are also easy to misuse, e.g. a malicious party can use them to spread disinformation. Remember DeepFake? There have been concerns about whether hackers would use deep-learning-based video synthesis to affect the US mid-term elections.

We believe, though, that the first step to stopping the misuse of a technology such as DeepFake is to understand what the technology really is. Hopefully this article helps you do so.

scmp.com

Blog Posts

Why Facebook Failed to Build Their Own Speech Recognizer

One of us (Arthur) has worked in the speech recognition industry, so we are quite aware that Facebook has been developing its own recognizer. Yet it is also painfully clear from the article that FB’s direction of development is misguided and doomed to fail. So we are writing this item to analyze the cause, and what Facebook could potentially do.

To start off, what’s so special about automatic speech recognition (ASR) in the business of machine learning? If you delve deep, speech as a pattern is subtle and hard to model. Currently, speech is usually represented as a sequence of n-dimensional vectors, so modeling is usually done with non-trivial models such as the hidden Markov model (HMM) or, in deep learning, long short-term memory (LSTM). Even now, training a good ASR model on 10,000+ hours of data takes a lot of resources to do well. First of all, data collection has to be based on the specific accent of your market, e.g. a US English model would not work too well in other English-speaking countries.

Then there is data labeling or, in speech recognition parlance, transcription. How do you find the linguistic experts to transcribe data? Unlike in image recognition, crowdsourcing might give you poor-quality transcripts and result in poorly performing models. So do you want to hire transcribers in-house? Would that fit your budget? And who is going to manage them?

So say you have transcribed data; you are only halfway there. Unlike the workflow of image recognition training, much practical speech recognition training still consists of multiple steps. (Seq-to-seq training works, but it takes a lot of data.) As a result, training is still a magical step which takes specialists to tend, e.g. what if the training fails? Do you need to fix the source code? And how do you interpret the various failure modes in training? None of these questions is trivial. ASR source code is usually a complex program which requires programmers to understand deep coding topics such as dynamic programming, numerical optimization or matrix computation. Much of it is written in a low-level language such as C/C++ (i.e. not Python), and it takes an experienced coder to maintain. That’s why most company job postings on speech recognition require master’s or doctoral degrees from the candidates.
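
To give a flavor of the dynamic programming mentioned above, here is a minimal sketch of Viterbi decoding for a toy hidden Markov model. It is written in Python for readability only. The function names, the two-state "sil"/"speech" model and all of the probabilities are made up for illustration; a production recognizer works on acoustic feature vectors with far larger state spaces and is not structured like this.

```python
import math

def viterbi(observations, states, log_start, log_trans, log_emit):
    """Return the most likely state sequence for a toy HMM (log domain)."""
    # best[t][s]: log-probability of the best path that ends in state s at time t
    best = [{s: log_start[s] + log_emit[s][observations[0]] for s in states}]
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            # Dynamic programming step: reuse the best scores from time t - 1.
            prev, score = max(
                ((p, best[t - 1][p] + log_trans[p][s]) for p in states),
                key=lambda item: item[1],
            )
            best[t][s] = score + log_emit[s][observations[t]]
            back[t][s] = prev
    # Trace back from the best final state.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return list(reversed(path))

def logs(table):
    """Convert a (possibly nested) table of probabilities to log-probabilities."""
    return {k: logs(v) if isinstance(v, dict) else math.log(v)
            for k, v in table.items()}

# Hypothetical two-state model; every number below is invented.
STATES = ["sil", "speech"]
LOG_START = logs({"sil": 0.8, "speech": 0.2})
LOG_TRANS = logs({"sil": {"sil": 0.7, "speech": 0.3},
                  "speech": {"sil": 0.2, "speech": 0.8}})
LOG_EMIT = logs({"sil": {"quiet": 0.9, "loud": 0.1},
                 "speech": {"quiet": 0.2, "loud": 0.8}})

path = viterbi(["quiet", "loud", "loud", "quiet"],
               STATES, LOG_START, LOG_TRANS, LOG_EMIT)
print(path)
```

With these invented numbers the decoded path is sil, speech, speech, sil. Even in this toy form you can see why debugging such code takes some background: the scores live in the log domain, and an error in the backpointer table only shows up at trace-back time.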

Once you can create and maintain a speech recognizer, the final question is how you grow its capability. For example, it’s hard to estimate the time cost of training a recognizer in different languages. So what deadline should you set? If engineers fail to train a recognizer on time, what should you do? Those are difficult questions. In our opinion, if it takes time for a team to fix bugs but they get it right in a production system, it is worth all the time it takes. But in our fast-paced world of development, not all companies would or could give so much time to tend a recognizer. Not to mention that the best speech recognizers, in our opinion, implement the most advanced mathematical models.

Let’s go back to the case of Facebook: from the article, we learn that product managers would switch the domain of the speech recognizer every half a year, and it could go from news transcription to voice dialogue. For those of us who have worked in the business for a long time, such switching of tasks is the most nightmarish scenario. Half a year might not even give you enough time to collect data. Not to mention that you might need to tailor product features to one type of domain, e.g. there are a lot of differences between speech recognition that runs offline and speech recognition that runs live. Have the Facebook product managers ever considered these issues? We don’t think so.

What we would suggest Facebook do …. if they are serious about building a speech recognizer….. is to form a cross-departmental team with a powerful and influential team leader. Members of the team, even managers, should have a strong understanding of how a recognizer should be built. We understand it is tough, but such a team might give Facebook a real chance to at least sustain the effort in speech recognition and perhaps, when the stars align, come up with a product which is on par with Google, Amazon or Apple.

Our two cents.

forbes.com

Google AI Blogs Last Two Weeks

Spinning Up

OpenAI released Spinning Up last week. It’s mind-blowing because it’s perhaps the first time we have seen a major institution release a course related to deep reinforcement learning.

Looking into the details of Spinning Up, the goal is to bridge the gap between papers and actual RL implementation. So in some sense, it is more a tutorial you can follow while taking an online class such as David Silver’s RL class. The tutorial is very detailed, and we also saw a fairly extensive bibliography which seems to be more relevant than standard RL textbooks and existing resources.

As for support, OpenAI promises to give high-bandwidth support for Spinning Up. We hope they can sustain the effort, because OpenAI as an organization occasionally drops initiatives such as OpenAI Gym.

openai.com

Multi-object tracking with dlib by Adrian Rosebrock

This post expands Adrian’s previous tutorial to track multiple objects.

pyimagesearch.com

AIDL Weekly Issue #79 – The Man from La Famille de Belamy

Post author By grandjanitor
Post date July 25, 2019
Issue 79 October 29th 2018

Editorial

Thoughts From Your Humble Curators

We bring you “Edmond de Belamy, from La Famille de Belamy”, the AI-generated artwork which sold for more than 40 times its estimated price. We also point you to several technical blogs, including Ruder’s “Neural History of NLP” and Google AI’s very entertaining post on curiosity vs procrastination in robotic agents.

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

Join our community for real-time discussions here: Expertify

News

Edmond de Belamy, from La Famille de Belamy

Last week, the AI-generated portrait “Edmond de Belamy, from La Famille de Belamy” sold for more than 40 times Christie’s initial estimate. We are surprised, but AI artists should have their day too.

Some have argued that artists had already generated portraits of similar quality to “Edmond de Belamy” back in 2015. For example, take a look at Robbie Barrat’s website; you may see that a similar impressionist style was already there.

And an important issue here: do humans deserve any credit for a piece of machine-generated art? Some will argue they do, because humans still serve as the final judge of the aesthetic value of a piece, as well as the tuner of generative AI algorithms such as GANs.

Perhaps one day machine-generated art will sell for more than a Picasso. But before that, maybe we should really understand how GANs work first.

nytimes.com

Commercial Pilot of Waymo.

This is perhaps the first commercial pilot of self-driving cars (SDC), and Waymo is now ahead of competitors such as Uber and Ford. ARK Invest, a research group, estimates the cost of an autonomous taxi at 35 cents per mile. No one knows for sure what Waymo’s pricing model is. If we knew, we could decide whether Waymo is really worth the tens of billions of dollars suggested by analysts.

ft.com

Deals

Blog Posts

A Review of the Neural History of NLP

This article was published about a month ago, but we found it relevant, readable and illuminating. It gives you an idea of language modeling from the pre-deep-learning era, such as Kneser-Ney smoothing, to the current day, where large-scale pre-trained models are the norm. Also notable is its non-neural section, which mentions BLEU, LDA, OntoNotes and many other significant NLP innovations which have nothing to do with deep learning. This should be refreshing for many of us who believe deep learning is the only solution to all our problems.

ruder.io

Nathaniel Popper on Blockchain in AI

Blockchain and AI are perhaps the two trendiest technologies now, so no wonder someone is trying to thread them together. One project we have mentioned is OpenMined, led by Andrew Trask, which combines model training technology with cryptography.

But the company you may hear more about is perhaps SingularityNET, by Dr. Ben Goertzel, which attempts to use blockchain to link AI services together. The author of this piece, Nathaniel Popper, has covered blockchain for years. He penned the book Digital Gold, which is a very interesting historical account of the rise of blockchain. So we think you might be interested in his view of how blockchain affects AI.

nytimes.com

Object Tracking Using dlib by Adrian Rosebrock

Here is another article from our great teacher, Adrian Rosebrock. This time he teaches you how to write a fast object tracker using dlib. Other than the tutorial on dlib, perhaps the more interesting part is his presentation of dlib’s correlation tracking. We really enjoyed it, so we don’t want to spoil it for you. Go check it out!

pyimagesearch.com

Curiosity vs Procrastination

Here is an interesting post from Google on how to deal with the sparse reward problem in RL, i.e. how to use curiosity to drive the exploration of an RL algorithm. There are two interesting highlights:

First, when the Google authors tried to create rewards based on the inability to predict, or simply surprise, they found that agents could fall into a state close to human procrastination, in which the agent keeps “exploring” indefinitely. For example, when an agent sees a (virtual) TV in a 3-D maze, it gets stuck there indefinitely because the sensory experience is always a surprise.
So the researchers figured out that you can store recent experience in a memory bank and reward exploration only when a new experience is dissimilar to the old ones. Sounds interesting. And that comparison is done by a deep neural network.
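
The memory-bank idea can be sketched in a few lines of Python. This is only a toy under loud assumptions: the class name, the threshold and the bonus value are invented for illustration, and the learned deep-network comparison described in the post is replaced here by a plain Euclidean distance between observation vectors.

```python
import math
from collections import deque

class EpisodicNoveltyBonus:
    """Toy curiosity bonus: reward observations that differ from recent memory."""

    def __init__(self, capacity=100, threshold=1.0, bonus=1.0):
        self.memory = deque(maxlen=capacity)  # keeps recent experience only
        self.threshold = threshold            # how far counts as "new" (assumed)
        self.bonus = bonus                    # size of the exploration reward (assumed)

    def __call__(self, observation):
        # Stand-in for the learned comparator: distance to the nearest memory.
        nearest = min(
            (math.dist(observation, old) for old in self.memory),
            default=math.inf,
        )
        if nearest > self.threshold:
            self.memory.append(tuple(observation))  # remember the novel experience
            return self.bonus
        return 0.0  # too similar to something already in memory

curiosity = EpisodicNoveltyBonus(threshold=1.0)
observations = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.0, 5.2)]
rewards = [curiosity(obs) for obs in observations]
print(rewards)
```

Here the first and third observations earn the bonus and the other two do not, because each sits close to something already stored. The interesting part of the real work is precisely what this sketch leaves out, namely how the comparison itself is learned.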

We don’t pretend we fully understand the technical details, but Google’s demo is fairly intuitive, and we are fascinated.

googleblog.com

Open Source

Facebook Mask-RCNN in Pytorch

Here is an implementation of Mask RCNN in Pytorch released by Facebook.

github.com

AIDL Weekly #78 – Uber vs Waymo: A Revisit

Post author By grandjanitor
Post date July 25, 2019
Issue 78 October 22nd 2018

Editorial

Thoughts From Your Humble Curators

This week we revisit the Uber vs Waymo case, and share several resources, including the second edition of Sutton and Barto’s book: “Reinforcement Learning: An Introduction”.

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 177,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

Join our community for real-time discussions here – Expertify

News

Deals

Blog Posts

Did Uber Steal Google’s Intellectual Property? – A New Yorker Article

Back in February, Uber and Waymo settled out of court for a 0.34 percent stake in Uber. At the time, many (including us) saw the settlement as a surprising win for Uber and its new CEO, Dara Khosrowshahi.

Reading the New Yorker article penned by Charles Duhigg should make you rethink, as he tells a very different story. The whole piece is also very readable, and you can see it as a historical account of the rise of SDC at Google, or … just the rise of SDC. We give you a couple of highlights here:

Unlike most IP lawsuits in the Valley, the Uber vs Waymo case is actually a trade-secret suit rather than a copyright suit. This is unusual because California law restricts non-compete agreements, so trade secrets can easily leak from one company to another. (In some sense, by design.) So generally it’s hard for companies to protect trade secrets.
Knowing this should make you curious why Google sued Levandowski in the first place. As it turns out, it is part of a recent trend in which big companies try to send a message to employees: leaving with trade secrets still has consequences.
Remember the tens of thousands of files Levandowski downloaded? They were seen as the most important evidence against Levandowski. As it turns out, such downloads happen automatically, so the evidence is less damning. That’s why Google’s case against Uber was shakier than we thought, and it is the true reason why Google dropped the case and went for a settlement.
Okay. What can we say about Levandowski then? Let us just quote this sentence from Duhigg:

He is a brilliant mercenary, a visionary opportunist, a man seemingly without loyalty.

What do we think? In artificial intelligence, generally, the biggest asset of a big company is the dataset it creates and possesses. But what if you have an equally resourceful opponent who can gather the same amount of data as you do? Then human talent becomes the most important factor in your success. For example, given the same amount of data, a strong and creative MLE would create superior models. And capable ML engine designers/programmers, just like powerful system programmers, are still rare in AI/DL. In some sense, they are the ones who truly carry a product from research to development.

That explains the power of someone like Levandowski…. Perhaps that is also why his abuse of power seems to have gone unpunished. Again we quote Duhigg:

Levandowski, for his part, has been out of work since he was fired by Uber. It’s hard to feel much sympathy for him, though. He’s still extremely wealthy. He left Google with files that nearly everyone agrees he should not have walked off with, even if there is widespread disagreement about how much they’re worth.

newyorker.com

Terrence Sejnowski on Deep Learning

The Verge has an interview with Terrence Sejnowski on his view of deep learning. There are many interesting tidbits, e.g. his view on how neuroscience and deep learning are related, and how Prof. LeCun designed the ConvNet.

theverge.com

“ML is losing some of its luster for me. How do you like your ML career?”

This is a thread we saw on Reddit. It says a lot about the status of current MLEs: in some sense, they are the core development team of their products. Yet they are often under-resourced, and they are also responsible for educating the whole company about the latest advances in A.I.

We want to throw out a few points:

First, let’s not single out management as the only problem in product development. Managers have their own issues: working on sales, dealing with operations. Management in general is not an easy task. Your goal as an MLE is to help and educate everyone in your company to adopt a new technology. So do communicate, and even over-communicate.
Then we should ask whether the feeling that “ML is losing its luster” is new. Actually, not really: ML has always been a mundane job filled with tedious tasks. Just like every occupation, being an MLE has its fair share of joy and sorrow: your time struggling with code to get a script running, your time tending a week-long training run, your time optimizing inference and training routines. None of these is trivial, but once you have done them a couple of times, they still feel like one big grind.
So maybe a good way to find solace is just to see ML or AI the way it is: nothing fancy, nothing futuristic. MLE is just a job which can pay your bills.

Or if you feel fancy: talking about AI at a dinner party is kind of cool. But that shouldn’t be why you choose MLE as a career. 🙂

reddit.com

Video

MIT AGI Conversation with Prof. Yoshua Bengio

Shared by Lex Fridman, Prof. Bengio talked about how to make an estimator less biased to data.

youtube.com

Book Review

Sutton RL book 2nd edition

After many years of waiting, we finally see the second edition of Prof. Richard Sutton and Prof. Andrew Barto’s book, “Reinforcement Learning: An Introduction”, released. You can find the book shared under a Creative Commons license, and we share one of the links.

If you happen to own the first edition, you will notice that the second edition has different notation and many new and updated chapters. For example, there are new chapters on the neuroscience of reinforcement learning. The case studies in chapter 16 also include discussion of AlphaGo and AlphaGo Zero, both of which are very significant advances in computer science.

To us, this is a much-needed update to one of the bibles of the field. So make sure you check it out.

google.com

AIDL Weekly Issue #77 – Keras vs Tensorflow by Adrian Joey Rosebrock Stanford NLP 2012

Post author By grandjanitor
Post date July 25, 2019
Issue 77 October 15th 2018

Editorial

Thoughts From Your Humble Curators

This week we cover two recent Google results: one on metastatic breast cancer detection, the other on their recent ActiveQA system.

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

Join our community for real-time discussions here – Expertify

News

Deals

Blog Posts

Keras vs Tensorflow by Adrian Joey Rosebrock

Joey Rosebrock wrote another useful article for beginners: should you even compare Keras and TensorFlow? This is a FAQ in AIDL as well. And of course, the answer is no, because Keras is a layer built on top of TensorFlow. We think Joey once again gave a very good answer in his post.

A good question here is what the impact is if you have misconceptions in deep learning, such as treating Keras and TensorFlow as two separate, competing frameworks. We believe these misconceptions are detrimental to your learning. Generally, it’s always better to learn a technical topic from first principles, e.g. in deep learning, you should first learn ideas such as optimization, gradient descent, and forward and backward propagation. Then move on to the basics of TensorFlow, and eventually realize Keras is just a layer on top. Going down this path will take you more time, but your understanding will be real. You will also ask more relevant questions, and thus get closer to finishing a task or a project, or perhaps even getting a job in machine learning.
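
To make the “first principles” point a bit more concrete, here is a minimal sketch of gradient descent with a hand-written forward and backward pass for a tiny linear model, in plain Python with no framework at all. The data, learning rate, step count and function name are made up for illustration; this is roughly the kind of bookkeeping that TensorFlow, and Keras on top of it, automate for you.

```python
def train(xs, ys, lr=0.05, steps=2000):
    """Fit y = w * x + b by gradient descent on mean squared error."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Forward pass: predictions and their errors.
        errors = [(w * x + b) - y for x, y in zip(xs, ys)]
        # Backward pass: gradients of the mean squared error w.r.t. w and b.
        grad_w = 2.0 / n * sum(e * x for e, x in zip(errors, xs))
        grad_b = 2.0 / n * sum(errors)
        # Update step: move against the gradient.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Made-up data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]
w, b = train(xs, ys)
print(round(w, 3), round(b, 3))
```

On this noise-free data the fitted parameters end up very close to w = 2 and b = 1. Once you have written the forward pass, the gradients and the update by hand a few times, it becomes much clearer which part of the job each layer of a framework stack is taking over.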

pyimagesearch.com

Stanford NLP 2012

Natural language processing (NLP) has changed rapidly in the last 5 years. But the basics haven’t changed much, and it’s still useful to learn parsing, stemming and the basic IBM models. One good beginner-to-intermediate course in NLP is perhaps Jurafsky’s NLP class from 2012. Here is the official Stanford Online playlist. You may find it useful to watch it alongside the 2nd edition of Jurafsky’s book, Speech and Language Processing.

The issue with the course, viewed from a modern perspective, is that it is not deep-learning-based. But you still need a fair amount of NLP knowledge to better learn current deep learning algorithms.

youtube.com

Applying Deep Learning to Metastatic Breast Cancer Detection

This is a recent blog post on how Google applies deep learning to metastatic breast cancer detection. The post is quite readable and points you to two relevant papers. The idea is quite simple: use deep learning to detect cancer that has spread (metastasized) from images. This is what the first paper, “Applying Deep Learning to Metastatic Breast Cancer Detection”, talks about. The paper details the development of LYNA, which is essentially a visualization system for doctors. The selling point is easy to understand: a machine can look at images pixel by pixel, so this should save doctors’ time.

That is the theory, so what the Google authors did was test whether this workflow is really better. That is the second paper: “Impact of Deep Learning Assistance on the Histopathologic Review of Lymph Nodes for Metastatic Breast Cancer”. They ran a study with six board-certified pathologists and asked whether using the system saves them time. It does, the Google authors said: the per-slide review time went down from 2 minutes to 1 minute.

If you look at the article, the authors of the post were very cautious about whether the technology is useful yet:

While encouraging, the bench-to-bedside journey to help doctors and patients with these types of technologies is a long one. These studies have important limitations, […]

This sober tone is oddly reassuring.

One more thing: does this paper have anything to do with DeepMind’s work on breast cancer (as we reported in the last issue)? It doesn’t seem to be the case. Currently Alphabet’s research on ML applications in medical imaging seems to come from two institutions: DeepMind and Verily. While DeepMind was involved in the DeepMind-Royal Free event in the past, Verily’s researchers seem to be more sober in tone when it comes to whether machine learning can really be used in medical imaging. It happens here in their breast cancer detection research. It also happens in their research on detecting cardiovascular disease through retinal images (as we discussed in Issue 49).

googleblog.com

ActiveQA

We learned about ActiveQA this week through VentureBeat, and the reporting is quite faithful. So we just want to add a few things for our subscribers:

QA machines are not chatbots, so while Google’s system is quite powerful, you can’t quite use it to build a bot which converses with humans. But as advertised, ActiveQA is good at coming up with natural-sounding, clarifying questions.
Looking at the details in the original paper, there are many interesting technical tidbits. For example, neural MT is used to model the rephrasing of a question, but its training is unusual: they first train a neural MT model on bilingual language pairs, then adapt the final model with single-language retraining. So this applies the idea of zero-shot translation, but in a way more suitable for tasks like rephrasing, which have less training data.
The GitHub repository is a TensorFlow-based package with which you can train the seq2seq reformulation models as well as the ConvNet answer-selection models. It feels more like a research codebase than a general framework.

googleblog.com

OpenAI Scholars 2019

Applications for the OpenAI Scholars Winter 2019 program are now open to the public. OpenAI, as a non-profit, has produced many impressive research results in the past few years, and its OpenAI Scholars program has also produced several interesting final projects.

Successful applicants can work remotely with mentors from the institute, and you are guaranteed an interview with OpenAI after finishing the program. Sounds like a great opportunity for our subscribers.

openai.com

AIDL Weekly Issue #76 – fast.ai pytorch library, MS’s infer.NET Oct 8th 2018

Post author By grandjanitor
Post date July 25, 2019
Issue 76 October 8th 2018

Editorial

Thoughts From Your Humble Curators

This week we cover two new open source frameworks: the fast.ai Pytorch library and Microsoft infer.NET.

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 176,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

Join our community for real-time discussions here – Expertify

News

Deals

Project Maven: Redux

There are plenty of image classification applications in the military, so it should not come as a surprise that Google had been working with them. Google walked away from Maven back in June. Will they come back?

washingtonpost.com

Blog Posts

fastai for PyTorch

We have great respect for Jeremy Howard, who developed the very popular and practical course aptly called Practical Deep Learning for Coders. And now Jeremy is also publishing the course’s library, which builds on top of PyTorch. It has optimizations and blessings from the original PyTorch team, and has already been showcased in several commercial projects. This post from Jeremy highlights several recent projects using the library, and what’s under the hood.

fast.ai

PyImageConf 2018 Recap by Joey Rosebrock

This is a summary of PyImageConf 2018, written by one of the hosts and one of the favorites in our community, Joey Rosebrock. While this is a new conference, Joey was able to invite several luminaries in deep learning, such as Francois Chollet and Davis King, to the venue.

Kudos to Joey and his co-host Jeff Nova. Keep up the good work!

pyimagesearch.com

MIT Human-Centered Autonomous Vehicle

Here is the paper version of human-centered autonomous vehicles we mentioned last week.

mit.edu

Open Source

Microsoft Infer.NET

Our first impression of Infer.NET was “Oh, isn’t this yet another dot net product?” And “MS already has CNTK” already. So why one more framework? But we were pleasantly surprised. Infer.NET is more about what the post termed “model-based machine learning” which is really about training up distributions such as good old classics like mixture of Gaussian distributions, principal component analysis. So in a sense, it’s more a “machine learning framework” rather than “deep learning framework” as we now accustomed to. You won’t see the now standard tutorial on DNN, image recognition and machine translation, but then you will find interesting applications such as recommendation systems or skill assessment.

Infer.NET has been used in many cool real-life applications. One cited by the post is the TrueSkill 2 system, which assesses the skill of players. There are also a hundred-plus papers based on the work.

This is already quite refreshing. The team also wrote a book about the work, “Model-based Machine Learning”, cowritten by one of our favorite authors, Christopher Bishop, who penned the Bible “Pattern Recognition and Machine Learning”. Furthermore, there is a future chapter on probabilistic programming. Wowowow, there are just too many goodies here. So don’t forget to check it out.

microsoft.com

AIDL Weekly #75 – Inside Google Dataset Search

Post author By grandjanitor
Post date July 25, 2019

Issue 75 October 1st 2018

Editorial

Thoughts From Your Humble Curators

In this issue we cover Paige.ai and what’s inside Google Dataset Search.

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 175,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

News

The Story of Paige.ai

We had a two-week break since the last issue, so this piece about Paige.ai is around 10 days old. But the story of Paige.ai is eerily similar to the DeepMind-Royal Free event, which we covered extensively in Issues 20 and 42. So we decided to take a closer look.

Someone at Sloan Kettering Cancer Center decided to monetize the historical data and notes amassed over 60 years through a startup called Paige.ai. More importantly, the company has exclusive use of the center’s archive of 25 million tissue slides. That data corpus is VERY valuable.

The problem is that none of this was disclosed to the staff pathologists beforehand, and the center’s executives own stakes in the new startup, Paige.ai. There are also potential privacy-related questions, even if the data have been anonymized (how anonymized?). When all this came to light, it caused an uproar at the center. Since Sloan Kettering is a non-profit and Paige.ai is for-profit, the optics do not look good.

The NYT story gives a nice account of what happened. We recommend you check it out.

We’ll leave you with the words of warning issued by the Information Commissioner’s Office (ICO), which investigated the DeepMind-Royal Free affair: just because you can, it doesn’t mean you should.

nytimes.com

Blog Posts

Inside Google Dataset Search

Google Dataset Search (GDS) has been our go-to website for finding datasets since its inception. When you think about it, building GDS has to be a challenge in and of itself. How do you gather unstructured data from the web? How do you deal with replicated data? And once you have gathered this information, how do you organize it in a data structure?

All these questions are answered by this interesting post from the Google AI Blog. No surprise, none of these questions is trivial. For example, on the issue of representing the data, the researchers try to reconcile it with the underlying Google Knowledge Graph (GKG) structures.

googleblog.com

The Best Machine Learning Books

Normally we are not into sharing lists of resources. Well, it’s just a cliche.

But when Uri Eliabayev, an AI consultant as well as the administrator of Machine and Deep Learning Israel (MDLI), showed us his list, we paid attention. Uri’s list is gathered from experienced members of MDLI, and all of the entries are gems. We recognize several of them: PRML, Duda and Hart, and Goodfellow’s Deep Learning. These books helped us in the past as well, and we found them to be not only useful references but must-read textbooks for improving your ML knowledge.

medium.com

RTX 2080 Ti Benchmark

Lambda Labs did an interesting benchmark on how the RTX 2080 Ti matches up against its predecessor. TL;DR: the gain is truly impressive. Check out their link if you are thinking of upgrading your GPU card.

lambdalabs.com

Video

Human-centric SDC

MIT researcher Lex Fridman shared this interesting link to his latest work on human-centric SDC (self-driving cars). We found it quite impressive because, unlike most current-day SDC work, which focuses on autonomous driving, Fridman’s puts more emphasis on the human’s emotional state and how that affects driving. This idea of a human in the loop of AI is not new, but Fridman showcases it perfectly in his video. So check it out!

youtube.com

Member’s Question

Ask AIDL: Why People Choose to do AI

As answered by AIDL members. A little light-hearted, yet genuine.

facebook.com

AIDL Weekly Issue #74 – Facebook vs Fake News (or AI vs Fake News?)

Post author By grandjanitor
Post date July 25, 2019

Issue 74 September 17th 2018

Editorial

Thoughts From Your Humble Curators

This week we look into the details of Facebook’s usage of machine learning in detecting misinformation. What is the purpose of their latest Rosetta system? And how prepared is Facebook against fake news?

As always, if you find this newsletter useful, feel free to share it with your friends/colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 173,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

News

Tesla T4

Nvidia’s Tesla T4 was just announced. Let’s take a look at some notable features:

INT4 and INT1 (experimental): once a team has optimized some deep learning code by lowering the floating-point precision, the inevitable next step is converting the floating-point code to integer arithmetic. Supporting INT4 and INT1 enables bleeding-edge teams to create even faster products.
TensorRT Hyperscale: a collection of packages which wrap around T4. It sounds exciting, and TensorRT 5 can be used to optimize inference-related operations.
16G: this sounds like a lot if you think at ResNet scale. But you can imagine that real-life convnets already reach the memory limit imposed by earlier versions of the card.
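
To make the move from floating point to low-precision integers more concrete, here is a minimal sketch of symmetric linear quantization in plain Python. The function names (`quantize`, `dequantize`), the sample weights and the single per-tensor scale are our own illustrative assumptions; this is not Nvidia’s or TensorRT’s API, and real toolchains calibrate the scales far more carefully.

```python
def quantize(values, num_bits):
    """Map floats to signed integers of the given bit width (symmetric, per-tensor)."""
    qmax = 2 ** (num_bits - 1) - 1          # 127 for INT8, 7 for INT4
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer level and clamp to the representable range.
    codes = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, chosen only for illustration.
weights = [0.02, -0.51, 0.33, 1.27, -1.0]
errors = {}
for bits in (8, 4):
    codes, scale = quantize(weights, bits)
    restored = dequantize(codes, scale)
    errors[bits] = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"INT{bits}: codes={codes} worst-case error={errors[bits]:.4f}")
```

The fewer bits you keep, the coarser the grid of representable values, which is why lower precision trades accuracy for speed and memory.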

Of course there are also spec improvements. But overall, T4 feels incremental. (And yeah, what happens after INT1?) In some sense, we are all waiting for mass availability of a TPU-like product, released either by Google or by an up-and-coming but ultra-secretive company such as Groq. This is especially true when we talk about inference time.

tomshardware.com

Deals and Acquisitions

Microsoft Acquires Lobe

Also take a look at the trends in AI in healthcare funding.

Blog Posts

The Facebook Rosetta System

Facebook has been plagued by fake news and misinformation. It’s a challenging problem. How would you solve it?

As it turns out, one of the thorniest problems is detecting text within memes. Meme text can come in different languages and contain weird symbols, so a vanilla text detector is unlikely to give the best performance. As a result, Facebook came up with Rosetta, their own text detection and recognition system.

So here are a couple of notes on Rosetta which we think are quite interesting:

For starters, it is based on Faster R-CNN. Technically, this is very interesting because, compared with YOLO, Faster R-CNN is known to be more accurate yet much slower. How did the FB guys speed it up? It turns out they use a one-year-old idea, ShuffleNet (https://arxiv.org/pdf/1707.01083.pdf), to reduce the size of the networks. In the actual implementation, Facebook uses a non-trivial optimization of GEMM which utilizes INT8. This further improves the speed of inference.
Then there is recognition. Would it be an all-in-one network for both detection and recognition? It doesn’t seem to be the case. The recognition model itself is trained with the connectionist temporal classification (CTC) cost function. As a side note, CTC seems to be the more popular method in OCR, whereas in speech recognition other seq2seq paradigms are still under active research.
With both the detection and recognition networks all set, we just need to tap Facebook’s unlimited amount of training data, right? Ah, but then where are the labels? So the researchers decided to use another approach, which tries to generate text “in the wild”. A classic example of data augmentation coming to save the day.
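
For readers new to CTC, here is a minimal sketch of the collapsing rule behind it: the network emits one label per frame (including a special blank), and a frame-level path maps to a transcript by merging repeated labels and then dropping blanks. The CTC cost function sums the probability of all paths that collapse to the target text. The `BLANK` symbol and the function name are our own assumptions for illustration, and this is not Rosetta’s code.

```python
BLANK = "-"  # hypothetical symbol standing in for the CTC blank label


def ctc_collapse(frame_labels):
    """Merge consecutive repeats, then drop blanks, to get the transcript."""
    out = []
    prev = None
    for label in frame_labels:
        # Emit a label only when it differs from the previous frame
        # and is not the blank.
        if label != prev and label != BLANK:
            out.append(label)
        prev = label
    return "".join(out)


# The blank between the two "l" runs is what lets CTC produce a double letter.
print(ctc_collapse("hh-e-ll-lo--"))  # prints hello
```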

Overall, Rosetta showcases a world-class level of engineering expertise. It churns out text that becomes the input of many FB text analysis engines. The next big problem for FB is how to engineer a programmatic system to fact-check at scale, which could be even more challenging.

For a deeper dive into their text detection, take a look at FB’s impressive paper at KDD18.

fb.com

Zuckerberg’s View on How Prepared FB Is for Fake News and Misinformation

This is the widely circulated write-up by Mark Zuckerberg. He discusses how well Facebook is prepared for the upcoming US mid-term elections.

As you might have read in the news, the US Justice Department has been pressing criminal charges against hackers (for example, see here) who spread fake news on the internet and allegedly affected the US presidential election back in 2016. Of course, Facebook is in hot water because much of this fake information was spread through their network. How would they respond?

The question for us AIDLers is: can we just create learning machines which filter out fake news? This seems to be a simple text classification problem, but several factors make it much harder. Actual human fact-checkers not only try to resolve whether a piece of news is true, they also actively seek information from sources by interviewing them. The reason is simple: to determine if something is true, you need to seek knowledge in the real world and then decide. For a hypothetical example: to decide if a pizza joint is actually selling pepperoni pizzas, text alone doesn’t give you enough information. Someone will need to gather information from the restaurant directly and ask “Do you actually sell pepperoni pizza?”, or even order a pepperoni pizza and see if the restaurant can produce one.

This knowledge of the world has a fancy name: “ontology”. Ontology as a study comes from philosophy, where a dictionary would say it means “knowledge of being”. But in our sense, it is more a kind of “world knowledge”. Human fact-checkers are able to effectively gather world knowledge to judge a piece of news. Is it possible for us to replicate that capability? Unfortunately, while ontology-driven NLP is under active research, we don’t see it maturing any time soon.

Now perhaps you can see how tough machine-based fact-checking is. If you look at Zuckerberg’s address closely, machine learning only appears once, and his focus is on automatic detection of fake accounts. And as we learn from the example of Rosetta, FB has also made an impressive effort to extract text from images such as memes. In the note, you may see that Rosetta is used in detecting hate speech, which again we can think of as a simpler case of text classification. But none of these efforts really automates fact-checking. That’s why Facebook is using independent agencies to fact-check news. So in a sense, we still need to resort to humans.

So when Zuckerberg said:

Today, Facebook is better prepared for these kinds of attacks.

We sense cautiousness … and honesty from him. Compared to two years ago, technology has advanced such that we should be able to extract all texts (from images, from voice and from text) more easily. But to detect the “faked”ness of news? We are not yet “well-prepared”.

facebook.com

How to get started with Keras, Deep Learning, and Python – by Adrian Rosebrock

Here is another post from Adrian Rosebrock. This time it is about how you should use Keras. It sounds like a common topic, but once again you should look at Rosebrock’s strength: he is always very detail-oriented and comes up with genuinely useful blog posts.

So what is the biggest problem with using Keras then? Well, installation. In fact, if you learn how to install a deep learning toolchain correctly on … let’s take the simpler case … Linux, then you are probably the favorite in your workplace. That’s because deep learning, as developed to date, has a complicated toolchain, similar to compiling a compiler or doing cross-compilation. That’s why a detailed tutorial is nice to have.

So this is what Rosebrock’s tutorial does. The first step he guides you through is installing Keras correctly. There is more, and we don’t want to give spoilers. So take a look!

pyimagesearch.com

Anatomy of AI System

This is perhaps the most beautiful infographic on how an AI household product like Echo is actually made in practice. The graph includes many non-AI components, but that’s just the reality of AI product development: algorithms, even machine learning, are just a very small part of the equation.

anatomyof.ai

AIDL Weekly Issue 70 – Nvidia Turing, TF 2.0

Post author By grandjanitor
Post date July 25, 2019

Issue 70 August 20th 2018

Editorial

Thoughts From Your Humble Curators

Hey Hey! We are back. This issue we bring you two interesting stories:

The Nvidia Turing architecture – how much would it affect deep learning?
Tensorflow 2.0 – what is the major change? How would that affect you?

As always, if you like our newsletter, share with your friends/colleagues!

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 168,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

News

Deals

Also this from Techcrunch: Artificial Intelligence Continues Its Fundraising Tear In 2018

For a counter view, check out this piece on Quanergy Systems and how it lost its way.

Nvidia Turing Architecture

What are the implications of the new GPU architecture for deep learning? Are there new opportunities for optimizations, in either training or inference?

Perhaps the most eye-catching feature is INT4: 4-bit integers that allow certain types of models to be optimized. INT4 already seems to be in use in the world of FPGA design, where lower-precision integers are used for better speed and energy efficiency.

Would the popular tensor cores be carried over? Yes, they are. Developers who have optimized for Volta can potentially carry their optimizations over to Turing.

Turing seems to have graphics-related features, so it affects the Quadro series platform, which is usually known to be slower but more stable. Another segment Turing would likely change is the Tesla series: probably the whole line of P100 or even V100 would be refreshed.

anandtech.com

Open Source

Tensorflow 2.0 is coming

TF 2.0 is coming and it’s a major milestone. If you read what Wicke wrote, eager execution will be the key feature. Essentially, eager execution means that 2.0 will no longer default to a declarative programming model, in which programmers first use Python to declare the definition of a network and TF then compiles and runs the model. While it achieves higher efficiency, declarative programming is difficult to debug and more difficult to learn than the alternative, the imperative model.
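
To illustrate the difference between the two programming models, here is a toy sketch in plain Python. The `Node` class is a hypothetical stand-in for a deferred graph operation, not TensorFlow’s API; it only shows why a declare-then-run style is harder to inspect than code that computes values line by line.

```python
class Node:
    """A deferred operation: nothing is computed until run() is called."""

    def __init__(self, fn, *inputs):
        self.fn = fn
        self.inputs = inputs  # each input is another Node or a placeholder name

    def run(self, feed):
        # Resolve inputs recursively: evaluate sub-nodes, look up placeholders.
        args = [i.run(feed) if isinstance(i, Node) else feed[i] for i in self.inputs]
        return self.fn(*args)


# Declarative style: first declare the whole graph, then execute it with data.
product = Node(lambda x, w: x * w, "x", "w")
graph = Node(lambda a, b: a + b, product, "bias")
declarative_result = graph.run({"x": 3.0, "w": 2.0, "bias": 1.0})

# Imperative (eager) style: every line computes a value immediately, so
# intermediate results can be printed or stepped through in a debugger.
x, w, bias = 3.0, 2.0, 1.0
hidden = x * w
eager_result = hidden + bias

print(declarative_result, eager_result)
```

In the declarative half there is no value for `product` to look at until the whole graph runs, while in the eager half `hidden` is an ordinary number as soon as its line executes.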

PyTorch is perhaps the mainstream alternative package that adopts an imperative model. So TF 2.0’s move might be seen as a response to PyTorch.

Another note about TF 2.0: from 2.0 on, TF will no longer distribute tf.contrib, because individual projects have grown to the point where they require separate repos. This makes sense to us. It also gives newer developers opportunities to join.

A preview version of TF 2.0 is expected late this year.

google.com

Facebook Unsupervised MT Code

In April we saw a sequel to Facebook’s unsupervised MT work based on monolingual corpora. The new work performs exceedingly well on low-resource language pairs. The team just released the code on GitHub and you can find it from the link.

github.com

Video

Interview with Rachel Thomas

Rachel Thomas, one of the founders of fast.ai, answered 67 questions from Siraj Raval.

youtube.com

Member’s Question

“You know more than Silicon Valley Engineers”

(Original Link) Question: (Excerpt and rewritten) At the end of Lecture 3 of Ng’s Machine Learning Coursera course, Andrew says that “if you understood what you have done so far in the course, you know much more than many of the Silicon Valley engineers that are having a lot of success”. Is it actually true?

Answer: (By Arthur) You might have been around 5 years ago, when machine learning was more of an esoteric topic. Back then, it is true that general programmers and engineers lacked a basic understanding of ML concepts such as under/over-fitting and metric-driven development.

Fast-forward to now, though, you should be aware that machine learning has become a mainstream topic, and a typical CS major knows quite well how ML works. You are now competing with many bright young minds on your knowledge of machine learning.

So I would probably say, given what you know up to Lecture 3, “you have a good basic understanding of machine learning”. You have a good start, but my guess is you still have things to learn.

AIDL Weekly Issue 69 – A Dexterous Robotic Hand

Post author By grandjanitor
Post date July 25, 2019

Issue 69 August 6th 2018

Editorial

Thoughts From Your Humble Curators

This week we look deep into OpenAI’s latest work on a dexterous robotic hand, and ask if feed-forward networks are just as good as recurrent ones.

As always, if you like our newsletter, feel free to share it with your friends and colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 165,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65.

News

US AI Patent Filings

As you can imagine, the number of filings has grown explosively in the last 5 years. So ask: what if TensorFlow has patented ideas in it? And how would software patents affect AI production systems?

wired.com

Blog Posts

OpenAI’s Robotic Hand can Spin a Cube!

After stunning results such as AlphaGo, you would think it’s hard for AI to surprise again. But OpenAI is still churning out interesting results day by day, and the latest one is a dexterous robotic hand.

There are many nuances in their results, and you may read their blog post (as linked) and their paper. But we want to highlight several things:

As in much AI work, the authors used simulated data in training. Surprisingly enough, using real data doesn’t help much. Perhaps it has to do with scalability: you can generate several orders of magnitude more data in simulation than you can capture in the real world.
When researchers observed how the robot manipulates the cube, they found that gripping happens between the index and middle fingers, which is different from humans, who use the index finger and thumb. OpenAI’s researchers believe it has to do with the robotic hand having a flexible index finger.

Anyway, this sounds like interesting, breakthrough research to us. Of course, the natural dexterity shown in their video is impressive. But robotic hand manipulation is also just one example of a problem with high-dimensional outputs that requires reinforcement learning to learn well. Other examples are perhaps walking or climbing stairs. Solving this one problem well may lead to solving many difficult problems in the future.

openai.com

OpenCV Object Tracking by Adrian Rosebrock

Here is another tutorial written by Adrian Rosebrock. This time it is a hands-on implementation of object tracking using 8 algorithms provided by OpenCV. As always, you can learn something from Rosebrock’s writing.

pyimagesearch.com

When Feed-forward model is as good

As you know, RNNs and BLSTMs are now taught in all standard deep learning courses. Conventional wisdom tells us that an RNN usually outperforms the corresponding feed-forward network (FFN) with limited context. Yet FFNs are still more prevalent than RNNs. Is it just that practitioners choose FFNs because they parallelize better?

So that raises a question: is an FFN actually better than an RNN on some problems? This is what this post, written by John Miller, is driving at. In fact, Miller summarizes several well-known systems from the last few years which found convnets to give better performance. Another intriguing result is from Google’s technical report “N-gram Language Modeling using Recurrent Neural Network Estimation”, which shows that a 13-gram model may perform as well as a well-trained RNN.
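
As a reminder of what “limited context” means for an n-gram model, here is a minimal count-based sketch: the next word is predicted from only the previous n-1 words. This is a plain maximum-likelihood estimate on a made-up corpus, written for illustration; it is not the estimation method of the Google report, and the function names are our own.

```python
from collections import Counter, defaultdict


def train_ngram(tokens, n):
    """Count which word follows each (n-1)-word context."""
    counts = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        counts[context][tokens[i + n - 1]] += 1
    return counts


def next_word_prob(counts, context, word):
    """Maximum-likelihood estimate of P(word | context); 0.0 for unseen contexts."""
    seen = counts.get(tuple(context))
    if not seen:
        return 0.0
    return seen[word] / sum(seen.values())


# Hypothetical toy corpus.
corpus = "the cat sat on the mat and the cat slept".split()
model = train_ngram(corpus, n=3)
print(next_word_prob(model, ["the", "cat"], "sat"))  # prints 0.5
```

Anything that happened more than n-1 words ago is invisible to this model, which is exactly the limitation a recurrent model is supposed to remove.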

offconvex.org

Member’s Question

AIDL Admin’s Feedback on the New Pre-approval System.

An answer from Arthur: Zubair Ahmed posted a thread on getting everyone’s feedback about our now 2-month-old pre-approval system. So far, most of the feedback we got is positive. I just want to give you my take on the new system.

First off, our old system (think of it as post-approval) was meant to give members freedom to post what they like.

Unfortunately, self-policing by members couldn’t filter out all malicious postings such as porn, or religious and political messages which have nothing to do with AI. Plus, there were too many complaints about basic questions such as “How do I learn AI?” being asked repeatedly.

So here comes our new pre-approval system. How does it really work in practice? Let me just give you a sample of my day, and how we process different posts and decide if they should appear in the feed. I am not the only approver, but we have a fairly consistent standard across admins/mods. So you will get a good feel for our work.

Daily, we receive 50-70 posts that need approval. In my timezone, I will process around 40-50 of them. Here is a rough breakdown:

10%: selling irrelevant products such as Rolexes or web hosting. What I do: delete the post.
20%: technology-related but has nothing to do with AI. What I do: delete the post.
30%: AI-related news which comes from unreliable sources, or from a Page which just reposts a piece. Or sensational opinion about AI-related technology. What I do: I usually delete the post unless it reflects a certain zeitgeist in AI development.
10%: Members’ questions which are unclear, sometimes to the point that I have no idea what they mean. Usually these posts are poorly formatted and not proofread, and they only solicit angry responses from the slightly more knowledgeable but impatient AIDL members. What I do: sometimes I let them in, but comment on the quality of the questions. But I don’t mind deleting them. If they are “How do I learn AI?”, sorry, I would just delete them.

So the rest is what you see in the feed. That accounts for ~10-15 posts. If they are articles, they are original work from the authors; if they are code, it is shared by the programmers themselves. If they are questions, they are usually non-trivial, and their answers are good for everybody to know.

One question members often ask is how the pre-approval system affects our workload as admins. I’ll say: at the moment, it lightens our load. The reason is that we used pretty much the same curation criteria before the pre-approval system, but now we see fewer group-wide outrages over poor-quality posts. Spam such as porn, while infrequent, is disruptive to our members’ lives, and thus to ours.

There are some members who completely disagree with any pre-approval system. I’ll say this: if you just look at the post breakdown, you should quickly notice that 60% of pending posts are inappropriate for the group. We have always been removing them, even before the new system. We really tried to get the old system working, but it was too hard.

I’ll also say we admins realize that we are just humans, and can be biased and make mistakes. So we will keep an open mind, and feel free to give us feedback.

On a lighter note: Zubair Ahmed and I found that there is always someone suggesting that ML should be used to replace us admins/mods. Of course, we have also repeatedly pointed out that this is a cliche idea. But let’s see how often it appears? 🙂

AIDL Weekly Issue #61 – Project Maven

Post author By grandjanitor
Post date July 25, 2019

Issue 61 June 4th 2018

Editorial

Thoughts From Your Humble Curators

This week, we take a closer look at Google’s involvement in Project Maven, and how it ended abruptly after complaints from Google’s employees.

As always, if you like our newsletter, feel free to subscribe and forward it to your colleagues.

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 145,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65. Join our community for real-time discussions with this iOS app here: https://itunes.apple.com/us/app/expertify/id969850760

News

Google and Project Maven

This is one of the two pieces we include on Project Maven. It mostly covers the background story of why Google got involved in defense projects in the first place, and why that could be a problem from the perspective of its company culture.

We encourage you to read the whole piece; here we only give some perspective on the AI industry and, in some sense, guide you to understand the root of the internal conflicts at Google.

As you might know, defense projects are usually bid on and picked up by big defense contractors such as Lockheed, Raytheon and Boeing. So for a web search company such as Google to join is quite strange. Of course, money plays a role: Google’s web search business model is seeing its limits, and it’s hard to guarantee future growth.

But the more important part is that Google has been advancing in AI at a faster pace. Way before the time of deep learning, there was a saying that the largest databases exist only in either Google or government agencies. So AI is not just an interesting research project for Google; her vast amount of data and brain power also create a tremendous opportunity for her to replace traditional big houses such as Lockheed. Obviously smart employees see that. That’s why Google now has a Head of Defense and Intelligence Sales, and in a way, defense will likely be a solid revenue source for Google in the future.

This shouldn’t surprise you. U.S. defense has long advocated research in automatic speech recognition and machine translation. In a way, you may even find it natural that Google would have to make a decision about creating AI-based weapons once they decided to work on AI.

nytimes.com

No More Google in Project Maven

Worth mentioning: While the project is gone from Google, the department of Defense and Intelligence Sales still exists within Google.

gizmodo.com

Blog Posts

AI Winter Is Well On Its Way By Filip Piekniewski

This is a widely circulated post by Dr. Filip Piekniewski (picked up by techmeme) on the hype of deep learning. Dr. Piekniewski looks at several pieces of evidence that the current AI hype is slowing down, including the recent SDC crash as well as the lack of interesting research last year.

He praised long-time deep learning critics such as Gary Marcus:

I respect Gary a lot, he behaves like a real scientist should, while most so called “deep learning stars” just behave like cheap celebrities.

We like Piekniewski’s criticism of deep learning, although we reserve judgement on whether an AI winter is coming. As he mentioned in the conclusion, predicting the next AI winter is like predicting a market crash: it’s an uncertain business by itself.

And to mention another point: there were actually rather vibrant activities in machine learning and artificial intelligence in the mid-2000s, before the advent of deep learning. We doubt they would stop just because of one event or two. But of course, ours is also just a prediction.

piekniewski.info

Open Source

BDD100K: A Large-scale Diverse Driving Video Database

Perhaps the largest open source computer vision database: it sports 120 million images and is suitable for road object detection and lane marking. You can imagine it can be very useful for research on SDC.

berkeley.edu

Member’s Question

I realized ML is just Statistics, should I feel demotivated?

Question: Anyone had the feeling where you feel very motivated and eager to learn machine learning, and once you actually start, you realize it’s just stats which is something you really don’t like, and become completely demotivated?

Answer: (By Arthur) There are two parts to your frustration. First, you equate machine learning with statistics. Second, you feel that statistics is boring. So let’s address the second part first. Then I will come back to the first part.

Is statistics boring? I guess many people who learn statistics learn math first. If that’s your route, then perhaps one of the reasons statistics feels boring is that it is empirical and deals with imperfect phenomena of the world. So unlike Euclidean geometry, or solving quadratic or cubic equations, you can’t quite come up with an exact solution.

To many people’s dissatisfaction though, the world is better described as uncertain rather than certain. Unfortunately, only statistics can teach us more about the realm of uncertainty. So statistics is actually a rescue, and I personally feel grateful for the subject.

Can statistics be fun? You ask. It all depends on how you look at it. E.g., it took me a while to find a good proof of how a full covariance matrix can be estimated through maximum likelihood. In matrix form in particular, the math is quite interesting. I ended up buying a book by Abadir and Magnus called “Matrix Algebra”, and I browse it from time to time. I am pretty sure it is boring to some, but it’s a lot of fun to me. Btw, matrix algebra can be quite mathematical too. But you may say it is a more technical type of math.
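
For readers curious about the covariance result mentioned above, here is a sketch of the standard derivation in matrix form. It assumes i.i.d. samples from a multivariate Gaussian; the notation is ours and is not taken from the book.

```latex
% Assumption: x_1, ..., x_N are i.i.d. samples from N(\mu, \Sigma).
\begin{align}
\ell(\mu, \Sigma)
  &= -\frac{N}{2}\log|\Sigma|
     - \frac{1}{2}\sum_{i=1}^{N}(x_i-\mu)^{\top}\Sigma^{-1}(x_i-\mu) + \text{const} \\
% Setting the gradient with respect to \mu to zero gives the sample mean.
\hat{\mu} &= \frac{1}{N}\sum_{i=1}^{N} x_i \\
% Plug in \hat{\mu} and use the trace trick a^{\top} B a = \operatorname{tr}(B\, a a^{\top}).
\ell(\hat{\mu}, \Sigma)
  &= \frac{N}{2}\log|\Sigma^{-1}|
     - \frac{1}{2}\operatorname{tr}\!\left(\Sigma^{-1} S\right) + \text{const},
  \qquad S = \sum_{i=1}^{N}(x_i-\hat{\mu})(x_i-\hat{\mu})^{\top} \\
% Differentiate with respect to \Sigma^{-1}:
% \partial \log|A| / \partial A = A^{-1} and \partial \operatorname{tr}(A S) / \partial A = S for symmetric A, S.
\frac{\partial \ell}{\partial \Sigma^{-1}}
  &= \frac{N}{2}\Sigma - \frac{1}{2}S = 0
  \quad\Longrightarrow\quad
  \hat{\Sigma} = \frac{1}{N}\sum_{i=1}^{N}(x_i-\hat{\mu})(x_i-\hat{\mu})^{\top}
\end{align}
```

The interesting part is the two matrix derivative identities in the last step, which is exactly the kind of material a matrix algebra text covers.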

My conclusion on the second part is: are you sure you have seen everything in statistics and machine learning? There are many deep topics in both subjects, and you might miss the nuance at first glance. So that’s that. Of course, your frustration might come from your personal philosophy. Perhaps you don’t like uncertainty? Perhaps you don’t like the time-consuming process of collecting data? No one can blame you for that. You just have to be honest with yourself.

Let’s go back to the first part: whether machine learning is just statistics. This is slightly controversial. Let me quote one prominent person. If you asked Prof. Nils Nilsson, he once said machine learning is just the subject of getting a “machine to learn”. Statistics, on the other hand, is clearly more focused on data and its observation. So if the fancy image of an intelligent robot doing things is what attracts you, then yeah, ML is the subject to learn. It’s just that the modern theory of ML has found statistics to be very important.

So why is that the case? Oh well, it’s not that people love to be statistical; it is that nature is better described by uncertain rules. Take speech recognition. People would love to write down several rules of phonetics and do speech recognition with them. In fact, that was tried by PhD students in the 60s. Then in the 70s, people started to realize that’s not the way to go. Ta-da, here comes the hidden Markov model (HMM). Boring it is. But it is the basis of many previous-generation ASR systems, before seq2seq NN models. In fact, there are still many HMM-based systems.
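For readers who have never seen an HMM, here is a minimal sketch of the forward algorithm, which scores how likely an observation sequence is under the model instead of matching it against hard rules. The two states, the two symbols and all the probabilities below are invented for illustration; a real recognizer uses far larger models and acoustic features, not letters.

```python
def forward_likelihood(observations, start_prob, trans_prob, emit_prob):
    """Probability of an observation sequence under a discrete HMM."""
    states = list(start_prob)
    # alpha[s] = P(observations so far, current state is s)
    alpha = {s: start_prob[s] * emit_prob[s][observations[0]] for s in states}
    for obs in observations[1:]:
        alpha = {
            s: emit_prob[s][obs] * sum(alpha[p] * trans_prob[p][s] for p in states)
            for s in states
        }
    return sum(alpha.values())


# Hypothetical toy model: two hidden states that emit the symbols "a" and "b".
start_prob = {"s1": 0.6, "s2": 0.4}
trans_prob = {"s1": {"s1": 0.7, "s2": 0.3}, "s2": {"s1": 0.2, "s2": 0.8}}
emit_prob = {"s1": {"a": 0.9, "b": 0.1}, "s2": {"a": 0.3, "b": 0.7}}

likelihood = forward_likelihood(["a", "b", "b"], start_prob, trans_prob, emit_prob)
```

Every sequence gets a probability, so the system can rank competing hypotheses. That is the "uncertain rules" idea in practice.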

So, to summarize: if you are disappointed by ML, ask whether reality is better described by certainty or uncertainty. It will help you find closure.

Paper/Thesis Review

Do Better ImageNet Models Transfer Better?

This is a paper from Google investigating the limitations of transfer learning. Several findings caught our eye. For example, the researchers find that the better the model does on ImageNet, the better transfer learning generally performs. But then ResNet consistently gives better performance than models that score higher. These are intriguing results, and they have practical value when you do image classification.

arxiv.org

About Us

This newsletter is published by Waikit Lau and Arthur Chan. We also run Facebook’s most active A.I. group with 145,000+ members and host an occasional “office hour” on YouTube. To help defray our publishing costs, you may donate via link. Or you can donate by sending Eth to this address: 0xEB44F762c58Da2200957b5cc2C04473F609eAA65. Join our community for real-time discussions with this iOS app here: https://itunes.apple.com/us/app/expertify/id969850760
