It seems as if not a week goes by in which the artificial intelligence concepts of deep learning and neural networks make it into media headlines, either due to an exciting new use case or in an opinion piece speculating whether such rapid advances in AI will eventually replace the majority of human labor. Deep learning has improved speech recognition, genomic sequencing, and visual objection recognition, among many other areas.
The availability of exceptionally powerful computer systems at a reasonable cost, combined with the influx of large swathes of data that define the so-called Age of Big Data and the talents of data scientists, have together provided the foundation for the accelerated growth and use of deep learning and neural networks.
Companies are now beginning to adopt AI frameworks and libraries, such as MxNet, which is a deep learning framework that gives users the ability to train deep learning models using a variety of languages. There are also dedicated AI platforms aimed at supporting data scientists in deep learning modeling and training which professionals can integrate into their workflows.
It’s important, though, to specify that deep learning, neural networks, and machine learning are not interchangeable terms. This article helps to clarify the definitions for you with an introduction to deep learning and neural networks.
Deep Learning and Neural Networks Defined
An artificial neural network, shortened to neural network for simplicity, is a computer system that has the ability to learn how to perform tasks without any task-specific programming. For example, a simple neural network might learn how to recognize images that contain elephants using data alone.
The term neural network comes from the inspiration behind the architectural design of these systems, which was to mimic the basic structure of a biological brain’s own neural network so that computers could perform specific tasks.
The neural network has a layered design, with an input layer, an output layer, and one or more hidden layer between them. Mathematical functions—termed neurons—operate at all layers. Neurons essentially receive inputs and produce an output. Initially, random weights are associated with inputs, making the output of each neuron random. However, by using an algorithm that feeds errors back through the network, the system adapts the weights at each neuron and becomes better at producing an accurate output.
Image credit: Shutterstock
The learning occurs in a neural network by feeding it labeled input and output data, and the network improves its performance by feeding it more and more data. This form of learning is supervised learning because it requires data scientists to provide the algorithm with labeled data for the learning to occur.
Deep learning is a method of learning that occurs when computer systems learn how to recognize patterns and classify things using raw, unlabeled data (unsupervised learning) instead of the task-specific algorithms of standard neural networks that rely solely on supervised learning.
Deep learning models can perform both unsupervised and supervised learning. The characteristic feature of the neural networks underlying these models is multiple hidden layers, hence the term “deep learning”.
Because real-world data such as images, video, and IoT sensor data are typically unstructured and not convenient to process, deep learning models meet the need of getting computers to learn autonomously by discovering characteristic features of data using special algorithms.
The aspects of modern technology that facilitate deep learning include the availability of large-scale datasets, cheaper access to fast graphics processing units (GPU) that can rapidly perform mathematical calculations, and improved training algorithms.
Deep learning, machine learning, and AI
Putting it together in a broader context:
- Artificial intelligence is the capability of a machine or computer system to imitate intelligent human behavior.
- Machine learning is an application of AI concerned with getting computer systems to improve their accuracy at tasks using data alone, without specific programming.
- Deep learning is a subfield of machine learning involved with modeling high-level data abstractions to determine high-level meaning from unstructured data.
- Neural networks are systems designed to mimic the human brain and progressively improve at tasks using algorithms and labeled data.
Hopefully this brief introduction has clarified what a neural network is and how deep learning is a distinct method that builds on the idea of a neural network.
About the Author