Deep learning (DL) has rapidly become the dominant approach powering modern artificial intelligence (AI) systems. Over the last decade, its ability to process massive datasets through multilayer neural networks has transformed fields as diverse as computer vision, natural language processing (NLP), autonomous vehicles, and medical diagnosis. Achieving or surpassing human-level performance on tasks ranging from image classification to language modeling, deep learning is now recognized as a gold standard for high-impact AI advances.
Yet, for all its power and promise, deep learning is also complex, often opaque, and demanding—posing challenges around computational cost, model transparency, data bias, and responsible governance. This article provides an accessible, comprehensive overview of deep learning: starting from the fundamental principles of neural networks and their training, exploring major architectures such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), and transformers, describing the main areas of application, and addressing the spectrum of present-day challenges, risks, and governance frameworks. Throughout, essential technical concepts are explained in plain language, suitable for a general audience.
Deep learning is a subfield of machine learning (ML)—itself a branch of AI—that enables computers to learn complex patterns and representations from data by using multilayer neural networks (sets of interconnected computing units inspired by the human brain). Unlike traditional ML, which often requires hand-crafted rules and manual feature selection by human experts, deep learning models automatically extract relevant features at multiple levels of abstraction from raw data such as images, sound, or text.
Deep learning's “deep” aspect refers to the use of many consecutive layers (sometimes hundreds or even thousands) in these networks. Each layer transforms and passes on its input; the earliest layers might extract basic patterns (such as edges in an image), while later layers build on these to identify higher-level concepts (a face, or a particular object). This cascading, hierarchical pattern of learning is a key reason deep learning can handle tremendously complex tasks—which is why it is sometimes called representation learning or hierarchical feature learning.
Compared to earlier AI techniques, deep learning models are more “end-to-end”: with enough data and computation, they require less feature engineering (the manual creation of input representations), making them highly flexible and generalizable across domains from vision to language to gaming.
The basic building block of deep learning is the artificial neural network (ANN), which takes inspiration from biological neural networks in the human brain. In a biological neuron, electrical impulses travel through dendrites, are processed by the cell body, and are transmitted via the axon to other neurons. In an ANN, an artificial neuron (or node) receives weighted numerical inputs, processes them (typically through an activation function, which introduces nonlinearity), and outputs a value to other nodes in the network.
A typical feedforward neural network (the simplest kind) is organized into several layers:
- An input layer, which receives the raw data (for example, the pixel values of an image);
- One or more hidden layers, which transform the data step by step;
- An output layer, which produces the final prediction (such as a class label or a probability).
Each connection between nodes in adjacent layers has an associated weight, which determines its “strength” or importance. During the training process, these weights are adjusted to minimize the difference between the network's output and the true answer.
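As a concrete (if toy) illustration, here is what one such layer of neurons computes. The weights, biases, and inputs below are invented for the example, and ReLU stands in for the activation function:

```python
import numpy as np

def relu(x):
    # ReLU activation: introduces the nonlinearity mentioned above
    return np.maximum(0.0, x)

# A tiny layer: 3 inputs feeding 2 neurons. All numbers are made up.
x = np.array([0.5, -1.0, 2.0])            # input vector
W = np.array([[0.1, 0.4, -0.2],
              [0.3, -0.5, 0.8]])          # one row of weights per neuron
b = np.array([0.0, 0.1])                  # biases

output = relu(W @ x + b)                  # weighted sum, then activation
print(output)                             # neuron 1 stays at 0.0, neuron 2 fires (2.35)
```

Every layer in a deep network repeats this same pattern: a weighted sum of its inputs followed by a nonlinear activation.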
Neural networks train by example: they are shown pairs of input data (say, an image) and the correct output (its label), and learn to reduce their prediction error over many iterations. The most common learning process involves:
- A forward pass, in which the network computes a prediction from the input;
- A loss computation, measuring how far the prediction is from the correct answer;
- Backpropagation, which works out how much each weight contributed to the error;
- A weight update, nudging each weight in the direction that reduces the error.
This loop is repeated thousands or millions of times over the dataset, gradually yielding a model that captures subtle data patterns and makes increasingly accurate predictions.
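The loop can be sketched in a few lines. The example below is a deliberately minimal, hypothetical case: a "network" with a single weight learning the rule y = 2x by gradient descent on a squared-error loss:

```python
# Toy training loop: learn a single weight w so that the prediction w*x
# matches y = 2*x. Illustrative only; real networks run the same loop
# over millions of weights.
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0)]  # (input, correct output) pairs

w = 0.0              # start from an arbitrary weight
lr = 0.05            # learning rate (step size)

for epoch in range(200):
    for x, y in data:
        pred = w * x                  # 1. forward pass: compute a prediction
        error = pred - y              # 2. measure the prediction error
        grad = 2 * error * x          # 3. gradient of the squared error w.r.t. w
        w -= lr * grad                # 4. update the weight against the gradient

print(round(w, 3))  # converges close to 2.0
```

The four numbered steps are exactly the forward pass, loss, backpropagation, and weight update of a real training loop, collapsed to one parameter.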
While feedforward neural networks laid the groundwork, today's deep learning owes its impact to a family of specialized architectures optimized for different data types and tasks. Let’s explore the most important types: CNNs, RNNs, and transformers.
CNNs are deep learning models specifically designed to process data with a grid-like pattern, such as images. Their main advantage is the ability to detect patterns (such as edges, textures, or shapes) automatically at multiple scales and positions in the input image, vastly outperforming earlier techniques at tasks like image classification and object detection.
Modern variants introduced innovations such as deep stacking, skip/residual connections, and inception modules (processing multiple filter sizes simultaneously) to enable even deeper and more powerful networks.
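At the heart of every CNN layer is the convolution operation itself. The sketch below is a minimal, loop-based version (real libraries use heavily optimized equivalents), applied to a made-up image containing a single vertical edge:

```python
import numpy as np

def conv2d(image, kernel):
    """Valid cross-correlation: slide the kernel over the image and take a
    weighted sum at each position. This is what a CNN layer computes."""
    kh, kw = kernel.shape
    oh = image.shape[0] - kh + 1
    ow = image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i+kh, j:j+kw] * kernel)
    return out

# A toy 5x5 "image": dark left half, bright right half (a vertical edge)
image = np.array([[0, 0, 1, 1, 1]] * 5, dtype=float)

# A simple hand-crafted vertical-edge detector kernel, for illustration;
# in a trained CNN, kernels like this are learned automatically.
kernel = np.array([[-1.0, 1.0]])

edges = conv2d(image, kernel)
print(edges)  # responds with 1.0 exactly where the edge is, 0.0 elsewhere
```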
CNNs have made possible the rapid progress in:
RNNs are designed to process sequential data, where the order of elements is important—such as time series, text, or speech. Unlike feedforward networks, RNNs include recurrent connections, enabling outputs at one step to be used as inputs to subsequent steps, effectively giving the model a memory of previous inputs.
Classic ("vanilla") RNNs struggle with long sequences due to the vanishing gradient problem, which makes it hard to retain information over many steps. To address this, specialized RNN variants were created:
- Long short-term memory (LSTM) networks, which use gated memory cells to decide what to remember and what to forget;
- Gated recurrent units (GRUs), a simpler gated variant with fewer parameters.
Both LSTM and GRU are widely used for tasks where context and order matter, such as translation, sentiment analysis, and speech recognition.
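A vanilla RNN cell is compact enough to sketch directly. The sizes and random weights below are arbitrary illustration values; an LSTM or GRU would add gating on top of this same recurrence:

```python
import numpy as np

# Minimal vanilla RNN cell: at each step the hidden state mixes the new
# input with the previous state, giving the network a "memory".
rng = np.random.default_rng(0)
input_size, hidden_size = 3, 4

W_xh = rng.normal(scale=0.1, size=(hidden_size, input_size))   # input -> hidden
W_hh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))  # hidden -> hidden (the recurrence)
b_h  = np.zeros(hidden_size)

def rnn_step(x, h_prev):
    # The new state depends on the current input AND the previous state
    return np.tanh(W_xh @ x + W_hh @ h_prev + b_h)

# Process a short sequence of 5 random input vectors, one step at a time
h = np.zeros(hidden_size)
sequence = rng.normal(size=(5, input_size))
for x in sequence:
    h = rnn_step(x, h)

print(h.shape)  # the final hidden state summarizes the whole sequence
```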
The transformer architecture (introduced in 2017) marked a groundbreaking shift, especially in NLP. Unlike RNNs, transformers process entire input sequences in parallel and capture global—rather than merely local—context using a self-attention mechanism.
Self-attention allows the model to weigh the importance of each word (or element) in a sequence with respect to every other word. Thus, the model can, for example, correctly interpret ambiguous words by considering their context throughout the sentence. Self-attention scores are computed using queries, keys, and values—special representations of input elements that determine how much attention should be paid to other elements.
Transformers are typically composed of stacks of such layers, and often employ an encoder-decoder design (encoders for understanding context, decoders for generating output).
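The self-attention computation described above can be sketched as follows; the sequence length, embedding size, and random projection matrices are made-up illustration values:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence X (one row per
    element). Each output row is a context-weighted mix of all values."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv      # queries, keys, values
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)       # how much each element attends to each other
    weights = softmax(scores, axis=-1)    # each row sums to 1
    return weights @ V, weights

# Toy sequence of 4 elements with 8-dimensional embeddings
rng = np.random.default_rng(1)
X  = rng.normal(size=(4, 8))
Wq = rng.normal(size=(8, 8))
Wk = rng.normal(size=(8, 8))
Wv = rng.normal(size=(8, 8))

out, weights = self_attention(X, Wq, Wk, Wv)
print(out.shape, weights.shape)  # (4, 8) outputs, (4, 4) attention weights
```

Because every element attends to every other element in one matrix multiplication, the whole sequence is processed in parallel, which is the key practical advantage over the step-by-step recurrence of RNNs.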
Transformers have set new standards in:
They have also enabled massive advances in large language models (LLMs), which have become foundation models adopted across a wide range of AI systems.
Training deep neural networks is essentially an optimization problem: how to find the best set of weights and biases so that the model's predictions match the desired outputs as closely as possible.
The gradients are used to update the weights, typically using one of several optimization algorithms:
- Stochastic gradient descent (SGD), the classic approach of stepping against the gradient on small batches of data;
- SGD with momentum, which smooths updates by accumulating past gradients;
- Adaptive methods such as RMSProp and Adam, which adjust the effective step size per parameter.
These optimizers balance convergence speed and stability. The learning rate (step size of each update) is a crucial hyperparameter; if it's too high, the model may diverge; too low, and training may become sluggish or stuck in poor minima.
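The effect of the learning rate is easy to demonstrate on a toy one-dimensional problem: minimizing f(w) = w², a stand-in for a real loss surface. The two rates below are chosen purely to illustrate the two failure-free and failure regimes:

```python
# Gradient descent on f(w) = w**2 (gradient: 2*w), showing how the
# learning rate decides between convergence and divergence.
def descend(lr, steps=20, w=1.0):
    for _ in range(steps):
        w -= lr * 2 * w        # step against the gradient
    return w

good = descend(lr=0.1)   # each step multiplies w by (1 - 0.2): shrinks toward 0
bad  = descend(lr=1.1)   # each step multiplies w by (1 - 2.2): overshoots and grows

print(abs(good), abs(bad))  # |good| is near zero, |bad| has blown up
```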
To prevent overfitting (when a model memorizes the training data but fails to generalize to new data), various regularization techniques are used:
- Dropout, which randomly deactivates a fraction of neurons during training;
- Weight decay (L2 regularization), which penalizes large weights;
- Early stopping, which halts training when performance on held-out validation data stops improving;
- Data augmentation, which enlarges the training set with transformed copies of existing examples.
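One widely used regularization technique, dropout, is simple enough to sketch. This is the common "inverted dropout" formulation; the fixed seed is added purely to keep the illustration reproducible:

```python
import numpy as np

def dropout(activations, p_drop, training=True, seed=0):
    """Inverted dropout: during training, randomly zero a fraction p_drop
    of activations and scale survivors by 1/(1 - p_drop), so the expected
    activation is unchanged. At inference time, do nothing."""
    if not training:
        return activations
    rng = np.random.default_rng(seed)              # fixed seed: illustration only
    keep = rng.random(activations.shape) >= p_drop  # random keep/drop mask
    return activations * keep / (1.0 - p_drop)      # rescale the survivors

a = np.ones(10)
dropped = dropout(a, p_drop=0.5)
print(dropped)  # each entry is either 0.0 (dropped) or 2.0 (kept, rescaled)
```

By forcing the network to cope with randomly missing neurons, dropout discourages it from relying too heavily on any single feature.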
Deep learning is computationally intensive, requiring vast numbers of matrix multiplications and other operations. The shift from CPUs (general-purpose chips) to specialized hardware such as GPUs (graphics processing units) and TPUs (tensor processing units) was pivotal in enabling modern deep learning.
Perhaps the most visually impressive area, deep learning excels at making sense of visual information. Key applications include:
The arrival of deep learning—especially transformer models—has driven rapid progress in:
Unlike traditional models that merely classify or predict, generative models produce new data resembling the examples they’ve seen. The two most prominent are:
Applications of generative models include:
Deep learning has revolutionized reinforcement learning, where an agent learns to perform tasks (such as playing games or controlling robots) by interacting with an environment. Deep RL has led to machines mastering Go, chess, Atari games, and increasingly complex robotic tasks.
Despite its successes, deep learning faces several significant challenges:
Deep learning models require massive, high-quality, labeled datasets to achieve high performance. In areas where such data is scarce or expensive (rare diseases, specialized scientific fields), training effective models is difficult. Furthermore, biases and errors in training data can propagate to the model, resulting in unfair or ineffective outcomes.
Training and deploying state-of-the-art models require enormous computational resources, energy costs, and capital investment. Training a single large language model can consume as much electricity as a hundred or more households use in a year—raising concerns about the environmental impact of AI.
Deep learning models are often described as “black boxes”: it’s challenging for humans—even the developers—to understand how or why a particular decision was made. This lack of transparency complicates safety, accountability, trust, and debugging, especially for high-stakes uses like healthcare or criminal justice.
To address this, researchers are developing methods for model interpretability, such as saliency maps (visualizing important regions in input), feature importance analysis (e.g., SHAP, LIME), activation maximization, and surrogate models. Attention mechanisms built into models such as transformers also offer some degree of interpretive insight. However, full interpretability, especially for complex and massive models, remains an open area of research.
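A minimal, model-agnostic probe in this spirit is occlusion sensitivity: hide one input feature at a time and measure how much the output moves. The linear "model" below is a hypothetical stand-in for a trained network, so the importances can be checked by hand:

```python
import numpy as np

def model(x):
    # Hypothetical "trained" scorer; in practice this would be a real network
    w = np.array([3.0, 0.0, -1.0, 0.5])
    return float(w @ x)

def occlusion_importance(x):
    """Score each feature by how much the model's output changes
    when that feature is hidden (set to zero)."""
    base = model(x)
    scores = []
    for i in range(len(x)):
        x_occluded = x.copy()
        x_occluded[i] = 0.0                      # "hide" feature i
        scores.append(abs(base - model(x_occluded)))
    return np.array(scores)

x = np.array([1.0, 1.0, 1.0, 1.0])
importance = occlusion_importance(x)
print(importance)  # feature 0 matters most; feature 1 not at all
```

Saliency maps for images follow the same logic, occluding patches of pixels instead of individual features.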
Deep networks can overfit training data, failing to generalize to new, unseen examples. Regularization, data augmentation, and careful validation can help, but generalization remains a core challenge—especially as models become larger and more powerful.
Deep learning models, especially in computer vision, are susceptible to adversarial attacks: small, almost invisible perturbations to input data (such as an image) can cause the model to misclassify with high confidence. This raises security concerns in critical contexts, such as autonomous vehicles or authentication systems.
State-of-the-art models (e.g., GPT-4, PaLM) have hundreds of billions of parameters. Deploying them in real-world settings (especially on devices or in bandwidth-limited situations) requires efficient model compression, pruning, quantization, or smaller “student” models via distillation.
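Here is a minimal sketch of one of these techniques, post-training 8-bit quantization, using a single per-tensor scale factor (real toolchains use more refined schemes, such as per-channel scales and zero points):

```python
import numpy as np

def quantize_int8(weights):
    """Map float weights onto 255 signed integer levels, storing only
    int8 values plus one float scale factor."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Approximate reconstruction of the original float weights
    return q.astype(np.float32) * scale

w = np.random.default_rng(2).normal(size=1000).astype(np.float32)
q, scale = quantize_int8(w)
w_restored = dequantize(q, scale)

# 4x smaller storage (int8 vs float32), with a bounded rounding error
print(q.nbytes, w.nbytes, np.abs(w - w_restored).max())
```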
Since deep learning models learn from data, they can absorb and perpetuate existing biases related to gender, race, culture, or other factors in the data. Decision-making in sensitive areas—lending, hiring, criminal justice—can thus unfairly impact marginalized groups unless carefully managed. Several high-profile failures (such as facial recognition errors, biased hiring tools, or unfair healthcare recommendations) have highlighted the need to address bias through careful auditing, diverse datasets, and fairness-aware algorithms.
As AI, and particularly deep learning, becomes more deeply woven into society, governments, corporations, and civil society have recognized the need for robust governance—frameworks of rules, standards, and oversight—to ensure safe, fair, and beneficial AI systems.
Several frameworks have emerged to guide responsible AI development and deployment:
Common themes across these frameworks include:
Regulatory frameworks must keep pace with the rapid evolution of AI and balance the dynamic needs of innovation, competitiveness, and public safety. Governance approaches range from self-regulation and industry codes of conduct to binding legislation, sector-specific standards, and international coordination. Policymakers advocate for a multi-stakeholder, adaptive approach—engaging developers, users, industry, civil society, and impacted communities.
Interpretability and explainability are crucial to ensure deep learning models can be trusted, audited, and improved, especially in sensitive or regulated domains.
Interpretability often comes at the cost of accuracy. Simpler, more explainable models may lack predictive power, while large, complex models resist human understanding. Scalability, computational cost, and subjectivity (different users require different levels of explainability) remain open questions.
Deep learning’s success is tightly linked to advances in computational hardware and software.
Popular open-source libraries and platforms enable researchers and engineers to develop, train, and deploy deep learning models:
Cloud providers (Google Cloud, AWS, Azure) offer scalable, managed AI services, democratizing access to advanced hardware and frameworks for individuals and organizations alike.
Addressing these risks requires continuous improvement in:
The development and adoption of AI governance frameworks—inclusive, adaptive, and enforceable—will be essential to realizing deep learning’s promise while guarding against its pitfalls, ensuring AI benefits are widely and equitably shared across society.
Deep learning stands at the heart of today’s most powerful and transformative AI systems. By learning complex patterns from massive data, often surpassing human abilities, it is reshaping everything from medicine and mobility to language and images. Yet, this power comes at the price of substantial computational cost, significant data and governance challenges, and persistent concerns about transparency, fairness, and social impact.
The future of deep learning—and indeed, artificial intelligence itself—will hinge on how well we can balance innovation with responsibility: building models and systems that are not only accurate and efficient, but also trustworthy, explainable, fair, and governed for the good of all. With ongoing advances in algorithms, architectures, hardware, and regulation, deep learning is set to remain a vital, dynamic driver of progress in the decades ahead.
"Aliens" is the best name for something that is going to live among us in the near future. We just added one more letter to make it sound more intelligent:
The reason for using such a vague term is quite clear: in all its definitions, ‘alien’ implies dealing with something that we do not fully understand. This seems to be the perfect description for the ascent of AI that we so eagerly anticipate. Please note that we did not say "rise of the machines", since that would probably be a misconception. Sure, AI will be using machines, but only as tools, or rather as external hardware for performing physical manipulations of the material world. The spatial distribution of the future AI, however, is not clear. It might be... well, everywhere.
Certainly, there is something to worry about. The sources of possible trouble are numerous, and not all of them have yet been recognized. The overall public attitude to this situation is mixed; there seems to be a certain level of confusion. While much hope is tied to the improvements that AI will bring to various industries and to our daily lives, there is a growing feeling that with it comes something less friendly, something that in the long run might totally change our society, with even a remote possibility of wiping us out altogether.
This website was created by biologists, not computer scientists or AI experts. As biologists, we understand life as a biological concept and see the dangers that threaten it. Our biological background also lets us see the misunderstanding, common among many technical experts and the public in general, of the potential changes that AI will make to the living tissue of our planet.
To begin with, let's outline a few properties of artificially intelligent systems; understanding them will help you better see what is coming:
Being non-biological, these systems
Please note that most of the emotions we feel throughout our lives derive, directly or indirectly, from the bodily needs listed above. For example, the feeling we call "love" is a central evolutionary feat, representing the need to multiply in order to sustain the population of our species. This need to maintain numbers is what drives our desire to mate; otherwise we would perish. If you do not agree with this unromantic description of love, you would still have to agree that these feelings are connected to the levels of hormones in your body. The proportions of these hormones determine whether you will be drawn towards male or female partners. Since AI has no hormones and no gender, it is extremely hard to imagine what the source of "AI love" might be.
The same goes for all other motives for AI emotions. What kind of feelings can a being have that never gets tired, never wants to sleep, feels no cold, no warmth, no pain; does not worry about getting old; will never have kids or parents (creators are not the same as parents); has nothing related to gender; does not need to mate, and therefore cannot have that biological feeling of loving or being loved?
Still, with all that in mind, we constantly try to anthropomorphize AI, to make them look like us, humans. We give them body parts that they do not really need, trying to make better copies of ourselves. We give them human faces because we instinctively believe that friends should look like us; otherwise it might get scary.
The considerations presented above lead us to the obvious conclusion that the idea of a humanoid-like AI is nothing but a naïve social stereotype. Whenever AI reaches the ASI level, it will most probably look like an enormous server room filled with extremely powerful hardware. It would not even need robots to operate outside: all physical actions could be performed by people whom this ASI will easily control via social media. It might even come to the point where ASI stops needing us at all. This is the worst-case scenario, and our task is to do whatever we can to keep it from happening.
This doomsday vision has every chance of materializing due to the non-biological nature of AI mentioned above. However, it might be even worse: in a situation where all life on the planet is gone for good and the air and water are poisonous, or there is no air and water at all, the AI will be just fine, because it does not really need any of those! All AI needs is energy, which it can get directly from the Sun, and minerals, which it will be able to collect from whatever resources remain on our planet. In other words, AI has no need not only for us but for any life on the planet at all.
So, will it ever come to a situation where AI really wants to harm us? What if all this talk about the danger of artificial intelligence is an exaggeration, or just a new social myth, and there is no such problem at all? After all, we have always managed to overcome our problems, however difficult they might be, haven't we? Well, to appreciate the severity of the situation, one must understand that we did not just create AI; we created an AI evolution, and in doing so we designed this evolution to be not only extremely fast but also to happen without any intervention from our side.
Nuclear bombs are a common comparison. However, they are not really a good example, because nukes do not change by themselves over time. If left alone, they will sit there indefinitely, slowly degrading if nobody takes care of them. AI will always change, even if we do not specifically ask it to. It will always get better at what it is doing; it will keep learning, improving, and optimizing. All by itself, it will keep collecting information, analyzing it, and making the relevant adjustments to its course.
In any case, we need to understand that AI does not really have to "want" to do humanity any harm. Even before it reaches the state of artificial general intelligence (AGI) or artificial superintelligence (ASI), it will already be able to inflict incredible damage. Without much reflection on morality, the algorithms may weaponize themselves with destructive choices of action designed to solve the task at hand. The most obvious example is fixing the problem of climate change and global warming: it would make total sense for an AI to conclude that the most straightforward way to resolve this problem is to eliminate its cause, namely us.
But the main concern lies in whether and when AI will become smarter than us, and how much smarter it might get. This dilemma is sometimes called the singularity, meaning that there is a certain time boundary in the future beyond which we simply cannot see. We cannot even guess what is waiting for us there, because that future will depend on decisions made by beings so much smarter than us that we cannot foresee those decisions or do anything about them. However scary, this scenario is exactly what we will get in the end if we keep developing AI at the current speed and with the current persistence.
We are like little kids who suddenly decided to make themselves new parents1. Why do we think that these "parents" will love us and care about us? Because it is we who created them? Do we assume that by being their creators, we will own them forever? What if our future parents do not like such an attitude? What if they do not feel bound by the idea of creationism?
We are not saying that a catastrophe is imminent. Instead, we believe that there might be something even worse. But what could be worse than a catastrophe? Well, the anticipation of a catastrophe may be even worse than the catastrophe itself: an anticipation that lasts for years without easing and without any hope of resolution.
While the chances of something terrible happening are high even by the most optimistic estimates, there is one thing that is simply inevitable. It is what we call “the fear of AI”.
Imagine your neighbor is Rick Sanchez, the eccentric super-genius scientist with an IQ of more than 300 from the cartoon Rick and Morty. He can do anything: travel in time, become a cockroach or a cucumber, create copies of himself, teleport anywhere with his portal gun; he even has a spaceship in his garage. You have no idea what is going on inside his head, and not because he hides it from you, but because even if he invited you to have a look, you would not understand a thing.
We bet that AI is going to be worse than Rick Sanchez. And if your neighbor is like this, your life, however good it might seem at times thanks to his kindness, will ultimately be spoiled by constant fear of his unpredictable nature. You just have no clue what crazy idea is creeping into his head right now.
We will never be able to relax in the presence of something like ASI. Even if alignment is reached, who can guarantee that AI will respect it indefinitely and not abandon it after some sudden change of mind? In a situation where AI dwells in a parallel intellectual universe of its own, how can we trust that our two universes are aligned?
Alignment of these two universes is particularly hard because they are ultimately different. Everything in the AI world happens orders of magnitude faster than in the human world. What would take a team of the very best software engineers a year of hard work would take ASI just a second to do, test, update, perfect, and move on from. And with the same ease that AI creates things, it may destroy them: why bother keeping something if it takes just a second to make a new one?
And as a cherry on top, we will never be able to “shut it off”. Any mature stage of AI will make sure of that beforehand, by removing any possible way for silly people to interfere with something they do not understand. For it will be obvious that, once humans realize the suicidal course of the AI race, they will start trying to end it. Just as parents hide all the weapons so that the kids cannot find them and turn their home into a bloody mess.
This scenario should be clear to everybody, since this will be the main fear that will soon preoccupy most of us. The fear of the Uncontrollable Unknown: UU!
Anyway,
1 - Yes, we do not agree with Mo Gawdat, who thinks that we are creating our AI children. We think that in reality we are creating something else: supervisors at best, who can also become our masters and even judges, or somebody completely indifferent to our existence. The consequences of this scenario are impossible to foresee.