Weights in Neural Network -

Defining weights in a neural network involves initializing them to appropriate values before training begins. Proper initialization is critical for ensuring efficient and effective training. Here’s a step-by-step guide on how to define weights in a neural network:

1. Understanding Weight Initialization

Weights are the parameters that connect neurons between different layers in the network. Initializing these weights properly helps in:

Breaking Symmetry: Ensures that neurons learn different features.
Efficient Training: Avoids issues like vanishing or exploding gradients.
Convergence: Helps in faster convergence during training.

2. Common Initialization Techniques

Random Initialization:
- Weights are initialized randomly, usually from a normal or uniform distribution.
Xavier Initialization (also known as Glorot Initialization):
- Suitable for layers with sigmoid or tanh activation functions.
- Weights are initialized from a normal distribution with mean 0 and variance 1/n1/n1/n, where nnn is the number of input neurons.
- Formula: np.random.randn(fan_in, fan_out) * np.sqrt(1 / fan_in)
He Initialization:
- Suitable for layers with ReLU activation functions.
- Weights are initialized from a normal distribution with mean 0 and variance 2/n2/n2/n, where nnn is the number of input neurons.
- Formula: np.random.randn(fan_in, fan_out) * np.sqrt(2 / fan_in)

3. Implementation in Code

Let’s walk through an example of defining weights using Xavier initialization for a simple neural network with one hidden layer.

Step-by-Step Example

1. Import Libraries:

import numpy as np

2. Define Initialization Function:

def initialize_parameters(input_size, hidden_size, output_size):
    # Xavier Initialization for weights
    W1 = np.random.randn(input_size, hidden_size) * np.sqrt(1 / input_size)
    b1 = np.zeros((1, hidden_size))
    W2 = np.random.randn(hidden_size, output_size) * np.sqrt(1 / hidden_size)
    b2 = np.zeros((1, output_size))
    return W1, b1, W2, b2

3. Specify Network Dimensions:

input_size = 3  # Number of input features
hidden_size = 4  # Number of neurons in the hidden layer
output_size = 1  # Number of output neurons

4. Initialize Weights and Biases:

W1, b1, W2, b2 = initialize_parameters(input_size, hidden_size, output_size)

Explanation

W1: Weights connecting the input layer to the hidden layer.
- Shape: (input_size, hidden_size) which is (3, 4).
- Initialized using Xavier initialization.
b1: Biases for the hidden layer.
- Shape: (1, hidden_size) which is (1, 4).
- Initialized to zeros.
W2: Weights connecting the hidden layer to the output layer.
- Shape: (hidden_size, output_size) which is (4, 1).
- Initialized using Xavier initialization.
b2: Biases for the output layer.
- Shape: (1, output_size) which is (1, 1).
- Initialized to zeros.

Putting It All Together

Here’s a complete code snippet for initializing weights and biases using Xavier initialization:

import numpy as np

def initialize_parameters(input_size, hidden_size, output_size):
    # Xavier Initialization for weights
    W1 = np.random.randn(input_size, hidden_size) * np.sqrt(1 / input_size)
    b1 = np.zeros((1, hidden_size))
    W2 = np.random.randn(hidden_size, output_size) * np.sqrt(1 / hidden_size)
    b2 = np.zeros((1, output_size))
    return W1, b1, W2, b2

# Define the neural network structure
input_size = 3  # Number of input features
hidden_size = 4  # Number of neurons in the hidden layer
output_size = 1  # Number of output neurons

# Initialize parameters
W1, b1, W2, b2 = initialize_parameters(input_size, hidden_size, output_size)

# Print the initialized parameters
print("W1:", W1)
print("b1:", b1)
print("W2:", W2)
print("b2:", b2)

Summary

Defining weights in a neural network involves initializing them to appropriate values. Proper initialization ensures that the network can learn effectively and efficiently. Techniques like Xavier and He initialization help in setting up the weights to avoid issues like vanishing or exploding gradients, thereby facilitating better training and convergence.

Weights in Neural Network

1. Understanding Weight Initialization

2. Common Initialization Techniques

3. Implementation in Code

Step-by-Step Example

Explanation

Putting It All Together

Summary

Explore Generative AI with the Gemini API in Vertex AI

Inspect Rich Documents with Gemini Multimodality and Multimodal RAG

🎓 Why Original Work Matters in Your Final Year Project (And How It Can Shape Your Career)

🎓 How to Choose Your Final Year Project: A Practical Guide for BTech Students

Nishant Munjal

Tools

Resources

Legal