विभिन्न न्यूरल नेटवर्क आर्किटेक्चरहरु विभिन्न डेटा र कार्यहरुको लागि उपयुक्त हुन्छन् — CNNs छविहरुको लागि, RNNs अनुक्रमहरुको लागि, र transformers भाषाको लागि (र बढ्दै गरेर सबै चीजको लागि)। मुख्य प्रकारहरु बुझ्नुले AI ले विभिन्न समस्याहरु कसरी सम्भाल्छ भन्ने कुरा स्पष्ट गर्छ।
मुख्य आर्किटेक्चरहरु
CNN (Convolutional Neural Network) → for IMAGES/spatial data:
→ uses convolutions to detect local features (edges, shapes) hierarchically
→ for: image classification, object detection, computer vision
RNN (Recurrent Neural Network) → for SEQUENCES/time-series:
→ processes sequences step by step, maintaining a 'memory' of previous inputs
→ for: text, time-series, speech (older approach; LSTM/GRU variants)
⚠️ struggles with long sequences; largely SUPERSEDED by transformers for language
TRANSFORMER → for SEQUENCES (language) and increasingly everything:
→ attention mechanism; parallel; the dominant modern architecture (LLMs)
→ for: language (LLMs), and now vision, audio, multimodal
