소개
This skill provides specialized guidance for reverse engineering black-box neural networks, specifically targeting two-layer ReLU architectures. It outlines robust methodologies for recovering hidden parameters like weight matrices and biases through techniques such as critical point analysis, gradient-based extraction, and activation pattern enumeration. By emphasizing true black-box treatment, the skill ensures that extracted models are verified based on functional equivalence rather than hardcoded implementation details, making it an essential resource for security researchers and machine learning developers working on model stealing, extraction, or auditing tasks.