This study aimed to assess the feasibility of employing audio and vision technologies as an integrated monitoring system to monitor the health and behavior of pigs under different environmental conditions and to associate these factors. A total of 81 growing pigs ((Landrace x Largewhite) x Duroc) with distinct weight categories (10.47 ± 2.57 kg and 28.81 ± 5.03 kg) were distributed into two distinct housing conditions: Control (characterized by low NH3 levels and low stocking density) and Treatment (marked by elevated NH3 levels and varying stocking densities). Each house was installed with a SoundTalks monitor for automated daily evaluation of respiratory health status (ReHS), and a camera with a microphone to facilitate manual tracking of respiratory symptoms and behavioral patterns. Results showed that the Treatment group encountered significantly high room temperature, NH3 concentration, and carbon dioxide levels, resulting in compromised growth performance—a phenomenon further exacerbated in high stocking density conditions. Pigs within the treatment exhibited increases (p < 0.05) in lateral and total lying behaviors, and incidences of ear and tail injuries, increases screaming frequency, conjunctivitis incidences, coughing frequency, elevated respiration rates, and a decrease in ReHS. The environmental conditions affect the behavior and health of the pigs. Moreover, the behavioral patterns of the pigs are associated with their health conditions. The SoundTalks system did not trigger any warnings during the experimental period. Nevertheless, trend analysis indicated a significant reduction in respiratory health in the treatment. In conclusion, this study underscores the efficacy of merging audio and visual technologies to holistically monitor pig health and behavior, enabling enhanced management strategies. Findings emphasize that monitoring the ReHS trend serves as a pivotal marker for identifying respiratory health problems, complementing the system's innate alarm functions.