Models for estimating animal density from camera traps require four parameters informing detection: movement speed, daily activity level, staying time (duration animals remain within the detection zone), and effective detection distance. These parameters traditionally come from labor-intensive manual measurements and auxiliary telemetry. Recent advances in computer vision can provide the positions of animals in camera trap images, which have been used for distance sampling. We extend this approach to extract all four parameters from imagery, providing the first AI-derived estimates of movement speed and staying time from automated coordinate tracking. We also introduce a new joint multi-species hierarchical distance function that estimates deployment-specific effective detection distances while borrowing strength across species through partial pooling. Our pipeline integrates MegaDetector for animal detection, the Segment Anything Model for segmentation, and Dense Prediction Transformers for monocular depth estimation. From frame-level coordinates, we reconstruct movement trajectories across burst sequences to estimate speed with size-biased distribution corrections, calculate staying time through bounding box interpolation, and estimate activity levels from detection timestamps. The joint hierarchical distance function decomposes the detection scale parameter into a shared deployment-level effect and species-specific offsets, so species effects represent deviations from the multi-species average, allowing data-rich species to inform detection conditions where rare species have few observations. AI-derived scene depth enters the model as a covariate on detection range, providing a vegetation openness metric from the same pipeline. To address position errors from depth estimation, we apply data quality filters. We processed 122,574 frames from 181 deployments across montane forests in Washington and Montana, generating parameter estimates for 12 species without manual annotation. Automated speed estimates produced day ranges 2.7 to 4.3 times GPS telemetry-derived daily distances, reflecting differences between encounter velocity within detection zones and landscape-scale displacement. Deployment-level variation in detectability exceeded species-level differences 3:1, with scene depth strongly predicting detection range; mean effective detection distances ranged from 4.1 to 7.6 m. Applied to a Random Encounter Model, these parameters yielded a white-tailed deer density estimate of 21.4 animals/km^2 and the Random Encounter Staying Time model yielded 11.6 animals/km^2 in Montana. This pipeline enables scalable density estimation across large camera trap networks.
McMurry, S., Alyetama, M., Goldstein, B., Kays, R.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 6
- Comments 0
