ABSTRACT Gap: Automated spike sorting algorithms have revolutionized the way neuronal activity is extracted from extracellular recordings, yet they remain imperfect. Specifically, inaccurate acceptance of noise-based units not only leaves researchers with clusters that require extensive manual curation, an essential but time-consuming process, that also leads to significant subjectivity in the selection of units. In an era of high-density probes like Neuropixels, where an hour of data can exceed 80 GB, manual curation is no longer scalable, automation of standard criteria can speed data curation and ensure quality of datasets. Here, we developed a semi-automated curation pipeline to label the quality of units after automated curation by Kilosort. Approach: Our algorithm standardizes criteria for labeling of Noise, Multi-Unit Activity (MUA), and Good Units using a combination of spike rate, spike timing metrics (from autocorrelogram), and waveform-based physiological features such as peak amplitude, slopes, half-width, and inter-channel correlation. Based on these features, clusters are assigned standardized labels (good, noise, multi-unit activity) that can be imported directly into Phy, where they serve as curation aids rather than absolute classifications, supporting but not replacing expert judgment. Heuristically, 'noise' units are those unlikely to be neuronal in origin; 'MUA' includes units with significant neural contribution (i.e., neuronal waveform) but with some degree of clear imperfection to be further cleaned, and 'good' units are those without any clear deviation from ideal unit criteria. By ensuring accurate selection of acceptable units, we enable robust downstream analyses such as neural decoding and longitudinal tracking of neuron identity. Thresholds for all metrics were chosen to maximize the matching of algorithm output to that of 2 expert manual curators. Of note, users may alter thresholds either based on their own judgment or using an included tool to semi-automatically find thresholds that optimize SpikeCleaner with their own expert curation. Results: To benchmark, we compared the outputs of our algorithm to expert-labels curated in Phy by two expert users across three recordings. SpikeCleaner achieved an average of 97% accuracy vs. experts & 92% F1 score in classifying Single Units. It achieved an accuracy of 97% & 92% F1 score in full-category agreement (SU, MUA, Noise), and 97% accuracy & 95% F1 score in distinguishing Neuronal vs. Non-Neuronal units.
Zutshi, D., Berezhnoi, D., Ghimire, A., Hartner, J., Kim, D., Watson, B. O.
Advertisement
Stats
- Recommendations n/a n/a positive of 0 vote(s)
- Views 2
- Comments 0
