    "behavior_dist_threshold",
    "behavior_dist_seg",
    "avg_bout_duration",
    "_stats_sample_count",
Curious, because `_stats_sample_count` has a leading underscore, unlike the rest. Is this intentional?
It's supposed to signify that it is pseudo-private.
Yes. `_stats_sample_count` is an internal value that should not be used as a feature by the end user. The difference in style is intended to signal caution to the reader (and, for those familiar with Pythonic style, to serve as an indicator not to touch it).
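For readers less familiar with the convention, here is a minimal sketch of how downstream code could honor the leading underscore when picking feature columns. The helper `public_feature_columns` is hypothetical and not part of this PR; only the column names come from the diff above.

```python
# Column names as they appear in the diff under review.
BOUT_STAT_COLUMNS = [
    "behavior_dist_threshold",
    "behavior_dist_seg",
    "avg_bout_duration",
    "_stats_sample_count",
]


def public_feature_columns(columns):
    """Return only the columns that do not start with an underscore.

    Hypothetical helper: it treats a leading underscore as "internal,
    do not use as a feature", mirroring the Python naming convention.
    """
    return [name for name in columns if not name.startswith("_")]


features = public_feature_columns(BOUT_STAT_COLUMNS)
print(features)
```

The underscore is only a convention, so nothing stops a caller from reading `_stats_sample_count` directly; a filter like this just makes the intent explicit.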
  # Calculate time spent in behavior
  # TODO: Do we need to make `5` a configurable parameter?
- aggregated[f"bin_avg_{bin_size*5}.{behavior}_time_secs"] = (
+ aggregated[f"bin_sum_{bin_size * 5}.{behavior}_time_secs"] = (
      aggregated[time_behavior_col]
      / (aggregated[time_behavior_col] + aggregated[time_not_behavior_col])
I feel like it would be good to make it a configurable param (defaulted to 5) or, if a param isn't appropriate, a constant (well... as close as Python gets to constants, anyway).
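A minimal sketch of what that suggestion could look like, combining a module-level constant with a defaulted parameter. The names `DEFAULT_BIN_MINUTES` and `time_column_name` are hypothetical; the `bin_sum_` prefix and the `_time_secs` suffix are copied from the diff line under discussion.

```python
# Assumption: the input summary table uses 5-minute bins, as implied by the
# hard-coded `5` in the diff. The constant name is hypothetical.
DEFAULT_BIN_MINUTES = 5


def time_column_name(bin_size, behavior, bin_minutes=DEFAULT_BIN_MINUTES):
    """Build the aggregated time column name for a given bin count.

    `bin_size` is the number of input bins being aggregated, and
    `bin_minutes` is the width of one input bin in minutes.
    """
    return f"bin_sum_{bin_size * bin_minutes}.{behavior}_time_secs"


# Default behavior matches the hard-coded version in the diff.
default_name = time_column_name(4, "grooming")
# A caller with 1-minute input bins can override the width.
custom_name = time_column_name(4, "grooming", bin_minutes=1)
print(default_name, custom_name)
```

Here `"grooming"` is only a placeholder behavior name for the example.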
Agreed. The actual change is to make it detectable from the input table data (and not a param). Right now, this script is implicitly coupled to the input summary table having 5-minute bins.
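One way the detection could work, sketched under a stated assumption: the input summary table exposes the start time of each bin in minutes. The layout of the real table is not shown in this thread, so `bin_starts` and `infer_bin_minutes` are hypothetical.

```python
def infer_bin_minutes(bin_starts):
    """Infer the bin width in minutes from a sequence of bin start times.

    Assumes `bin_starts` holds each bin's start time in minutes (possibly
    repeated, e.g. once per animal). Raises ValueError if the width cannot
    be determined or is not uniform.
    """
    starts = sorted(set(bin_starts))
    if len(starts) < 2:
        raise ValueError("need at least two distinct bin starts to infer a width")
    # Collect the gaps between consecutive distinct start times.
    widths = {later - earlier for earlier, later in zip(starts, starts[1:])}
    if len(widths) != 1:
        raise ValueError(f"bins are not uniformly spaced: {sorted(widths)}")
    return widths.pop()


detected = infer_bin_minutes([0, 0, 5, 5, 10, 10, 15])
print(detected)
```

Failing loudly on non-uniform spacing keeps the current silent assumption from simply moving into the detection code.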
Integrates the newly added bout statistics into the Nextflow pipeline.
Depends on KumarLabJax/JABS-postprocess#51 and KumarLabJax/JABS-postprocess#50
JABS feature tables will now include:
New JABS heuristic classifiers:
Additional tweaks: