Has anyone calculated inter-rater reliability for qualitative data (especially where the codes emerged from the data)? If so, how did you calculate it, and what formula(s) did you use?
Thoughts?
Let me know if this is unclear or you need more info.