Ushahidi and our community provides datasets for research purpose.This is a document to determine what will be the best process to open datasets for research, but be respectful of privacy and security guidelines.
Outline for software company and deployers
purpose of the dataset and potential uses
what is the policy for releasing data
what needs to be anonymized
what is the policy for releasing data
what is the data storage policy if released to the Ushahidi community
methodology for cleaning data
what needs to be anonymized:
1. Name(be it first, last or family name)
2. Phone numbers
3. Email addresses
4. twitter handles
What are the data guidelines to be followed for sharing and guidelines
methodology for cleaning data
What are the data guidelines to be followed
For researchers:
1) purpose
2) why this was cleaned/filtered
3) process
Use Case: Haiti
Document process for Haiti data cleanup
do an internal check - eg. random number generated and spot check