ccs-amsterdam / digitaltrackingworkshop18

Github page for the Digital Tracking Workshop in Amsterdam, 2018

R 0.24% CSS 1.85% HTML 95.44% JavaScript 2.47%

digitaltrackingworkshop18's People

damian0604

digitaltrackingworkshop18's Issues

Secure storage

To ensure the replicability of our results when researching tracking data, the data need to be stored securely for at least 5 years. I would be interested in best practices around data storage.

Sampling strategies & rare events

A major challenge in my experience is that, even with large samples, most interesting behavior is very rare. That is, most people do not visit a given, specific website on a given day, let alone read the same article. I'd like to discuss strategies to address this problem.
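To make the scale of the problem concrete, here is a rough back-of-the-envelope sketch in Python. The panel size, the daily visit probability, and the function names are all hypothetical, and the calculation assumes every panelist has the same, independent chance of visiting the site on a given day, which real browsing data will not satisfy.

```python
import math
from fractions import Fraction

def expected_visitors(sample_size, daily_visit_prob):
    """Expected number of panelists who visit the site on a given day."""
    return sample_size * daily_visit_prob

def sample_size_for(target_visitors, daily_visit_prob):
    """Smallest panel whose expected daily visitor count reaches the target."""
    return math.ceil(target_visitors / daily_visit_prob)

# Hypothetical numbers: 1 in 200 panelists visits the site on a given day.
visit_prob = Fraction(1, 200)

visitors_in_panel = expected_visitors(2000, visit_prob)  # panel of 2,000
panel_needed = sample_size_for(100, visit_prob)          # aiming for 100 visitors per day

print(visitors_in_panel, panel_needed)
```

Under these assumptions a panel of 2,000 yields only about 10 visitors per day, and reaching 100 daily visitors would take a panel of 20,000, before even asking how many of them read the same article.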

Roxy Development Mailing List

Anyone interested in continuing the conversation about Roxy 3.0 can join this mailing list: http://eepurl.com/dLHfXs

From Roxy to Web Historian and Back Again

A presentation by Ericka Menchen-Trevino and Chris Karr

We will discuss the open source tools we have already developed to incorporate web browsing history into social science methods (interviews, surveys, experiments), and sketch out our plans for a new tool that incorporates mobile data.

Funding and publishing opportunities

Maybe we should also discuss what kinds of (collaborative) funding opportunities there are, and whether it makes sense to work toward something like a special issue or a related publication.

Legal issues regarding digital tracking data

Notwithstanding our best intentions, there can be legal complications with collecting and storing digital tracking data. As such, it might be good to have a round table discussion, led by participants who have dealt with (or are dealing with) this issue.

I have already heard from some participants who have experience in this matter. Who would be willing to take the lead on this, and are there any points in particular that we really should discuss?

Network analysis?

When working with the browser tracking data that Judith and colleagues collected, I considered for a while relying primarily on concepts (and code libraries/tools) from network analysis, such as igraph. For various reasons we didn't really do this, with the result that our code feels somewhat home-brewed to me (in essence, we turned logs of visited sites into "grams" such as "google.com -> facebook.com" and then worked with those).
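As an illustration of the "gram" idea, here is a minimal Python sketch using only the standard library. The log format (tuples of user, timestamp, and domain), the sample rows, and the function name `domain_bigrams` are assumptions made for this example; it is not the code used in the project.

```python
from collections import Counter
from itertools import groupby
from operator import itemgetter

# Hypothetical visit log: (user_id, timestamp, domain).
visits = [
    ("u1", 1, "google.com"),
    ("u1", 2, "facebook.com"),
    ("u1", 3, "nytimes.com"),
    ("u2", 1, "google.com"),
    ("u2", 2, "facebook.com"),
]

def domain_bigrams(log):
    """Count consecutive domain pairs within each user's visit sequence."""
    counts = Counter()
    # Sort by user, then timestamp, so each user's visits are contiguous and ordered.
    ordered = sorted(log, key=itemgetter(0, 1))
    for _, rows in groupby(ordered, key=itemgetter(0)):
        domains = [domain for _, _, domain in rows]
        # Pair each visit with the one that follows it.
        for source, target in zip(domains, domains[1:]):
            counts[f"{source} -> {target}"] += 1
    return counts

bigram_counts = domain_bigrams(visits)
for gram, n in bigram_counts.most_common():
    print(gram, n)
```

Grouping by user before pairing keeps the last visit of one person from being chained to the first visit of the next. A real pipeline would probably also need to split sequences into sessions, which this sketch ignores.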

I would be interested in whether people at the workshop are seriously considering relying on network analysis to study browser clickstream data and, if not, what alternative strategies they have. In retrospect, what we did feels like reinventing the wheel, and I personally would feel better relying on well-developed packages for some of the very generic data processing issues involved. On the other hand, graph analysis has its own caveats. I know that Sandra González-Bailón and colleagues have used such an approach with ComScore data, but I think in that particular case the approach fit well with their interest in the centrality of news sources.
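To show what a network view of clickstream data could look like in its simplest form, the sketch below treats domain-to-domain transitions as weighted directed edges and ranks domains by weighted in-degree. This is plain Python rather than igraph, the transition counts are invented, and weighted in-degree is just one simple centrality measure chosen for illustration, not necessarily the one used in the ComScore work mentioned above.

```python
from collections import defaultdict

# Hypothetical transition counts: (source_domain, target_domain) -> number of transitions.
transitions = {
    ("google.com", "nytimes.com"): 3,
    ("facebook.com", "nytimes.com"): 2,
    ("google.com", "facebook.com"): 4,
}

def weighted_in_degree(edges):
    """Sum the weights of incoming edges for every domain in the graph."""
    totals = defaultdict(int)
    for (source, target), weight in edges.items():
        totals[target] += weight
        totals[source] += 0  # ensure domains with no incoming edges still appear
    return dict(totals)

in_degree = weighted_in_degree(transitions)

# Domains ordered from most to least incoming traffic.
ranking = sorted(in_degree, key=in_degree.get, reverse=True)
print(ranking)
```

A dedicated graph package would offer many more measures on the same edge list. The point here is only that the transition "grams" already are an edge list.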

In any case, I think a dedicated library for turning clickstream information in table format into something more meaningful, along with some pretty plots, would be useful. We wrote a few functions that point in this direction, but nothing comprehensive yet.

Tools for digital tracking

Many of us have used and/or developed tools for collecting digital trace data. It would be great if one of the outcomes of this workshop is a curated list of good tools, and a general overview of what types of tools work well for what purposes. Also, for the tools being developed by participants, it would be great to share ideas regarding future plans, and to combine efforts where possible.

We can combine this with tool demos (but please let us know beforehand so we can make a schedule).

Tracking mobile devices

It seems especially tricky to track mobile devices (in particular if we are interested in more than the top-level domain). I have done some exploring with regard to the options and am curious how far others got.

Overview of tracking techniques

Who will present: Vincent van Hees (NLeSC) & The Amsterdam Team
Presentation topic: Tracking techniques: what works, what doesn't? [link to paper]
Linked to: #2

Relevant articles

The literature list compiled by @fe_loe:

literature.pdf

Interface and visualization

Having talked to many communication scholars about tracking data in recent years, I have noticed that one major obstacle to getting engaged with computational communication science is that it looks far too difficult. So it would be worthwhile to map useful interfaces and visualization options for non-coding researchers interested in tracking data.