grisham's Issues

Use Lucene/solr for text search

Can we use Lucene or Solr for our text search?
Can we do this while still respecting our algorithms (e.g. weighting results using the user model)?

Rewrite the Graph visualization code

The current graph visualization code is broken.
Additionally, it uses arbor.js.
Let's explore d3.js at http://d3js.org/ because it is the more popular package.
We need to make sure d3.js is dynamic and can respond to events such as clicks.

6 degrees of separation

Here is a problem for you @virup @clintpgeorge @supriyan,

Given two papers, how would you write a db query to find the "shortest" path between them?

Vertices are papers and edges are citations/references. We can think of the edges as being undirected. It is certainly possible that no path exists between two papers.

Can you implement a solution to this?
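
As a starting point, here is a minimal sketch of the search itself, done in memory rather than as a db query. It assumes the citation pairs have already been read from the references table into a list of `(citing_id, cited_id)` tuples; those column names and the sample ids are made up for illustration. It runs a breadth-first search over the undirected graph and returns `None` when no path exists.

```python
from collections import defaultdict, deque


def shortest_path(edges, source, target):
    """Return the shortest list of paper ids from source to target, or None."""
    # Treat every citation as an undirected edge.
    neighbors = defaultdict(set)
    for citing, cited in edges:
        neighbors[citing].add(cited)
        neighbors[cited].add(citing)

    # Breadth-first search; `parent` doubles as the visited set.
    parent = {source: None}
    queue = deque([source])
    while queue:
        paper = queue.popleft()
        if paper == target:
            # Walk the parent links back to the source.
            path = []
            while paper is not None:
                path.append(paper)
                paper = parent[paper]
            return path[::-1]
        for nxt in neighbors[paper]:
            if nxt not in parent:
                parent[nxt] = paper
                queue.append(nxt)
    return None  # the two papers are not connected


# Hypothetical rows, as if loaded from the references table.
edges = [(1, 2), (2, 3), (3, 4), (1, 5), (6, 7)]
print(shortest_path(edges, 1, 4))  # [1, 2, 3, 4]
print(shortest_path(edges, 1, 6))  # None
```

Because every edge has the same cost, breadth-first search is enough to find a path with the fewest hops; a weighted notion of "shortest" would need a different algorithm.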

Password protect the website

Password protect the website while it is under construction. A simple .htaccess password should be sufficient.

Visualize the paper data

In order to understand the data we need to look at the arrangement of papers.

We need a graph (preferably interactive) to be able to explore our current dataset.

We could use a Python package such as http://networkx.lanl.gov/.

You can get the connection info from the references table in the db.

Would anybody be able to do this ASAP?

CitationRank (like PageRank except for papers/citations)

Here is another problem for you @virup @clintpgeorge @supriyan

We want to calculate a global importance factor for all the papers in the data set.
This is similar to PageRank. The value of a paper, CR(p), should be the probability that I land on p if I am randomly looking for an important paper.

A paper with citations should have a higher value than a paper with no citations.

A paper with P citations should have a smaller value compared to a paper with G citations of citations where |P| - |G| < sigma.

The references of a paper do not affect the paper's score. (Although we should have a self-citation penalty.)

Also, can we compute these values using SQL?
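
One possible baseline is plain PageRank by power iteration over the citation graph, sketched below in memory rather than in SQL. The function name `citation_rank`, the damping factor, and the sample data are assumptions for illustration. It does not implement the sigma condition above, and its only nod to the self-citation penalty is to ignore a paper citing itself, since author data is not assumed here.

```python
def citation_rank(citations, damping=0.85, iterations=50):
    """PageRank-style scores for papers; citations are (citing, cited) pairs."""
    papers = sorted({p for pair in citations for p in pair})
    n = len(papers)

    # out_links[p] = set of papers that p cites (self-loops dropped).
    out_links = {p: set() for p in papers}
    for citing, cited in citations:
        if citing != cited:
            out_links[citing].add(cited)

    rank = {p: 1.0 / n for p in papers}
    for _ in range(iterations):
        # Rank held by papers that cite nothing is spread evenly over all papers.
        dangling = sum(rank[p] for p in papers if not out_links[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in papers}
        # Each paper passes its rank, split evenly, to the papers it cites.
        for p in papers:
            for cited in out_links[p]:
                new_rank[cited] += damping * rank[p] / len(out_links[p])
        rank = new_rank
    return rank


# Hypothetical data: A is cited by B, C and D; B is cited by D only.
citations = [("B", "A"), ("C", "A"), ("D", "A"), ("D", "B")]
scores = citation_rank(citations)
for paper in sorted(scores, key=scores.get, reverse=True):
    print(paper, round(scores[paper], 4))
```

The scores sum to 1, so each one can be read as the probability of landing on that paper in a random walk that follows citations.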

Add user feedback to define topics

Can we add user feedback to improve/further define topics?
One way is to drag and drop the order of the words in the topic page.

This is opening up the ML black box.

Graph visualization meanings

The size of a node depends on the number of papers that cite the paper.

The length of a link is the time between the two papers.

The thickness/color of the line may be the similarity of the papers and the user model.
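
The first two encodings could be computed along these lines before handing the data to the drawing code. This is only a sketch: the function name, the scaling constants, and the sample years are hypothetical, "time between the two papers" is taken to mean the difference in publication year, and the thickness/color encoding is left out because it depends on the user model.

```python
from collections import Counter


def visual_attributes(paper_years, citations,
                      base_size=4.0, size_step=2.0, px_per_year=20.0):
    """Map papers and (citing, cited) pairs to node sizes and link lengths."""
    # Node size grows with the number of papers citing the node.
    cited_counts = Counter(cited for _, cited in citations)
    nodes = {
        paper: {"size": base_size + size_step * cited_counts[paper]}
        for paper in paper_years
    }
    # Link length is proportional to the gap in publication years.
    links = [
        {
            "source": citing,
            "target": cited,
            "length": px_per_year * abs(paper_years[citing] - paper_years[cited]),
        }
        for citing, cited in citations
    ]
    return nodes, links


# Hypothetical papers with publication years.
paper_years = {"A": 2005, "B": 2009, "C": 2012}
citations = [("B", "A"), ("C", "A"), ("C", "B")]
nodes, links = visual_attributes(paper_years, citations)
print(nodes)
print(links)
```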

Multiple topic exploration

Can we allow users to add more than one topic during their search?
We want to allow users to explore more than one topic. We need to combine the single topic algorithms.

SIAM DM 13 paper

@virup @clintpgeorge @supriyan
You guys think we could push out a paper to SIAM DM http://www.siam.org/meetings/sdm13/ ?

The deadline is 10/12.

It would be a < 9 page paper. We would have to develop some of our ideas more and provide nice evaluation/performance numbers, but I think it is possible if you guys are up for a challenge.

Check out the themes at the bottom of that link.

Virtual topics

Can we allow users to develop virtual topics? These are topics that are weighted combinations of existing topics.
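
A rough sketch of what a weighted combination could look like, assuming each existing topic is stored as a word-to-probability mapping. That representation, the function name `virtual_topic`, and the sample topics are assumptions for illustration, not how the topics are necessarily stored. The weights are normalized so the result is again a probability distribution over words.

```python
def virtual_topic(topics, weights):
    """Blend existing topics into one word distribution using user weights."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")

    combined = {}
    for name, weight in weights.items():
        share = weight / total  # normalized weight of this topic
        for word, prob in topics[name].items():
            combined[word] = combined.get(word, 0.0) + share * prob
    return combined


# Hypothetical topics and user-chosen weights.
topics = {
    "databases": {"query": 0.6, "index": 0.4},
    "ml": {"model": 0.7, "query": 0.3},
}
blend = virtual_topic(topics, {"databases": 3, "ml": 1})
print(blend)
```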
