-
Apple Inc.
- San Francisco
- https://www.linkedin.com/in/singhkaranjeet
-
sparkler Public
Forked from USCDataScience/sparklerSpark-Crawler : Evolving Apache Nutch to run on Spark.
Java Apache License 2.0 UpdatedOct 14, 2020 -
-
nba-analysis Public
Forked from inner-product/nba-analysisSimple data engineering task
Scala UpdatedApr 24, 2020 -
essential-scala-code Public
Forked from inner-product/essential-scala-codeExercises for Inner Product's Essential Scala Course
Scala Apache License 2.0 UpdatedApr 20, 2020 -
tika Public
Forked from apache/tikaMirror of Apache Tika
Java Apache License 2.0 UpdatedJul 17, 2018 -
drat Public
Forked from apache/dratA distributed, parallelized (Map Reduce) wrapper around Apache™ RAT to allow it to complete on large code repositories of multiple file types where Apache™ RAT hangs forever.
JavaScript Apache License 2.0 UpdatedJul 18, 2017 -
polar-domain-discovery Public
Forked from USCDataScience/polar-domain-discoveryDomain Discovery on Polar Domain
HTML Apache License 2.0 UpdatedJun 7, 2017 -
drat-ontosoft Public
DRAT on OntoSoft code repositories
-
-
startbootstrap-sb-admin-2 Public
Forked from StartBootstrap/startbootstrap-sb-admin-2A free, open source, Bootstrap admin theme created by Start Bootstrap
HTML MIT License UpdatedApr 4, 2017 -
-
SnapWorld Public
Capture your memories with SnapWorld and help others!
Java Apache License 2.0 UpdatedNov 29, 2016 -
banana Public
Forked from lucidworks/bananaBanana for Solr - A Port of Kibana
-
-
counterfeit Public
Forked from nasa-jpl-memex/counterfeitPilot for CE domain.
JavaScript Apache License 2.0 UpdatedJul 26, 2016 -
cdr-pipeline Public
Place where Nutch segments are extracted followed by post crawl analysis
Python Apache License 2.0 UpdatedJul 25, 2016 -
PCF-Nutch-on-Wrangler Public
A repository for Nutch crawl evaluation
-
memex-cdr Public
Forked from istresearch/memex-cdrThis repository hosts code and schema information related to the Memex Crawl Data Repository (CDR)
Python UpdatedJul 15, 2016 -
-
Political-Inclination-NLP Public
Project for CSCI-544 - Applied Natural Language Processing
-
FocusedCrawl-Weapons Public
Nutch Protocol Interactive-Selenium handlers to fetch focused results from Weapons URL. This also includes some extractor scripts to fetch relevant seeds from the website.
-
nutch Public
Forked from apache/nutchMirror of Apache Nutch
Java Apache License 2.0 UpdatedApr 18, 2016 -
NutchPlugin-HtmlUnit Public
This is a HtmlUnit plugin for Apache Nutch. Leverage headless browsing capability.
-
Applied-NLP Public
CSCI-544 (Applied Natural Language Processing) homework assignments
Python Apache License 2.0 UpdatedFeb 18, 2016 -
SolrMerge Public
An open source project to merge Solr cores in an extremely customizable way.
-
startbootstrap-scrolling-nav Public
Forked from StartBootstrap/startbootstrap-scrolling-navAn unstyled Bootstrap HTML template for creating smooth scrolling, one page websites - created by Start Bootstrap
HTML Apache License 2.0 UpdatedNov 28, 2015 -
project-mango Public
Weapons Dashboard with D3 Visuals - CSCI572 - Assignment 3
-
tika-python Public
Forked from chrismattmann/tika-pythonPython Apache License 2.0 UpdatedNov 17, 2015 -
oodt Public
Forked from apache/oodtMirror of Apache OODT
Java Apache License 2.0 UpdatedNov 10, 2015 -