Apache Solr Language Identifier


This module is intended for use while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify the language of each document and tag the document with the corresponding language code.
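The idea of a processor sitting in a chain and tagging each document can be sketched as a toy model. This is only an illustration and does not use Solr's actual classes: `UpdateStep`, `StopwordLangId`, `runChain`, the field names `text` and `language`, and the stopword-counting heuristic are all hypothetical stand-ins chosen for this sketch.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Toy model of an update chain with a language-tagging step (not Solr's API). */
public class LangIdChainSketch {

    /** One step in the chain; it may modify the document before it is indexed. */
    interface UpdateStep {
        void process(Map<String, Object> doc);
    }

    /** Hypothetical detector: picks the language whose stopwords occur most often. */
    static final class StopwordLangId implements UpdateStep {
        private final Map<String, Set<String>> stopwords = new LinkedHashMap<>();
        private final String sourceField;
        private final String langField;
        private final String fallback;

        StopwordLangId(String sourceField, String langField, String fallback) {
            this.sourceField = sourceField;
            this.langField = langField;
            this.fallback = fallback;
            stopwords.put("en", new HashSet<>(Arrays.asList("the", "and", "is", "of")));
            stopwords.put("de", new HashSet<>(Arrays.asList("der", "und", "ist", "das")));
            stopwords.put("fr", new HashSet<>(Arrays.asList("le", "et", "est", "les")));
        }

        /** Returns a language code, or the fallback when no stopword matches. */
        String detect(String text) {
            String best = fallback;
            int bestHits = 0;
            String[] tokens = text.toLowerCase(Locale.ROOT).split("\\W+");
            for (Map.Entry<String, Set<String>> entry : stopwords.entrySet()) {
                int hits = 0;
                for (String token : tokens) {
                    if (entry.getValue().contains(token)) {
                        hits++;
                    }
                }
                if (hits > bestHits) {
                    bestHits = hits;
                    best = entry.getKey();
                }
            }
            return best;
        }

        /** Tags the document by writing the detected code into langField. */
        @Override
        public void process(Map<String, Object> doc) {
            Object value = doc.get(sourceField);
            doc.put(langField, value == null ? fallback : detect(value.toString()));
        }
    }

    /** Runs every step in order, the way a chain hands a document along. */
    static Map<String, Object> runChain(List<? extends UpdateStep> chain,
                                        Map<String, Object> doc) {
        for (UpdateStep step : chain) {
            step.process(doc);
        }
        return doc;
    }

    public static void main(String[] args) {
        Map<String, Object> doc = new HashMap<>();
        doc.put("text", "Der Hund ist gross und das Haus ist alt");
        List<UpdateStep> chain =
                Collections.<UpdateStep>singletonList(
                        new StopwordLangId("text", "language", "unknown"));
        System.out.println(runChain(chain, doc).get("language"));
    }
}
```

In this sketch the language code is written to a separate field rather than replacing the text, so later steps in the chain, or queries after indexing, could filter on it. The stopword heuristic is only a placeholder for real language detection.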

Compile dependencies (94)

Group / Artifact Version Newer version
org.apache.hadoop » hadoop-common 2.2.0 3.3.1
org.apache.httpcomponents » httpmime 4.3.1 4.5.12
org.apache.httpcomponents » httpclient 4.3.1 4.5.11
org.apache.hadoop » hadoop-hdfs 2.2.0 3.3.1
xerces » xercesImpl 2.9.1 RELEASE
org.apache.hadoop » hadoop-auth 2.2.0 3.3.1
dom4j » dom4j 1.6.1 1.4-dev-8
commons-lang » commons-lang 2.6 None
org.apache.hadoop » hadoop-annotations 2.2.0 3.3.1
org.apache.httpcomponents » httpcore 4.3 4.4.15
commons-io » commons-io 2.3 2.11.0
org.noggit » noggit 0.5 0.8
org.eclipse.jetty » jetty-server 8.1.10.v20130312 9.4.44.v20210927
rome » rome 0.9 1.0
org.eclipse.jetty » jetty-security 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-servlet 8.1.10.v20130312 10.0.12
org.eclipse.jetty » jetty-io 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-http 8.1.10.v20130312 10.0.6
org.eclipse.jetty » jetty-continuation 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty » jetty-webapp 8.1.10.v20130312 9.4.44.v20210927
org.eclipse.jetty.orbit » javax.servlet 3.0.0.v201112011016 None
org.eclipse.jetty » jetty-deploy 8.1.10.v20130312 10.0.6
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 None
org.eclipse.jetty » jetty-xml 8.1.10.v20130312 9.4.44.v20210927
commons-codec » commons-codec 1.9 1.15
org.bouncycastle » bcprov-jdk15 1.45 1.46
org.eclipse.jetty » jetty-util 8.1.10.v20130312 9.4.44.v20210927
org.bouncycastle » bcmail-jdk15 1.45 1.46
org.eclipse.jetty » jetty-jmx 8.1.10.v20130312 9.4.44.v20210927
de.l3s.boilerpipe » boilerpipe 1.1.0 None
org.codehaus.woodstox » wstx-asl 3.2.7 4.0.6
org.restlet.jee » org.restlet 2.1.1 None
org.restlet.jee » org.restlet.ext.servlet 2.1.1 None
org.tukaani » xz 1.4 1.9
org.apache.commons » commons-compress 1.7 1.21
com.google.protobuf » protobuf-java 2.5.0 3.25.3
com.spatial4j » spatial4j 0.4.1 0.5
commons-configuration » commons-configuration 1.6 1.10
org.ccil.cowan.tagsoup » tagsoup 1.2.1 None
org.gagravarr » vorbis-java-core 0.1 0.8
org.gagravarr » vorbis-java-tika 0.1 0.8
com.ibm.icu » icu4j 53.1 73.1
org.apache.zookeeper » zookeeper 3.4.6 3.6.3
org.apache.pdfbox » pdfbox 1.8.4 3.0.0-alpha2
org.apache.pdfbox » fontbox 1.8.4 3.0.0-alpha2
org.apache.pdfbox » jempbox 1.8.4 1.8.16
org.apache.poi » poi-ooxml-schemas 3.10.1 4.1.2
org.apache.james » apache-mime4j-dom 0.7.2 0.8.4
org.apache.james » apache-mime4j-core 0.7.2 0.8.4
com.adobe.xmp » xmpcore 5.1.2 6.1.11
org.antlr » antlr-runtime 3.5 3.5.2
org.slf4j » slf4j-log4j12 1.7.6 2.0.12
org.ow2.asm » asm-commons 4.1 9.2
org.apache.poi » poi 3.10.1 5.0.0
com.googlecode.mp4parser » isoparser 1.0-RC-1 1.1.22
org.ow2.asm » asm 4.1 9.2
com.google.guava » guava 14.0.1 33.0.0-jre
org.slf4j » slf4j-api 1.7.6 2.0.12
org.apache.poi » poi-scratchpad 3.10.1 5.0.0
org.apache.poi » poi-ooxml 3.10.1 5.0.0
org.slf4j » jul-to-slf4j 1.7.6 2.0.12
commons-fileupload » commons-fileupload 1.2.1 1.4
com.googlecode.concurrentlinkedhashmap » concurrentlinkedhashmap-lru 1.2 1.4.2
jdom » jdom 1.0 1.1
org.apache.solr » solr-core 4.10.3 9.6.0
joda-time » joda-time 2.2 2.12.7
org.apache.tika » tika-xmp 1.5 1.27
org.apache.solr » solr-solrj 4.10.3 9.6.0
org.apache.lucene » lucene-analyzers-common 4.10.3 8.10.1
org.apache.lucene » lucene-queries 4.10.3 9.9.1
com.drewnoakes » metadata-extractor 2.6.2 2.16.0
org.apache.lucene » lucene-suggest 4.10.3 9.9.1
org.aspectj » aspectjrt 1.6.11 1.9.21.2
org.apache.lucene » lucene-expressions 4.10.3 9.9.1
org.apache.lucene » lucene-join 4.10.3 9.9.1
org.apache.tika » tika-core 1.5 1.27
org.apache.tika » tika-parsers 1.5 1.27
org.apache.lucene » lucene-queryparser 4.10.3 9.9.1
org.apache.lucene » lucene-codecs 4.10.3 9.9.1
com.uwyn » jhighlight 1.0 None
com.cybozu.labs » langdetect 1.1-20120112 None
org.apache.lucene » lucene-core 4.10.3 9.9.1
org.apache.lucene » lucene-analyzers-phonetic 4.10.3 8.10.1
org.apache.lucene » lucene-analyzers-kuromoji 4.10.3 8.10.1
org.apache.lucene » lucene-grouping 4.10.3 9.9.1
org.apache.lucene » lucene-spatial 4.10.3 7.7.3
org.apache.lucene » lucene-memory 4.10.3 9.9.1
org.apache.lucene » lucene-highlighter 4.10.3 9.9.1
org.apache.lucene » lucene-misc 4.10.3 9.9.1
org.apache.xmlbeans » xmlbeans 2.6.0 5.0.1
net.arnx » jsonic 1.2.7 1.3.10
commons-cli » commons-cli 1.2 1.4
log4j » log4j 1.2.17 None
com.carrotsearch » hppc 0.5.2 0.9.0

Test dependencies (5)