Apache Solr Language Identifier


Apache Solr Language Identifier

This module is intended to be used while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify language from documents and tag the document with language code.

Compile зависимости (108)

Группа / Артифакт Версия Более новая версия
org.apache.lucene » lucene-analyzers-common 5.5.5 8.10.1
org.apache.lucene » lucene-sandbox 5.5.5 9.9.1
org.apache.lucene » lucene-suggest 5.5.5 9.9.1
org.apache.hadoop » hadoop-annotations 2.6.0 3.3.1
org.apache.lucene » lucene-analyzers-kuromoji 5.5.5 8.10.1
org.apache.lucene » lucene-backward-codecs 5.5.5 9.9.1
org.apache.hadoop » hadoop-auth 2.6.0 3.3.1
org.apache.lucene » lucene-core 5.5.5 9.9.1
dom4j » dom4j 1.6.1 1.4-dev-8
org.apache.hadoop » hadoop-common 2.6.0 3.3.1
org.apache.lucene » lucene-join 5.5.5 9.9.1
org.apache.hadoop » hadoop-hdfs 2.6.0 3.3.1
org.apache.lucene » lucene-analyzers-phonetic 5.5.5 8.10.1
com.spatial4j » spatial4j 0.5 Нет
org.apache.lucene » lucene-highlighter 5.5.5 9.9.1
org.htrace » htrace-core 3.0.4 Нет
org.noggit » noggit 0.6 0.8
org.apache.lucene » lucene-codecs 5.5.5 9.9.1
xerces » xercesImpl 2.9.1 RELEASE
org.apache.lucene » lucene-misc 5.5.5 9.9.1
com.fasterxml.jackson.core » jackson-core 2.5.4 2.17.1
commons-lang » commons-lang 2.6 Нет
org.apache.solr » solr-core 5.5.5 9.6.0
com.googlecode.mp4parser » isoparser 1.0.2 1.1.22
commons-fileupload » commons-fileupload 1.3.2 1.4
org.restlet.jee » org.restlet.ext.servlet 2.3.0 Нет
org.restlet.jee » org.restlet 2.3.0 Нет
org.apache.lucene » lucene-memory 5.5.5 9.9.1
org.apache.lucene » lucene-spatial 5.5.5 7.7.3
org.apache.lucene » lucene-grouping 5.5.5 9.9.1
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 Нет
org.apache.lucene » lucene-queries 5.5.5 9.9.1
org.apache.lucene » lucene-queryparser 5.5.5 9.9.1
com.carrotsearch » hppc 0.7.1 0.9.0
org.apache.lucene » lucene-expressions 5.5.5 9.9.1
org.aspectj » aspectjrt 1.8.0 1.9.21.2
org.slf4j » jul-to-slf4j 1.7.7 2.0.12
org.slf4j » jcl-over-slf4j 1.7.7 2.0.12
org.slf4j » slf4j-log4j12 1.7.7 2.0.12
org.bouncycastle » bcprov-jdk15 1.45 1.46
org.bouncycastle » bcmail-jdk15 1.45 1.46
de.l3s.boilerpipe » boilerpipe 1.1.0 Нет
org.apache.solr » solr-solrj 5.5.5 9.6.0
org.slf4j » slf4j-api 1.7.7 2.0.12
org.apache.httpcomponents » httpcore 4.4.1 4.4.15
org.apache.httpcomponents » httpclient 4.4.1 4.5.11
org.apache.httpcomponents » httpmime 4.4.1 4.5.12
org.eclipse.jetty » jetty-http 9.2.13.v20150730 10.0.6
javax.servlet » javax.servlet-api 3.1.0 4.0.1
org.eclipse.jetty » jetty-io 9.2.13.v20150730 9.4.44.v20210927
org.eclipse.jetty » jetty-xml 9.2.13.v20150730 9.4.44.v20210927
org.eclipse.jetty » jetty-jmx 9.2.13.v20150730 9.4.44.v20210927
rome » rome 1.0 Нет
org.eclipse.jetty » jetty-util 9.2.13.v20150730 9.4.44.v20210927
org.eclipse.jetty » jetty-webapp 9.2.13.v20150730 9.4.44.v20210927
com.google.protobuf » protobuf-java 2.5.0 3.25.3
org.eclipse.jetty » jetty-server 9.2.13.v20150730 9.4.44.v20210927
org.eclipse.jetty » jetty-security 9.2.13.v20150730 9.4.44.v20210927
org.eclipse.jetty » jetty-servlet 9.2.13.v20150730 10.0.12
org.ow2.asm » asm-commons 5.0.4 9.2
org.eclipse.jetty » jetty-continuation 9.2.13.v20150730 9.4.44.v20210927
org.apache.tika » tika-core 1.7 1.27
org.ow2.asm » asm 5.0.4 9.2
org.apache.tika » tika-xmp 1.7 1.27
org.eclipse.jetty » jetty-servlets 9.2.13.v20150730 10.0.11
org.apache.tika » tika-parsers 1.7 1.27
org.ccil.cowan.tagsoup » tagsoup 1.2.1 Нет
org.codehaus.woodstox » stax2-api 3.1.4 4.2.1
org.apache.tika » tika-java7 1.7 1.27
com.fasterxml.jackson.dataformat » jackson-dataformat-smile 2.5.4 2.17.1
org.gagravarr » vorbis-java-tika 0.6 0.8
org.apache.zookeeper » zookeeper 3.4.6 3.6.3
org.codehaus.woodstox » woodstox-core-asl 4.4.1 Нет
org.apache.poi » poi-scratchpad 3.11 5.0.0
org.gagravarr » vorbis-java-core 0.6 0.8
org.apache.poi » poi 3.11 5.0.0
org.apache.poi » poi-ooxml 3.11 5.0.0
org.apache.poi » poi-ooxml-schemas 3.11 4.1.2
commons-configuration » commons-configuration 1.6 1.10
org.antlr » antlr4-runtime 4.5.1-1 4.13.1
org.tallison » jmatio 1.2 1.5
org.eclipse.jetty » jetty-rewrite 9.2.13.v20150730 10.0.11
org.apache.james » apache-mime4j-dom 0.7.2 0.8.4
org.eclipse.jetty » jetty-deploy 9.2.13.v20150730 10.0.6
org.apache.james » apache-mime4j-core 0.7.2 0.8.4
com.adobe.xmp » xmpcore 5.1.2 6.1.11
org.apache.commons » commons-exec 1.3 Нет
com.googlecode.concurrentlinkedhashmap » concurrentlinkedhashmap-lru 1.2 1.4.2
com.google.guava » guava 14.0.1 33.0.0-jre
joda-time » joda-time 2.2 2.12.7
com.cybozu.labs » langdetect 1.1-20120112 Нет
jdom » jdom 1.0 1.1
commons-collections » commons-collections 3.2.2 Нет
org.tukaani » xz 1.5 1.9
com.drewnoakes » metadata-extractor 2.6.2 2.16.0
com.pff » java-libpst 0.8.1 0.9.3
org.apache.pdfbox » fontbox 1.8.8 3.0.0-alpha2
commons-cli » commons-cli 1.2 1.4
org.apache.pdfbox » jempbox 1.8.8 1.8.16
org.apache.pdfbox » pdfbox 1.8.8 3.0.0-alpha2
com.tdunning » t-digest 3.1 3.3
log4j » log4j 1.2.17 Нет
org.apache.commons » commons-compress 1.8.1 1.21
com.ibm.icu » icu4j 54.1 73.1
commons-io » commons-io 2.4 2.11.0
org.apache.xmlbeans » xmlbeans 2.6.0 5.0.1
net.arnx » jsonic 1.2.7 1.3.10
commons-codec » commons-codec 1.10 1.15

Test зависимости (4)