Apache Solr Language Identifier


Apache Solr Language Identifier

This module is intended to be used while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify language from documents and tag the document with language code.

Compile зависимости (146)

Группа / Артифакт Версия Более новая версия
org.apache.lucene » lucene-analyzers-nori 7.7.1 8.10.1
org.apache.lucene » lucene-misc 7.7.1 9.9.1
com.fasterxml.jackson.core » jackson-databind 2.9.6 2.17.1
org.apache.lucene » lucene-memory 7.7.1 9.9.1
org.apache.commons » commons-math3 3.6.1 Нет
com.fasterxml.jackson.core » jackson-core 2.9.6 2.17.1
org.apache.lucene » lucene-queries 7.7.1 9.9.1
com.googlecode.mp4parser » isoparser 1.1.22 Нет
org.apache.tika » tika-parsers 1.19.1 1.27
org.apache.lucene » lucene-queryparser 7.7.1 9.9.1
org.apache.lucene » lucene-spatial-extras 7.7.1 9.9.1
org.apache.zookeeper » zookeeper 3.4.13 3.6.3
com.fasterxml.jackson.core » jackson-annotations 2.9.6 2.17.1
org.apache.lucene » lucene-highlighter 7.7.1 9.9.1
org.apache.lucene » lucene-analyzers-phonetic 7.7.1 8.10.1
com.drewnoakes » metadata-extractor 2.11.0 2.16.0
org.apache.lucene » lucene-analyzers-common 7.7.1 8.10.1
org.apache.lucene » lucene-grouping 7.7.1 9.9.1
org.apache.lucene » lucene-classification 7.7.1 9.9.1
com.adobe.xmp » xmpcore 5.1.3 6.1.11
org.eclipse.jetty » jetty-deploy 9.4.14.v20181114 10.0.6
org.apache.lucene » lucene-analyzers-kuromoji 7.7.1 8.10.1
org.apache.james » apache-mime4j-dom 0.8.2 0.8.4
org.apache.lucene » lucene-spatial3d 7.7.1 9.9.1
org.apache.james » apache-mime4j-core 0.8.2 0.8.4
com.healthmarketscience.jackcess » jackcess-encrypt 2.1.4 4.0.2
org.apache.lucene » lucene-expressions 7.7.1 9.9.1
org.apache.lucene » lucene-suggest 7.7.1 9.9.1
com.google.guava » guava 14.0.1 33.0.0-jre
org.eclipse.jetty » jetty-rewrite 9.4.14.v20181114 10.0.11
org.apache.lucene » lucene-sandbox 7.7.1 9.9.1
org.apache.commons » commons-collections4 4.2 RELEASE
org.codehaus.woodstox » stax2-api 3.1.4 4.2.1
org.apache.tika » tika-core 1.19.1 1.27
com.carrotsearch » hppc 0.8.1 0.9.0
org.ccil.cowan.tagsoup » tagsoup 1.2.1 Нет
com.ibm.icu » icu4j 62.1 73.1
org.locationtech.spatial4j » spatial4j 0.7 0.8
org.apache.solr » solr-core 7.7.1 9.6.0
org.apache.solr » solr-solrj 7.7.1 9.6.0
org.brotli » dec 0.1.2 Нет
org.apache.xmlbeans » xmlbeans 3.0.1 5.0.1
commons-configuration » commons-configuration 1.6 1.10
com.github.virtuald » curvesapi 1.04 1.06
org.codehaus.jackson » jackson-core-asl 1.9.13 Нет
javax.servlet » javax.servlet-api 3.1.0 4.0.1
org.codehaus.woodstox » woodstox-core-asl 4.4.1 Нет
org.codehaus.jackson » jackson-mapper-asl 1.9.13 Нет
org.apache.tika » tika-java7 1.19.1 1.27
org.eclipse.jetty » jetty-xml 9.4.14.v20181114 9.4.44.v20210927
com.rometools » rome-utils 1.5.1 1.16.0
org.bouncycastle » bcprov-jdk15on 1.60 1.70
org.apache.tika » tika-xmp 1.19.1 1.27
org.aspectj » aspectjrt 1.8.0 1.9.21.2
org.bouncycastle » bcpkix-jdk15on 1.60 1.70
org.bouncycastle » bcmail-jdk15on 1.60 1.69
org.apache.httpcomponents » httpclient 4.5.6 4.5.11
org.apache.httpcomponents » httpmime 4.5.6 4.5.12
commons-io » commons-io 2.5 2.11.0
org.apache.logging.log4j » log4j-api 2.11.0 2.19.0
org.apache.lucene » lucene-core 7.7.1 9.9.1
de.l3s.boilerpipe » boilerpipe 1.1.0 Нет
org.slf4j » jul-to-slf4j 1.7.24 2.0.12
org.apache.lucene » lucene-codecs 7.7.1 9.9.1
org.apache.commons » commons-exec 1.3 Нет
com.healthmarketscience.jackcess » jackcess 2.1.12 4.0.6
org.apache.lucene » lucene-backward-codecs 7.7.1 9.9.1
org.gagravarr » vorbis-java-core 0.8 Нет
org.apache.logging.log4j » log4j-slf4j-impl 2.11.0 2.19.0
org.eclipse.jetty » jetty-continuation 9.4.14.v20181114 9.4.44.v20210927
org.apache.logging.log4j » log4j-1.2-api 2.11.0 2.17.2
org.apache.lucene » lucene-join 7.7.1 9.9.1
org.gagravarr » vorbis-java-tika 0.8 Нет
org.slf4j » jcl-over-slf4j 1.7.24 2.0.12
org.apache.logging.log4j » log4j-core 2.11.0 2.18.0
org.antlr » antlr4-runtime 4.5.1-1 4.13.1
com.fasterxml.jackson.dataformat » jackson-dataformat-smile 2.9.6 2.17.1
org.slf4j » slf4j-api 1.7.24 2.0.12
org.apache.curator » curator-client 2.8.0 5.2.0
org.apache.curator » curator-framework 2.8.0 5.2.0
org.apache.curator » curator-recipes 2.8.0 5.2.0
org.apache.calcite.avatica » avatica-core 1.10.0 1.18.0
org.jdom » jdom2 2.0.6 Нет
org.eclipse.jetty » jetty-servlets 9.4.14.v20181114 10.0.11
org.apache.pdfbox » jempbox 1.8.16 Нет
net.hydromatic » eigenbase-properties 1.1.5 1.1.6
com.rometools » rome 1.5.1 1.16.0
org.codehaus.janino » commons-compiler 2.7.6 3.1.6
org.tallison » jmatio 1.5 Нет
org.codehaus.janino » janino 2.7.6 3.1.7
org.rrd4j » rrd4j 3.2 3.8
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 Нет
commons-fileupload » commons-fileupload 1.3.3 1.4
org.apache.httpcomponents » httpcore 4.4.10 4.4.15
com.tdunning » t-digest 3.1 3.3
dom4j » dom4j 1.6.1 1.4-dev-8
org.apache.poi » poi 4.0.0 5.0.0
org.ow2.asm » asm-commons 5.1 9.2
org.apache.poi » poi-ooxml-schemas 4.0.0 4.1.2
org.apache.poi » poi-scratchpad 4.0.0 5.0.0
org.apache.poi » poi-ooxml 4.0.0 5.0.0
info.ganglia.gmetric4j » gmetric4j 1.0.7 1.0.10
org.noggit » noggit 0.8 Нет
org.ow2.asm » asm 5.1 9.2
xerces » xercesImpl 2.9.1 RELEASE
com.google.protobuf » protobuf-java 3.1.0 3.25.3
commons-collections » commons-collections 3.2.2 Нет
commons-lang » commons-lang 2.6 Нет
com.lmax » disruptor 3.4.0 3.4.4
commons-cli » commons-cli 1.2 1.4
org.restlet.jee » org.restlet.ext.servlet 2.3.0 Нет
org.restlet.jee » org.restlet 2.3.0 Нет
org.apache.hadoop » hadoop-annotations 2.7.4 3.3.1
com.github.ben-manes.caffeine » caffeine 2.4.0 3.1.8
org.apache.hadoop » hadoop-common 2.7.4 3.3.1
io.dropwizard.metrics » metrics-ganglia 3.2.6 Нет
org.apache.calcite » calcite-core 1.13.0 1.27.0
io.dropwizard.metrics » metrics-graphite 3.2.6 4.2.23
org.apache.commons » commons-compress 1.18 1.21
org.apache.hadoop » hadoop-auth 2.7.4 3.3.1
org.apache.hadoop » hadoop-hdfs 2.7.4 3.3.1
commons-codec » commons-codec 1.11 1.15
io.dropwizard.metrics » metrics-jetty9 3.2.6 4.2.23
org.apache.htrace » htrace-core 3.2.0-incubating 4.0.0-incubating
net.arnx » jsonic 1.2.7 1.3.10
joda-time » joda-time 2.2 2.12.7
org.eclipse.jetty » jetty-server 9.4.14.v20181114 9.4.44.v20210927
org.apache.pdfbox » fontbox 2.0.12 3.0.0-alpha2
org.eclipse.jetty » jetty-servlet 9.4.14.v20181114 10.0.12
org.tukaani » xz 1.8 1.9
org.eclipse.jetty » jetty-util 9.4.14.v20181114 9.4.44.v20210927
org.eclipse.jetty » jetty-io 9.4.14.v20181114 9.4.44.v20210927
org.eclipse.jetty » jetty-http 9.4.14.v20181114 10.0.6
com.cybozu.labs » langdetect 1.1-20120112 Нет
io.dropwizard.metrics » metrics-core 3.2.6 4.2.23
io.dropwizard.metrics » metrics-jvm 3.2.6 4.2.23
com.epam » parso 2.0.9 2.0.14
org.eclipse.jetty » jetty-jmx 9.4.14.v20181114 9.4.44.v20210927
org.eclipse.jetty » jetty-security 9.4.14.v20181114 9.4.44.v20210927
org.apache.commons » commons-lang3 3.6 3.12.0
org.eclipse.jetty » jetty-webapp 9.4.14.v20181114 9.4.44.v20210927
org.apache.pdfbox » pdfbox 2.0.12 3.0.0-alpha2
com.pff » java-libpst 0.8.1 0.9.3
org.apache.pdfbox » pdfbox-tools 2.0.12 2.0.24
org.apache.opennlp » opennlp-tools 1.9.0 1.9.3
org.apache.calcite » calcite-linq4j 1.13.0 1.27.0

Test зависимости (4)