Apache Solr Language Identifier


Apache Solr Language Identifier

This module is intended to be used while indexing documents. It is implemented as an UpdateProcessor to be placed in an UpdateChain. Its purpose is to identify language from documents and tag the document with language code.

Compile зависимости (164)

Группа / Артифакт Версия Более новая версия
com.googlecode.juniversalchardet » juniversalchardet 1.0.3 Нет
io.opentracing » opentracing-util 0.33.0 Нет
commons-collections » commons-collections 3.2.2 Нет
org.apache.pdfbox » pdfbox-tools 2.0.12 2.0.24
net.hydromatic » eigenbase-properties 1.1.5 1.1.6
org.ow2.asm » asm-commons 5.1 9.2
org.apache.pdfbox » pdfbox 2.0.12 3.0.0-alpha2
commons-fileupload » commons-fileupload 1.3.3 1.4
org.jdom » jdom2 2.0.6 Нет
org.ow2.asm » asm 5.1 9.2
org.eclipse.jetty.http2 » http2-http-client-transport 9.4.19.v20190610 9.4.44.v20210927
org.eclipse.jetty.http2 » http2-server 9.4.19.v20190610 9.4.44.v20210927
org.eclipse.jetty.http2 » http2-client 9.4.19.v20190610 9.4.44.v20210927
org.apache.pdfbox » jempbox 1.8.16 Нет
org.apache.commons » commons-lang3 3.8.1 3.12.0
com.rometools » rome 1.5.1 1.16.0
org.eclipse.jetty » jetty-http 9.4.19.v20190610 10.0.6
org.apache.curator » curator-recipes 2.13.0 5.2.0
org.apache.curator » curator-framework 2.13.0 5.2.0
org.eclipse.jetty » jetty-continuation 9.4.19.v20190610 9.4.44.v20210927
io.opentracing » opentracing-api 0.33.0 Нет
org.apache.curator » curator-client 2.13.0 5.2.0
com.google.protobuf » protobuf-java 3.6.1 3.25.3
io.opentracing » opentracing-noop 0.33.0 Нет
org.bitbucket.b_c » jose4j 0.6.5 0.9.3
org.apache.commons » commons-compress 1.18 1.21
de.l3s.boilerpipe » boilerpipe 1.1.0 Нет
com.tdunning » t-digest 3.1 3.3
org.apache.lucene » lucene-spatial-extras 8.2.0 9.9.1
org.apache.lucene » lucene-spatial3d 8.2.0 9.9.1
org.eclipse.jetty » jetty-rewrite 9.4.19.v20190610 10.0.11
org.apache.lucene » lucene-suggest 8.2.0 9.9.1
org.apache.solr » solr-core 8.2.0 9.6.0
org.apache.solr » solr-solrj 8.2.0 9.6.0
org.apache.lucene » lucene-grouping 8.2.0 9.9.1
org.apache.lucene » lucene-highlighter 8.2.0 9.9.1
org.apache.lucene » lucene-join 8.2.0 9.9.1
org.apache.lucene » lucene-memory 8.2.0 9.9.1
org.apache.lucene » lucene-misc 8.2.0 9.9.1
org.codehaus.janino » commons-compiler 3.0.9 3.1.6
org.apache.pdfbox » fontbox 2.0.12 3.0.0-alpha2
org.apache.lucene » lucene-queries 8.2.0 9.9.1
org.apache.lucene » lucene-queryparser 8.2.0 9.9.1
org.codehaus.janino » janino 3.0.9 3.1.7
org.apache.lucene » lucene-sandbox 8.2.0 9.9.1
org.apache.lucene » lucene-analyzers-kuromoji 8.2.0 8.10.1
org.apache.lucene » lucene-analyzers-nori 8.2.0 8.10.1
org.apache.lucene » lucene-analyzers-phonetic 8.2.0 8.10.1
com.jayway.jsonpath » json-path 2.4.0 2.6.0
org.apache.xmlbeans » xmlbeans 3.0.1 5.0.1
org.apache.lucene » lucene-backward-codecs 8.2.0 9.9.1
javax.servlet » javax.servlet-api 3.1.0 4.0.1
org.antlr » antlr4-runtime 4.5.1-1 4.13.1
org.apache.lucene » lucene-classification 8.2.0 9.9.1
net.arnx » jsonic 1.2.7 1.3.10
com.ibm.icu » icu4j 62.1 73.1
org.apache.lucene » lucene-codecs 8.2.0 9.9.1
org.apache.hadoop » hadoop-annotations 3.2.0 3.3.1
org.apache.lucene » lucene-core 8.2.0 9.9.1
org.codehaus.woodstox » woodstox-core-asl 4.4.1 Нет
org.apache.lucene » lucene-expressions 8.2.0 9.9.1
org.apache.kerby » kerb-util 1.0.1 2.0.1
org.gagravarr » vorbis-java-core 0.8 Нет
org.gagravarr » vorbis-java-tika 0.8 Нет
org.apache.lucene » lucene-analyzers-common 8.2.0 8.10.1
com.github.virtuald » curvesapi 1.04 1.06
org.apache.kerby » kerb-core 1.0.1 2.0.1
io.sgr » s2-geometry-library-java 1.0.0 1.0.1
org.slf4j » slf4j-api 1.7.24 2.0.12
org.apache.commons » commons-exec 1.3 Нет
org.slf4j » jul-to-slf4j 1.7.24 2.0.12
org.apache.hadoop » hadoop-hdfs-client 3.2.0 3.3.1
org.apache.hadoop » hadoop-auth 3.2.0 3.3.1
org.slf4j » jcl-over-slf4j 1.7.24 2.0.12
org.apache.hadoop » hadoop-common 3.2.0 3.3.1
org.apache.kerby » kerby-pkix 1.0.1 2.0.1
xerces » xercesImpl 2.9.1 RELEASE
com.rometools » rome-utils 1.5.1 1.16.0
org.aspectj » aspectjrt 1.8.0 1.9.21.2
org.eclipse.jetty » jetty-security 9.4.19.v20190610 9.4.44.v20210927
org.apache.zookeeper » zookeeper 3.5.5 3.6.3
org.eclipse.jetty » jetty-jmx 9.4.19.v20190610 9.4.44.v20210927
org.eclipse.jetty » jetty-deploy 9.4.19.v20190610 10.0.6
org.apache.commons » commons-collections4 4.2 RELEASE
org.eclipse.jetty » jetty-client 9.4.19.v20190610 9.4.44.v20210927
org.apache.tika » tika-core 1.19.1 1.27
commons-beanutils » commons-beanutils 1.9.3 1.9.4
com.healthmarketscience.jackcess » jackcess 2.1.12 4.0.6
org.apache.calcite.avatica » avatica-core 1.13.0 1.18.0
com.healthmarketscience.jackcess » jackcess-encrypt 2.1.4 4.0.2
org.apache.commons » commons-configuration2 2.1.1 2.7
com.googlecode.mp4parser » isoparser 1.1.22 Нет
org.eclipse.jetty » jetty-util 9.4.19.v20190610 9.4.44.v20210927
org.apache.zookeeper » zookeeper-jute 3.5.5 3.6.3
org.apache.commons » commons-text 1.6 1.9
org.eclipse.jetty » jetty-xml 9.4.19.v20190610 9.4.44.v20210927
com.fasterxml.jackson.core » jackson-annotations 2.9.8 2.17.1
org.apache.tika » tika-parsers 1.19.1 1.27
org.apache.commons » commons-math3 3.6.1 Нет
com.cybozu.labs » langdetect 1.1-20120112 Нет
org.brotli » dec 0.1.2 Нет
com.google.re2j » re2j 1.2 1.6
org.apache.kerby » kerby-asn1 1.0.1 2.0.1
com.carrotsearch » hppc 0.8.1 0.9.0
org.apache.tika » tika-xmp 1.19.1 1.27
com.fasterxml.jackson.core » jackson-core 2.9.8 2.17.1
org.eclipse.jetty » jetty-io 9.4.19.v20190610 9.4.44.v20210927
com.fasterxml.jackson.core » jackson-databind 2.9.8 2.17.1
org.bouncycastle » bcprov-jdk15on 1.60 1.70
org.apache.tika » tika-java7 1.19.1 1.27
org.codehaus.woodstox » stax2-api 3.1.4 4.2.1
org.apache.httpcomponents » httpclient 4.5.6 4.5.11
commons-io » commons-io 2.5 2.11.0
org.bouncycastle » bcmail-jdk15on 1.60 1.69
org.eclipse.jetty » jetty-alpn-server 9.4.19.v20190610 9.4.44.v20210927
org.bouncycastle » bcpkix-jdk15on 1.60 1.70
com.drewnoakes » metadata-extractor 2.11.0 2.16.0
org.eclipse.jetty » jetty-alpn-java-server 9.4.19.v20190610 9.4.44.v20210927
com.adobe.xmp » xmpcore 5.1.3 6.1.11
org.eclipse.jetty.http2 » http2-hpack 9.4.19.v20190610 9.4.43.v20210629
org.eclipse.jetty.http2 » http2-common 9.4.19.v20190610 9.4.44.v20210927
org.apache.james » apache-mime4j-core 0.8.2 0.8.4
org.apache.httpcomponents » httpmime 4.5.6 4.5.12
com.fasterxml.jackson.dataformat » jackson-dataformat-smile 2.9.8 2.17.1
commons-cli » commons-cli 1.2 1.4
org.apache.james » apache-mime4j-dom 0.8.2 0.8.4
org.apache.opennlp » opennlp-tools 1.9.1 1.9.3
org.apache.htrace » htrace-core4 4.1.0-incubating 4.2.0-incubating
org.restlet.jee » org.restlet 2.3.0 Нет
io.dropwizard.metrics » metrics-jetty9 4.0.5 4.2.23
org.restlet.jee » org.restlet.ext.servlet 2.3.0 Нет
commons-codec » commons-codec 1.11 1.15
com.lmax » disruptor 3.4.2 3.4.4
org.apache.poi » poi-ooxml 4.0.0 5.0.0
org.eclipse.jetty » jetty-alpn-java-client 9.4.19.v20190610 10.0.6
org.apache.poi » poi-scratchpad 4.0.0 5.0.0
org.eclipse.jetty » jetty-alpn-client 9.4.19.v20190610 9.4.43.v20210629
org.apache.poi » poi-ooxml-schemas 4.0.0 4.1.2
org.apache.calcite » calcite-linq4j 1.18.0 1.27.0
org.apache.calcite » calcite-core 1.18.0 1.27.0
io.dropwizard.metrics » metrics-graphite 4.0.5 4.2.23
org.ccil.cowan.tagsoup » tagsoup 1.2.1 Нет
org.eclipse.jetty » jetty-webapp 9.4.19.v20190610 9.4.44.v20210927
org.eclipse.jetty » jetty-servlets 9.4.19.v20190610 10.0.11
org.eclipse.jetty » jetty-server 9.4.19.v20190610 9.4.44.v20210927
com.pff » java-libpst 0.8.1 0.9.3
io.dropwizard.metrics » metrics-jmx 4.0.5 4.2.23
org.eclipse.jetty » jetty-servlet 9.4.19.v20190610 10.0.12
org.tukaani » xz 1.8 1.9
com.epam » parso 2.0.9 2.0.14
org.apache.logging.log4j » log4j-1.2-api 2.11.2 2.17.2
org.apache.logging.log4j » log4j-web 2.11.2 2.13.1
org.locationtech.spatial4j » spatial4j 0.7 0.8
org.tallison » jmatio 1.5 Нет
com.google.guava » guava 25.1-jre 33.0.0-jre
io.dropwizard.metrics » metrics-jvm 4.0.5 4.2.23
io.dropwizard.metrics » metrics-core 4.0.5 4.2.23
org.rrd4j » rrd4j 3.5 3.8
org.apache.httpcomponents » httpcore 4.4.10 4.4.15
org.apache.poi » poi 4.0.0 5.0.0
com.github.ben-manes.caffeine » caffeine 2.4.0 3.1.8
org.apache.logging.log4j » log4j-api 2.11.2 2.19.0
org.apache.logging.log4j » log4j-core 2.11.2 2.18.0
org.apache.logging.log4j » log4j-slf4j-impl 2.11.2 2.19.0

Test зависимости (4)