如何解决Lucene自动完成工具suggest.InputIterator包含多个字段
我正在使用lucene AnalyzingSuggerter API实现自动完成。
我有一个如下的数据类
class Student implements java.io.Serializable {
int id;
String fullName;
String description;
public Student(int id,String fullName,String description) {
this.id = id;
this.description = description;
this.fullName = fullName;
}
}
我实现了以下迭代器
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;
import java.io.UnsupportedEncodingException;
import java.util.Comparator;
import java.util.HashSet;
import java.util.Iterator;
import java.util.Set;
import org.apache.lucene.search.suggest.InputIterator;
import org.apache.lucene.util.BytesRef;
class CourseIterator implements InputIterator
{
private Iterator<Course> courseIterator;
private Course currentCourse;
CourseIterator(Iterator<Course> courseIterator) {
this.courseIterator = courseIterator;
}
public boolean hasContexts() {
return false;
}
public boolean hasPayloads() {
return true;
}
public Comparator<BytesRef> getComparator() {
return null;
}
public BytesRef next() {
if (courseIterator.hasNext()) {
currentCourse = courseIterator.next();
try {
return new BytesRef(currentCourse.fullName.getBytes("UTF8"));
} catch (UnsupportedEncodingException e) {
throw new Error("Couldn't convert to UTF-8");
}
} else {
return null;
}
}
// payload is a serialized Java object representing our course.
public BytesRef payload() {
try {
ByteArrayOutputStream bos = new ByteArrayOutputStream();
ObjectOutputStream out = new ObjectOutputStream(bos);
out.writeObject(currentCourse);
out.close();
return new BytesRef(bos.toByteArray());
} catch (IOException e) {
throw new Error("Well that's unfortunate.");
}
}
public Set<BytesRef> contexts() {
throw new Error("Unsupported Operation");
}
public long weight() {
return 1;
}
}
这对数据很好用
new Student(1,"Elizabeth Smith","test1");
new Student(2,"Will Smith","test2");
new Student(3,"Smith Brown","test3");
new Student(4,"Queen Elizabeth","test4");
new Student(5,"Elena sam","test1");
on auto suggestion search for
Eliz --> [1]
Smit --> [3]
El --> [1,5]
Que --> [4]
我还想将全名的姓氏部分考虑进去以进行自动搜索,因此结果应该是考虑到姓氏并且权重较小。
Eliz --> [1,4]
Smit --> [3,2]
El --> [1,5,4]
Que --> [4]
有人可以建议如何实现吗?
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。