Function

Static Public Summary
public

augmented(K: number): function

double normalisation score function generator: K + (1 - K) * (count / max count)

public

binary(): number

binary tf (1)

public

count(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

raw term count

public

async evaluate(system: System, parser: QueryParser, cases: object): object

Evaluate an IR system.

public

extractObjectValues(document: object, fields: object, callback: function)

extract values from a document object

public

inverseDocumentFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf - number of documents / number of documents with term

public

inverseDocumentFrequencyMax(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf max

public

inverseDocumentFrequencySmooth(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf smooth

public

logNormal(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

1 + log(count)

public

naiveBayes(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

anonymous function - description

public

probabilisticInverseDocumentFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

probabilistic idf

public

termFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

term frequency (raw count / total raw count)

public

unary(): number

unary idf (1)

Static Public

public augmented(K: number): function

double normalisation score function generator: K + (1 - K) * (count / max count)

Params:

Name Type Attribute Description
K number
  • optional
  • default: 0.5

augment weight

Return:

function

score function
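The generator described above can be sketched as follows. This is a hypothetical re-implementation for illustration, not the library's source: it assumes "count" maps to the `t` argument and "max count" to `max_t`, and the zero guard is an added assumption.

```javascript
// Illustrative sketch only; the real implementation may differ.
function augmented(K = 0.5) {
  // The returned function accepts the shared score-function arguments;
  // only the first three are needed here.
  return function (t, sum_t, max_t) {
    // Assumption: an empty document (max_t of 0) scores 0.
    if (!max_t) return 0;
    return K + (1 - K) * (t / max_t);
  };
}

const score = augmented(); // K defaults to 0.5
console.log(score(2, 10, 4)); // 0.5 + 0.5 * (2 / 4) = 0.75
```

A larger K flattens the difference between rare and frequent terms within a document; K = 0 reduces to count / max count.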

public binary(): number

binary tf (1)

Return:

number

calculated score

public count(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

raw term count

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
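To make the shared arguments concrete, the sketch below derives most of them from a tiny tokenised collection. `termStats` is a hypothetical helper, not part of this API, and it reads the parameter descriptions literally; `sum_dt` is left out because its description does not distinguish it from `sum_t`.

```javascript
// Hypothetical helper: docs is an array of token arrays.
function termStats(docs, docIndex, term) {
  // Per-term counts inside the selected document.
  const counts = new Map();
  for (const token of docs[docIndex]) {
    counts.set(token, (counts.get(token) || 0) + 1);
  }
  // Number of documents containing each term.
  const docFreq = new Map();
  for (const doc of docs) {
    for (const token of new Set(doc)) {
      docFreq.set(token, (docFreq.get(token) || 0) + 1);
    }
  }
  return {
    t: counts.get(term) || 0,
    sum_t: docs[docIndex].length,
    max_t: Math.max(0, ...counts.values()),
    sum_all: docs.reduce((n, doc) => n + doc.length, 0),
    count_d: docs.length,
    count_dt: docFreq.get(term) || 0,
    max_dt: Math.max(0, ...docFreq.values()),
  };
}

const docs = [
  ["red", "fish", "blue", "fish"],
  ["one", "fish"],
  ["red", "car"],
];
const s = termStats(docs, 0, "fish");

// The raw term count is simply the first argument.
const count = (t) => t;
console.log(count(s.t)); // 2
```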

public async evaluate(system: System, parser: QueryParser, cases: object): object

Evaluate an IR system.

Params:

Name Type Attribute Description
system System

description

parser QueryParser

description

cases object

description

Return:

object

description

public extractObjectValues(document: object, fields: object, callback: function)

extract values from a document object

Params:

Name Type Attribute Description
document object

target document

fields object

array of fields or map of fields to score

callback function

callback function will be called for each value
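One way such a function might behave is sketched below. Everything beyond the three documented parameters is an assumption: the callback arguments `(value, field, weight)`, the default weight of 1 for array-style `fields`, and the flattening of array-valued fields are all hypothetical choices made for illustration.

```javascript
// Illustrative sketch only; the real implementation may differ.
function extractObjectValues(document, fields, callback) {
  // Assumption: an array lists field names, an object maps names to scores.
  const entries = Array.isArray(fields)
    ? fields.map((name) => [name, 1])
    : Object.entries(fields);
  for (const [name, weight] of entries) {
    const value = document[name];
    if (value === undefined || value === null) continue;
    // Assumption: array-valued fields yield one callback per element.
    for (const item of Array.isArray(value) ? value : [value]) {
      callback(item, name, weight);
    }
  }
}

const values = [];
extractObjectValues(
  { title: "Red fish", tags: ["a", "b"] },
  ["title", "tags"],
  (value) => values.push(value)
);
console.log(values); // ["Red fish", "a", "b"]
```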

public inverseDocumentFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf - number of documents / number of documents with term

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
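Read literally, the description is the ratio `count_d / count_dt`. The sketch below follows that reading and, for comparison, shows the common textbook variant that takes the logarithm of the same ratio. Both functions are illustrative assumptions rather than the library's code, and the zero guard is an added assumption.

```javascript
// Illustrative sketch: the ratio exactly as described.
function inverseDocumentFrequency(t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt) {
  // Assumption: a term that appears in no document scores 0.
  if (!count_dt) return 0;
  return count_d / count_dt;
}

// Hypothetical variant: the logarithmic form found in most IR textbooks.
function logInverseDocumentFrequency(t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt) {
  if (!count_dt) return 0;
  return Math.log(count_d / count_dt);
}

// 10 documents, 2 of which contain the term.
console.log(inverseDocumentFrequency(1, 4, 2, 4, 40, 10, 2, 6)); // 10 / 2 = 5
console.log(logInverseDocumentFrequency(1, 4, 2, 4, 40, 10, 2, 6)); // log(5), about 1.609
```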

public inverseDocumentFrequencyMax(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf max

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
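The page does not give the formula for "idf max". A common textbook definition is log(max document frequency / (1 + document frequency of the term)), and the sketch below assumes that form, mapping the two quantities to `max_dt` and `count_dt`. Treat it as an illustration, not as the library's implementation.

```javascript
// Illustrative sketch, assuming the textbook "idf max" form.
function inverseDocumentFrequencyMax(t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt) {
  // Assumption: with no document frequencies at all, the score is 0.
  if (!max_dt) return 0;
  return Math.log(max_dt / (1 + count_dt));
}

// The most widespread term occurs in 8 documents, this term in 3.
console.log(inverseDocumentFrequencyMax(1, 4, 2, 4, 40, 10, 3, 8)); // log(8 / 4) = log(2), about 0.693
```

Under this form, terms nearly as widespread as the most common term score close to zero or below it.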

public inverseDocumentFrequencySmooth(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

idf smooth

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
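"idf smooth" is not defined on this page. One widely used smoothed form is log(N / (1 + n_t)) + 1, where N is the number of documents and n_t the number containing the term; the sketch below assumes that form with N = `count_d` and n_t = `count_dt`. Other smoothed variants exist, so this is an illustration only.

```javascript
// Illustrative sketch, assuming the smoothed form log(N / (1 + n_t)) + 1.
function inverseDocumentFrequencySmooth(t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt) {
  // Assumption: an empty collection scores 0.
  if (!count_d) return 0;
  // The added 1 in the denominator avoids division by zero for unseen terms.
  return Math.log(count_d / (1 + count_dt)) + 1;
}

// 10 documents, 4 of which contain the term.
console.log(inverseDocumentFrequencySmooth(1, 4, 2, 4, 40, 10, 4, 8)); // log(10 / 5) + 1, about 1.693
```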

public logNormal(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

1 + log(count)

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
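A minimal sketch of 1 + log(count), assuming "count" is the `t` argument and a natural logarithm. The guard for absent terms is an added assumption, since log(0) is negative infinity.

```javascript
// Illustrative sketch only; the real implementation may differ.
function logNormal(t) {
  // Assumption: a term that does not occur scores 0 instead of -Infinity.
  if (t <= 0) return 0;
  return 1 + Math.log(t);
}

console.log(logNormal(1)); // 1 + log(1) = 1
console.log(logNormal(0)); // 0 (guarded)
```

The logarithm dampens repeated occurrences: going from 1 to 10 occurrences raises the score from 1 to about 3.3 rather than tenfold.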

public naiveBayes(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

anonymous function - description

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score

public probabilisticInverseDocumentFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

probabilistic idf

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
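The usual textbook definition of probabilistic idf is log((N - n_t) / n_t). The sketch below assumes that definition with N = `count_d` and n_t = `count_dt`; the guards for the two undefined edge cases are added assumptions, and the library may handle them differently.

```javascript
// Illustrative sketch, assuming the textbook form log((N - n_t) / n_t).
function probabilisticInverseDocumentFrequency(t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt) {
  // Assumption: unseen terms and terms present in every document score 0,
  // because the formula is undefined or negative infinity there.
  if (!count_dt || count_dt >= count_d) return 0;
  return Math.log((count_d - count_dt) / count_dt);
}

// 10 documents, 2 of which contain the term.
console.log(probabilisticInverseDocumentFrequency(1, 4, 2, 4, 40, 10, 2, 8)); // log(8 / 2) = log(4), about 1.386
```

Under this form a term found in more than half of the documents receives a negative score.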

public termFrequency(t: number, sum_t: number, max_t: number, sum_dt: number, sum_all: number, count_d: number, count_dt: number, max_dt: number): number

term frequency (raw count / total raw count)

Params:

Name Type Attribute Description
t number

number of times term occurs in document

sum_t number

sum of t for all terms in document

max_t number

maximum number of times any term occurs in document

sum_dt number

total number of terms in document

sum_all number

total number of terms in document collection

count_d number

total number of documents

count_dt number

total number of documents with term

max_dt number

maximum number of documents per term

Return:

number

calculated score
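A minimal sketch of raw count divided by total raw count, assuming these map to the `t` and `sum_t` arguments. The guard for an empty document is an added assumption.

```javascript
// Illustrative sketch only; the real implementation may differ.
function termFrequency(t, sum_t) {
  // Assumption: an empty document (sum_t of 0) scores 0.
  if (!sum_t) return 0;
  return t / sum_t;
}

// The term occurs twice in a four-term document.
console.log(termFrequency(2, 4)); // 0.5
```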

public unary(): number

unary idf (1)

Return:

number

calculated score
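Because the tf-style and idf-style functions on this page share one argument list, a term weight can plausibly be formed by multiplying one of each. The sketch below shows that idea with a hypothetical `combine` helper, which is not part of the documented API, and with stand-in versions of `unary` and `termFrequency`.

```javascript
// Stand-ins for the documented functions, written for illustration.
const unary = () => 1;
const termFrequency = (t, sum_t) => (sum_t ? t / sum_t : 0);
const ratioIdf = (t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt) =>
  count_dt ? count_d / count_dt : 0;

// Hypothetical combinator: passes the same statistics to both functions.
function combine(tf, idf) {
  return (...stats) => tf(...stats) * idf(...stats);
}

// Arguments: t, sum_t, max_t, sum_dt, sum_all, count_d, count_dt, max_dt
const stats = [2, 4, 2, 4, 8, 3, 2, 2];
console.log(combine(termFrequency, unary)(...stats)); // 0.5 * 1 = 0.5
console.log(combine(termFrequency, ratioIdf)(...stats)); // 0.5 * (3 / 2) = 0.75
```

With `unary` as the idf component, the weight depends only on the document itself; swapping in an idf function adds the collection-level signal.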