Basic Settings

–problem_type

define a point-wise or pair-wise problem.

point-wise: point-wise algorithm

pair-wise: pair-wise algorithm

–optimization_metric

the metric to be optimized for hyper-parameter tuning via HyperOpt

ndcg

precision

recall

hr

map

mrr

–hyperopt_trail

the number of trails of HyperOpt

–hyperopt_pack

record the searching space of hyper-parameters for HyperOpt

–algo_name

the algorithm to be executed

mostpop

itemknn

puresvd

slim

mf

fm

neumf

nfm

ngcf

multi-vae

–dataset

the dataset to be evaluated

ml-100k

ml-1m

ml-10m

ml-20m

lastfm

book-x

amazon-cloth

amazon-electronic

amazon-book

amazon-music

epinions

yelp

citeulike

netflix

–prepro

the data pre-processing strategy

origin: adopt the raw data

Fcore: recursively filter users and items that have interactions no less than N, e.g., 5core

Ffilter: only filter users and items that have interactions no less than N once, e.g., 5filter

–val_method

training and validation data splitting strategy

tsbr: time-aware split-by-ratio

rsbr: random-aware split-by-ratio

tloo: time-aware leave-one-out

rloo: random-aware leave-one-out

–test_method

training and test data splitting strategy, which should be consistent with the settings for val_method

–val_size

ratio of validation set size in the range of (0,1), e.g., 0.1 means retaining 10% of training data as validation data

–test_size

ratio of test set size in the range of (0,1), e.g., 0.2 means retaining 20% of the whole data as test data

–topk

the length of recommendation list

–fold_num

the fold number of cross-validation

–cand_num

the number of candidate items used for ranking

–sample_method

negative sampling strategy

uniform: uniformly sample negative items

low-pop: sample popular items with low rank

high-pop: sample popular items with high rank

–sample_ratio

control the ratio of popularity sampling for the hybrid sampling strategy in the range of (0,1), e.g., for the hybrid sampling strategy uniform+low-pop, –sample_ratio=0.1 means 10% of the negative items are sampled via low-pop

–num_ng

the number of negative samples

–positive_threshold

the threshold for binarizing the ratings into positve samples (for exmaple if the threshold = 4, it means the items with ratings no less than 4 will be treated as positive items)

–loss_type

type of loss function

CL: cross-entropy loss for point-wise problem

SL: square error loss for point-wise problem

BPR: BPR loss for pair-wise problem

HL: hinge loss for pair-wise problem

TL: top-1 Loss for pair-wise problem

–gpu

the ID of GPU card