WWW2013 Full Paper: Interactive Exploratory Search for Multi Page Search Results

February 13th, 2013 Comments off
Modern information retrieval interfaces typically involve multiple
pages of search results, and users who are recall minded or engaging
in exploratory search using ad hoc queries are likely to access more
than one page. Document rankings for such queries can be improved
through introducing additional context to the query provided by the
user herself through using explicit ratings or implicit actions such
as clickthroughs. Existing methods using this information usually
involved detrimental UI changes that can lower user
satisfaction. Instead, we propose a new feedback scheme that makes use
of existing UIs and does not alter user’s browsing behaviours; to
maximise retrieval performance over multiple result pages, we propose
a novel retrieval optimisation framework and show that the optimal
ranking policy should choose a diverse, exploratory ranking to display
on the first page. Then, a personalised re-ranking of the next pages
can be generated based on the user’s feedback from the first page. We
show that document correlations used in result diversification have a
significant impact on relevance feedback and its effectiveness over a
search session. TREC evaluations demonstrate that our optimal rank
strategy (including approximative Monte Carlo Sampling) can naturally
optimise the trade-off between exploration and exploitation and
maximise the overall user’s satisfaction over time against a number of
similar baselines.
  • X. Jin, M. Sloan, and J. Wang, "Interactive Exploratory Search for Multi Page Search Results," in WWW, 2013. bibtex
    Go to document
    @inproceedings{www2013-1,
      author = {Xiaoran Jin and Marc Sloan and Jun Wang},
      title = {Interactive Exploratory Search for Multi Page Search Results},
      booktitle = {WWW},
      pdf={http://web4.cs.ucl.ac.uk/staff/jun.wang/papers/www2013-1.pdf},
      year = {2013},
      }

Post to Twitter

Categories: WWW2013 Tags:

WWW2013 Full Paper: Probabilistic Group Recommendation via Information Matching

February 11th, 2013 Comments off
Increasingly, web recommender systems face scenarios where they need to serve suggestions two groups of users; for example, when families share e-commerce or movie rental web accounts. Research to date in this domain has proposed two approaches: computing recommendations for the group by merging any members’ ratings into a single profile, or computing ranked recommendations for each individual that are then merged via a range of heuristics. In doing so, none of the past approaches reason on the preferences that arise in individuals when they are members of a group. In this work, we present a probabilistic framework, based on the notion of information matching, for grouprecommendation. This model defines group relevance as a combination of the item’s relevance to each user as an individual and as a member of the group; it can then seamlessly incorporate any group recommendation strategy in order to rank items for a set of individuals. We evaluate the model’s efficacy of generating recommendations for both single individuals and groups using the MovieLens and MoviePilot data sets. In both cases, we compare our results with baselines and state-of-the-art collaborative filtering algorithms, and show that the model outperforms all others over a variety of ranking metrics.
  • J. Gorla, N. Lathia, S. Robertson, and J. Wang, "Probabilistic Group Recommendation via Information Matching," in WWW, 2013. bibtex
    Go to document
    @inproceedings{www2013-2,
      author = {Jagadeesh Gorla and Neal Lathia and Stephen Robertson and Jun Wang},
      title = {Probabilistic Group Recommendation via Information Matching},
      booktitle = {WWW},
      pdf={http://web4.cs.ucl.ac.uk/staff/jun.wang/papers/www2013-2.pdf},
      year = {2013},
      }

Post to Twitter

Categories: WWW2013 Tags:

KTP Associate in Data Mining and Analytics for Home Energy Management Services

August 10th, 2012 No comments

Are you looking for a challenging role within a dynamic and ambitious team, who are rapidly becoming the leaders in the home energy management sector? Do you have the ability and confidence to manage a strategic and technical project, crucial to support increased company growth?

The successful candidate will deliver a Knowledge Transfer Project (KTP) in partnership with PassivSystems Ltd, based in Newbury. PassivSystems Ltd (http://www.passivsystems.com/) develops and markets energy management and monitoring systems to home owners.

Under the co-supervision of Dr Jun Wang at UCL and Dr Edwin Carter, Senior Engineer at PassivSystems Ltd, the postholder will develop intelligent energy advisory and feedback data systems to improve home energy advice and feedback, using consumer profiles and energy consumption data to automatically recommend and optimise services and influence new product development.

This challenging position is based at PassivSystems in Newbury and will provide an ambitious and committed candidate with invaluable personal and career development. The post is funded for 24 months initially from October 2012.

For more information, please check here.

Post to Twitter

Categories: News Tags:

CIKM2012 Full Paper: Sequential Selection of Correlated Ads by POMDPs

July 19th, 2012 No comments

Online advertising has become a key source of revenue for both web search engines and online publishers. For them, the ability of allocating right ads to right webpages is critical because any mismatched ads would not only harm web users’ satisfactions but also lower the ad income. In this paper, we study how online publishers could optimally select ads to maximize their ad incomes over time. The conventional offline, content-based matching between webpages and ads is a fine start but cannot solve the problem completely because good matching does not necessarily lead to good payoff. Moreover, with the limited display impressions, we need to balance the need of selecting ads to learn true ad payoffs (exploration) with that of allocating ads to generate high immediate payoffs based on the current belief (exploitation). In this paper, we address the problem by employing Partially observable Markov decision processes (POMDPs) and discuss how to utilize the correlations of ads to improve the efficiency of the exploration and increase ad incomes in a long run. Our mathematical derivation shows that the belief states of correlated ads can be naturally updated using a formula similar to collaborative filtering. To test our model, a real world ad dataset from a major search engine is collected and categorized. Experimenting over the data, we provide an analyse of the effect of the underlying parameters, and demonstrate that our algorithms significantly outperform other strong baselines.

[bibtex file=mypublications.bib key=Wang:cikm2012]


Post to Twitter

Categories: CIKM2012 Tags:

Fully Funded PhD studentships in Customer Behaviour Analytics

July 6th, 2012 No comments

UCL invites expressions of interest in fully funded PhDs in Computer Science, Statistics and Psychology. The PhDs will involve working with a world leading company which has purchasing data from over 350 million people in 28 countries. They process and analyse over a billion rows of data each week and have customer data spanning 5 years.

The vast amount of loyalty card transaction data and market research attitudinal data allows a great depth of understanding in terms of what customers are doing and what they are thinking. These insights can be further validated through real‐life experiments, both online and in‐store.

A typical supermarket sells 50,000 items across 100 categories. When deciding on which product to buy factors such as: price, quality, brand, recommendation, offers etc. all play their part in the decision‐making process for a customer. By understanding the complex interaction of these factors customers can be treated as an individual and their shopping experience improved and made more relevant to them.

This is a fantastic chance to explore the cutting‐edge of extracting real‐life insight into customer behaviour from the increasingly massive and complex data available.

We are happy to consider UK/EU candidates from any scientific discipline. However, the successful candidate should possess:

  • strong academic background (based on educational qualifications)
  • strong quantitative skills including data manipulation strong research skills (especially in large scale quantitative studies)
  • strong communication skills – ability to explain complex techniques to people who are not technical creative problem solver
  • passionate about Customer Behaviour Analytics and retail

Expressions of interest (together with a CV) should be emailed to:

Dr. Bradley Love (Psychology) – bradley.c.love@gmail.com

Prof. Patrick Wolfe (Statistical Science) – patrick@stats.ucl.ac.uk, or,

Dr. Jun Wang (Computer Science) – j.wang@cs.ucl.ac.uk

Post to Twitter

Categories: Uncategorized Tags:

SIGIR2012 Full Paper: Adaptive Diversification of Recommendation Results via Latent Factor Portfolio

May 15th, 2012 No comments

The variance of Latent Factors

This paper studies result diversification in collaborative filtering. We argue that the diversification level in a  recommendation list should be adapted to the target users’  individual situations and needs. Different users may have different   ranges of interests — the preference of a highly focused user might  include only few topics, whereas that of the user with broad  interests may encompass a wide range of topics. Thus, the  recommended items should be diversified according to the interest  range of the target user. Such an adaptation is also required due to  the fact that the uncertainty of the estimated user preference model  may vary significantly between users. To reduce the risk of the  recommendation, we should take the difference of the uncertainty  into account as well.

In this paper, we study the adaptive diversification  problem theoretically. We start with commonly used latent factor models and  reformulate them using the mean-variance analysis from the portfolio  theory in text retrieval. The resulting Latent Factor Portfolio  (LFP) model captures the user’s interest range and the uncertainty  of the user preference by employing the variance of  the learned user latent factors. It is shown that the  correlations between items (and thus the item diversity) can be obtained by  using the correlations between latent factors (topical diversity), which in  return significantly reduce the computation load. Our mathematical derivation  also reveals that diversification is necessary, not only for risk-averse  system behavior (non-adpative), but also for the target users’ individual  situations (adaptive), which are represented by the distribution and  the variance of the latent user factors. Our experiments confirm the  theoretical insights and show that LFP succeeds in improving latent factor models by  adaptively introducing recommendation diversity to fit the individual user’s  needs.

  • Y. Shi, X. Zhao, J. Wang, M. Larsona, and A. Hanjalic, "Adaptive Diversification of Recommendation Results via Latent Factor Portfolio," in SIGIR, 2012. bibtex
    Go to document
    @INPROCEEDINGS{Wang:sigir2012,
      author = {Yue Shi and Xiaoxue Zhao and Jun Wang and Martha Larsona and Alan Hanjalic},
      TITLE = {Adaptive Diversification of Recommendation Results via Latent Factor Portfolio},
      BOOKTITLE = {SIGIR},
      pdf={http://web4.cs.ucl.ac.uk/staff/jun.wang/papers/2012-sigir-lfp.pdf},
      YEAR = {2012}
    }

Post to Twitter

Categories: SIGIR2012 Tags:

Internet Advertising: An Interplay among Advertisers, Online Publishers, Ad Exchanges and Web Users

May 2nd, 2012 No comments
The various players of Internet advertising and the trading process

The various players of Internet advertising and the trading process

Internet advertising, aka Web advertising or online advertising, is
a fast growing business. It has already proved to be significantly important in digital economics. For example, it is vitally important for both web search engines and online content providers and publishers because web advertising provides them with major sources of revenue. Its presence is increasingly important for the whole media industry due to the influence of the Web. For advertisers, it is a smarter alternative to traditional marketing media such as TVs
and newspapers. As the web evolves and data collection continues, the design of methods for more targeted, interactive, and friendly advertising may
have a major impact on the way our digital economy evolves, and to aid societal development.
Towards this goal mathematically well-grounded Computational Advertising methods are becoming necessary and will continue to develop as a fundamental tool towards the Web.  As a vibrant new discipline, Internet advertising requires effort from different research domains including Information Retrieval, Machine Learning, Data Mining and Analytics, Statistics, Economics, and even Psychology to predict and understand user behaviours.  In this paper, we provide a comprehensive survey on Internet advertising, discussing and classifying the research issues, identifying the recent technologies, and suggesting its future directions.  To have a comprehensive picture, we first start with a brief history, introduction, and classification of the industry and present a schematic view of the new advertising ecosystem. We then introduce four major participants, namely advertisers, online publishers, ad exchanges and web users; and through analysing and discussing the major research problems and existing solutions from their perspectives respectively, we discover and aggregate the fundamental problems that characterise the newly-formed research field and capture its potential future prospects.

This paper is under submission.

  • S. Yuan, A. Z. Abidin, M. Sloan, and J. Wang, "Internet Advertising: An Interplay among Advertisers, Online Publishers, Exchanges and Web Users," Under Submission, ArXiv e-prints http://arxiv.org/pdf/1206.1754v1.pdf, 2012. bibtex
    Go to document
    @ARTICLE{ipm2012,
      author = {Shuai Yuan and Ahmad Zainal Abidin and Marc Sloan and Jun Wang},
      title = {Internet Advertising: An Interplay among Advertisers, Online Publishers, Exchanges and Web Users},
      journal = {Under Submission, ArXiv e-prints http://arxiv.org/pdf/1206.1754v1.pdf},
      year = {2012},
      pdf={http://arxiv.org/pdf/1206.1754v1.pdf},
      }

Post to Twitter

Categories: Computational Advertising Tags:

ECIR2012 Best Paper: Top-k Retrieval using Facility Location Analysis

March 27th, 2012 No comments

The top-k retrieval problem aims to find the optimal set of k documents from a number of relevant documents given the user’s query.
The key issue is to balance the relevance and diversity of the top-k search results. In this paper, we address this problem using Facility Location
Analysis taken from Operations Research, where the locations of facilities are optimally chosen according to some criteria. We show how this
analysis technique is a generalization of state-of-the-art retrieval models for diversification (such as the Modern Portfolio Theory for Information
Retrieval), which treat the top-k search results like “obnoxious facilities” that should be dispersed as far as possible from each other. However,
Facility Location Analysis suggests that the top-k search results could be treated like “desirable facilities” to be placed as close as possible to their
customers. This leads to a new top-k retrieval model where the best representatives of the relevant documents are selected. In a series of
experiments conducted on two TREC diversity collections, we show that significant improvements can be made over the current state-of-the-art
through this alternative treatment of the top-k retrieval problem.

  • G. Zuccon, L. Azzopardi, D. Zhang, and J. Wang, "Top-k Retrieval using Facility Location Analysis," in ECIR Best Paper, 2012. bibtex
    Go to document
    @inproceedings{ecir2012,
      author = {Guido Zuccon and Leif Azzopardi and Dell Zhang and Jun Wang},
      title = {Top-k Retrieval using Facility Location Analysis},
      booktitle = {ECIR Best Paper},
      year = {2012},
      url={http://www.dcs.bbk.ac.uk/~dell/publications/dellzhang_ecir2012.pdf}
    }

Post to Twitter

Categories: Awards, ECIR2012, Events Tags:

WWW2012 Poster: Selling Futures Online Advertising Slots via Option Contracts

March 27th, 2012 No comments
Ad Prices in a Spot Market

Ad Prices in a Spot Market

Many online advertising slots are sold through bidding mechanisms by publishers and search engines. Highly affected by the dual force of supply and demand, the prices of advertising slots vary significantly over time. This then influences the businesses whose major revenues are driven by online advertising, particularly for publishers and search engines. To address the problem, we propose to sell the future ad- vertising slots via option contracts (also called ad options). The ad option can give its buyer the right to buy the future advertising slots at a prefixed price. The pricing model of ad options is developed in order to reduce the volatility of the income of publishers or search engines. Our experimental results confirm the validity of ad options and the embedded risk management mechanisms.

  • J. Wang and B. Chen, "Selling Futures Online Advertising Slots via Option Contracts," in WWW Poster, 2012. bibtex
    Go to document
    @inproceedings{www2012-adoptions,
      author = {J. Wang and B. Chen},
      title = {Selling Futures Online Advertising Slots via Option Contracts},
      booktitle = {WWW Poster},
      year = {2012},
      pdf={http://web4.cs.ucl.ac.uk/staff/jun.wang/papers/2012-www-adoptions.pdf},
      }

Post to Twitter

WWW2012 Full Paper: Using Control Theory for Stable and Efficient Recommender Systems

February 16th, 2012 No comments

The aim of a web-based recommender system is to provide highly accurate and up-to-date recommendations to its users; in

A mass attached to a spring and damper

practice, it will hope to retain its users over time. However, this raises unique challenges. To achieve complex goals such as keeping the recommender model up- to-date over time, we need to consider a number of external requirements. Generally, these requirements arise from the physical nature of

A controlled recommender system

the system, for instance the available computational resources. Ideally, we would like to design a system that does not deviate from the required outcome. Modeling such a system over time requires to describe the internal dynamics as a combination of the underlying recommender model and the its users’ behavior. We propose to solve this problem by applying the principles of modern con- trol theory—a powerful set of tools to deal with dynamical systems—to construct and maintain a stable and robust recommender system for dynamically evolving environments. In particular, we introduce a design principle by focusing on the dynamic relationship between the recommender sys- tem’s performance and the number of new training samples the system requires. This enables us to automate the control other external factors such as the system’s update frequency. We show that, by using a Proportional-Integral-Derivative controller, a recommender system is able to automatically and accurately estimate the required input to keep the out- put close to a pre-defined requirements. Our experiments on a standard rating dataset show that, by using a feedback loop between system performance and training, the trade- off between the effectiveness and efficiency of the system can be well maintained. We close by discussing the widespread applicability of our approach to a variety of scenarios that recommender systems face.

  • T. Jambor, J. Wang, and N. Lathia, "Using Control Theory for Stable and Efficient Recommender Systems," in WWW, 2012. bibtex
    Go to document Go to document
    @inproceedings{www2012,
      author = {T. Jambor and J. Wang and N. Lathia},
      title = {Using Control Theory for Stable and Efficient Recommender Systems},
      booktitle = {WWW},
      year = {2012},
      pdf={http://web4.cs.ucl.ac.uk/staff/jun.wang/papers/2012-www12-control-cf.pdf},
      url={http://web4.cs.ucl.ac.uk/staff/jun.wang/blog/2012/02/16/control-theory-recommender/},
      }

Post to Twitter