aprendizaje de reglas para sistemas de recomendación

Aprendizaje de reglaspara sistemas de recomendacion aplicando minerıa semantica

Blanca A. Vargas Govea

Grupo de Sistemas de RecomendacionDepartamento de Ciencias Computacionales

cenidet

30 de noviembre de 2011

Contenido

1 Motivacion

2 Estado actual

3 Propuesta de solucion: minerıa semantica

2 / 16

Resultados selecci on de atributos

Lambda

Pre

cisi

on

0.45

0.50

0.55

0.60

0.65

0.70

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

0.1

1.0

subset

All

B

C

D

E

F

Lambda

Rec

all

0.15

0.20

0.25

0.30

0.35

0.0

0.2

0.4

0.6

0.8

1.0

subset

All

B

C

D

E

F

Lambda

ND

CG

0.46

0.48

0.50

0.52

0.54

0.56

0.0

0.2

0.4

0.6

0.8

1.0

subset

All

B

C

D

E

F

Enfoque contextual: el peor evaluado

3 / 16

Problemas comunes

Overfitting

El numero deparametros en elmodelo es muy grande.

Redundancia.

Ajuste al ruido.

Pobres resultados deprediccion.

Underfitting

El modelo tiene muypocos parametros.

Incapacidad decapturar la variabilidadimportante en losdatos.

Pobres resultados deprediccion.

Objetivo

Obtener mediante aprendizaje, reglas que describan el conjunto dedatos de modo que la evaluacion mejore.

4 / 16

Surfeous: ası est an las cosas

Modelodel usuario

Modelode servicios

Modelodel entorno

Reglas

5 / 16

Atributos

Service model (23 attributes)latitude,longitude,address,city,state,country,fax,ZIP,alcohol,smoking,dress,accessibility,price,franchise,ambiance,space,services,parking,cuisine,phone,accepts,days,hoursUser model (21 attributes)latitude,longitude,smoking,alcohol,dress,ambiance,age,transportation,marital-status,children,interests,personality,religion,occupation,favorite-color,weight,height,budget,accepts,accessibility,cuisineEnvironment model (2 attributes)time,weather

6 / 16

Las reglas 1/4

Son 95 / a partir de un estudio de mercadotecnia / grupos deconsumidores

user - service profileperson(X ) ∧ hasOccupation(X , student) ∧restaurant(R) ∧ hasCost(R, low) → select(X ,R)

user - environment profileperson(X ) ∧ isJapanese(X , true) ∧queryPlace(X ,USA) ∧ restaurant(R) ∧isVeryClose(R, true) → select(X ,R)

environment - service profilecurrentWeather(today , rainy) ∧ restaurant(R) ∧space(R, closed) → select(R)

RelationslikesFood(X ,Y ) X : person, Y : cuisine-typecurrentWeather(X ,Y ) X : query, Y : weatherspace(X ,Y ) X : restaurant, Y : {closed , open}

7 / 16

Las reglas 2/4

</regla>

<regla nombre="chaviza3">

<antecedente>Persona(?x) ^ tieneEdad(?x, ?y) ^

swrlb:greaterThanOrEqual(?y, 12) ^

swrlb:lessThanOrEqual(?y, 24) ^ Restaurant(?r) ^

IDPlace(?r,?id) ^ esFranquicia(?r, true)</antecedente>

<consecuente>sqwrl:selectDistinct(?id)</consecuente>

<tipo>2</tipo>

</regla>

8 / 16

Las reglas 3/4

<regla nombre="retro">



swrlb:lessThanOrEqual(?y, 50) ^ interes(?x, "retro") ^

tieneNacionalidad(?x, ?n) ^ tieneCuisineNacional(?n, ?c) ^

Restaurant(?r) ^ IDPlace(?r,?id) ^

sirveAlimento(?r, ?c)</antecedente>


<tipo>2</tipo>

</regla>

9 / 16

Las reglas 4/4

<regla nombre="bioconsumidor">



swrlb:lessThanOrEqual(?y, 60) ^ interes(?x, "naturaleza") ^

Restaurant(?r) ^ IDPlace(?r,?id) ^

sirveAlimento(?r, Vegetarian)</antecedente>


<tipo>2</tipo>

</regla>

10 / 16

¿Que se hace?

Generalmente disenadas por un experto

Tendencia

Semantic Web Mining[Lavrac et al., 2011, Vavpeti & Lavrac, 2011,Eiter et al., 2008]

Relaciones entre conceptos[Ciaramita et al., 2005]

Reglas[Aitken, 2002, Tegos et al., 2008,

Manine et al., 2009, Teflioudi, 2011,Siebers & Kruijsen, 2006]

Reglas de comportamiento[Valiente-Rocha & Tello, 2010,Angeletou et al., 2011]

11 / 16

Minerıa Sem antica

Con la expansion de la Web Semantica y la disponibilidad deontologıas la cantidad de informacin semantica esta en rapidocrecimiento.

Las ontologıas de dominio definen el espacio de hipotesis.

Los datos son usados como medio para restringir y guiar elproceso de busqueda de hipotesis y evaluacion.

El reto actual es minar el conocimiento contenido en ontologıasde dominio.

[Lavrac et al., 2011]

12 / 16

Lo que se propone

1 Seleccion de atributos de los modelos. Uno ya esta.

2 Proceso de aprendizaje relacional a partir de datos y ontologıa,ILP (Programacion Logica Inductiva).

3 Obtener relaciones/reglas.

4 Pruebas: se espera que el enfoque contextual mejore.

13 / 16

ILP

Enfoque de aprendizaje automatico en el cual las relaciones sonaprendidas a partir de ejemplos.

La logica de predicados es usada como lenguaje de hipotesis ylos resultados son programas en Prolog.

A diferencia de los enfoques proposicionales permite incorporarconocimiento del dominio.

Aleph (A Learning Engine for Proposing Hypotheses).

14 / 16

ILP

15 / 16

En proceso

Estado del arte detallado.

Seleccion de atributos del perfil de usuario.

16 / 16

Aitken, J. S. (2002).

Learning information extraction rules: An inductive logic programming approach.

Angeletou, S., Rowe, M., & Alani, H. (2011).

Modelling and analysis of user behaviour in online communities.In International Semantic Web Conference (1) (pp. 35–50).

Ciaramita, M., Gangemi, A., Ratsch, E., Saric, J., & Rojas, I. (2005).

Unsupervised learning of semantic relations between concepts of a molecular biology ontology.In Proceedings of the 19th international joint conference on Artificial intelligence, IJCAI’05 (pp. 659–664). San Francisco,CA, USA: Morgan Kaufmann Publishers Inc.

Eiter, T., Ianni, G., Krennwallner, T., & Polleres, A. (2008).

Reasoning web.(pp. 1–53).

Lavrac, N., Vavpetic, A., Soldatova, L., Trajkovski, I., & Novak, P. K. (2011).

Using ontologies in semantic data mining with segs and g-segs.In Proceedings of the 14th international conference on Discovery science, DS’11 (pp. 165–178). Berlin, Heidelberg:Springer-Verlag.

Manine, A.-P., Alphonse, ., & Bessires, P. (2009).

Learning ontological rules to extract multiple relations of genic interactions from text.I. J. Medical Informatics, 78(12), 31–38.

Siebers, Q. H. & Kruijsen, P.-P. (2006).

Implementing inference rules in the topic maps model.In Proceedings of the Topic Maps Research and Applications Conference (TMRA).

Teflioudi, C. (2011).

Learning soft inference rules in large and uncertain knowledge bases.Master’s thesis, Universitat des Saarlandes, Saarbrucken.

Tegos, A., Karkaletsis, V., & Potamianos, A. (2008).

Learning of semantic relations between ontology concepts using statistical techniques.

16 / 16

In Proceedings of the Workshop on High-Level Information Extraction at ECML-PKDD 2008, HLIE 2008 Antwerp, Belgium.

Valiente-Rocha, P. A. & Tello, A. L. (2010).

Ontology and swrl-based learning model for home automation controlling.In ISAmI (pp. 79–86).

Vavpeti, A. & Lavrac, N. (2011).

Using ontologies in semantic data mining with g-segs and aleph.In The 21st International Conference on Inductive Logic Programming (ILP 2011) Cumberland Lodge, United Kingdom.

16 / 16

aprendizaje de reglas para sistemas de recomendación

Education