nancieras en m xico - itam

Descomposición de las crisis financieras en México

Sergio Villalvazo Martín

Binary decision making with imperfect information

Diego J. Jiménez Hernández

A survey on the weak evidence of non-fundamentalness in economicapplications

Eduardo Laguna-Müggenburg e Ignacio Lobato

The effects of the access to transportation networks on economicperformance. Evidence from Mexican municipalities

José María Rodríguez Valadez

Análisis del comportamiento de pignorantes en contratos prendarios

Pedro Felipe Magaña Sáenz

El impacto de la discapacidad en el hogar sobre la asistencia escolar:evidencia de México

Zeky Ahmed Murra Antón

2015

3421

Mesa Directiva

Daniel Ramos Menchelli Diana Sverdlin Lisker Gabriel Levin Konigsberg

Consejo Editorial

Alejandro Coronado Narváez, Aldo Juárez Acevedo, Ana María Gutiérrez Jiménez, Andrés Nigenda Zárate, Ari Bronsoler Nurko, Arístides Adán Martínez Arriaga, Bernardo

Chabert Bravo, Diego Cid Ortiz, Eduardo Hidalgo García, Federico Kochen Gómez, Héctor Huitzil Granados, Ilan Morgenstern Kaplan, José Alejandro García Cueva, José

Luis Cruz Álvarez, José Luis Navarrete Martínez, José Ramón Morán van Gelderen, Kin Salvador Gutiérrez Olivares, Mario Nava Casiano, Mario Vázquez Corte, Rafael Josymar

Jiménez Durán, Stefano Lord Medrano, Victor Leyva Vega.

Alejandro Hernández, Arturo Aguilar, Arturo Fernández, Diego Domínguez, Enrique Seira, Felipe Meza, Germán Rojas, Ignacio Lobato, Ignacio Trigueros, Jorge Alonso, Joyce

Sadka, Martina Copelman, Nicolas Melissas, Valeria Moy.

Fundada en MCMXCV

Responsable: Jorge Nuño, Número de Certificado de Licitud de Título 9691 Número de Certificado de Licitud de Contenido 6762 Número de Reserva de Derechos de Autor 002330/96

ISSN: 1405-7085 © Derechos Reservados ITAM, México D.F, 2015

Publicación de los estudiantes de Economía del ITAM

Número 34

Precio general: $120.00 Suscripciones anuales

La Gaceta de Economía del ITAM aparece en primavera y otoño. Los artículos y ensayos

publicados son responsabilidad del autor. Correspondencia: Edificio de Investigadores. Camino a Santa Teresa # 930, Héroes de Padierna, 10700

México, D.F. Tel. +52 (55) 56 28 4000 ext. 2905 E-mail: [email protected]

Página Web: http:// www.gacetadeeconomia.itam.mx

EDITORIAL

Siguiendo con el proceso institucional de la Gaceta de Economía, a princi-pios del semestre Primavera 2015 una nueva Mesa Directiva fue elegida. Losnuevos miembros estamos comprometidos en mantener el nivel y los estánda-res de excelencia que han caracterizado a la Gaceta de Economía durante sus20 años de existencia, así como seguir fomentando la investigación de calidadtanto de alumnos como profesores.

Para este número, el nuevo equipo se complace en presentarles seis ar-tículos compuestos por investigaciones originales realizadas por alumnos, exalumnos y profesores del Instituto Tecnológico Autónomo de México, cuyostemas y metodologías varían, generando un número rico e interesante en con-tenido.

En el primer artículo de esta edición, Sergio Villalvazo Martín, alumno dela maestría en teoría económica del ITAM, hace un estudio comparativo entrela reciente crisis financiera en México y la que tuvo lugar en el año de 1994.Para ello utiliza un modelo de equilibrio general para descomponer los efectosde ambas crisis.

El artículo de Diego Javier Jiménez Hernández, alumno de la maestría enteoría económica del ITAM, que en esta ocasión publica la Gaceta de Economía,encuentra condiciones para que un resultado binario, dada cierta evidencia,sea racionalizable. No sólo esto, el autor va más allá y propone medidas quedeterminan qué tan lejos de la racionalidad puede estar un cierto grupo deestas decisiones y propone algoritmos para implementar sus resultados.

Por otro lado Eduardo Laguna-Müggenburg e Ignacio Lobato, alumno yprofesor e investigador del ITAM respectivamente, realizan una valiosa re-visión de la literatura en su artículo A survey on the weak evidence of non-fundamentalness in economic applications. En él argumentan que los ejemplos en-contrados en la literatura sobre modelos no-fundamentales no son suficientespara argumentar con certezas que estos son de relevancia empírica.

En Análisis del Comportamiento de Pignorantes en Contratos Prendarios,Pedro Magaña, ex alumno del ITAM, presenta un experimento aleatorio quetiene como fin la comprensión del actuar de los pignorantes y su respuestaante diversos contratos prendarios.

José María Rodríguez, en What is the Effect of Access to Transportation Net-works on Economic Performance?, busca explicar la relación entre el acceso a lared de transporte y el desempeño económico usando como variable instrumen-tal la red histórica de transporte.

En el último artículo, Zeky Ahmed Murra Antón explora el impacto que tie-ne la presencia de un individuo con discapacidad sobre la asistencia escolar delos individuos dentro del hogar, explorando, primeramente, las implicacionesteóricas y después contrastando con los datos.

El nuevo número está compuesto de artículos que tratan acerca de temasvariados e interesantes, conformando una mezcla rica en contenidos y meto-dologías. Esperamos que este nuevo número de la Gaceta de Economía sea desu agrado.

Índice

1 Descomposición de las crisis financieras en MéxicoSergio Villalvazo Martín .................................................................................... 1

2 Binary decision making with imperfect informationDiego J. Jiménez Hernández ............................................................................... 19

3 A survey on the weak evidence of non-fundamentalness in econo-mic applicationsEduardo Laguna-Müggenburg e Ignacio Lobato................................................ 41

4 The effects of the access to transportation networks on economicperformance. Evidenci from Mexican muncipalititesJosé María Rodríguez Valadez ............................................................................ 55

5 Análisis del comportamiento de pignorantes en contratos prenda-riosPedro Felipe Magaña Sáenz................................................................................ 96

6 El impacto de la discapacidad en el hogar sobre la asistencia escolar:evidencia de MéxicoZeky Ahmed Murra Antón................................................................................. 127

DESCOMPOSICIÓN DE LAS CRISIS FINANCIERAS EN MÉXICO*

SERGIO VILLALVAZO MARTÍN**

Durante las crisis de 1994-95 y 2008-09 la economía mexicana registró fuertes caí-das en la producción agregada que parecerían similares. Sin embargo, al desagregarla producción entre bienes comerciables y no comerciables, las producciones tuvie-ron variaciones distintas en cada crisis. En este trabajo se utiliza un modelo dinámicode equilibrio general de economía pequeña y abierta para analizar las variaciones dela producción sectorial, el consumo, la inversión y las cuentas externas durante unacrisis y para poder explicar las diferencias entre la crisis de 1994-95 y de 2008-09.Se encuentra que durante las dos crisis, la caída en el PIB agregado se explica en sumayoría por una caída en las productividades sectoriales. Sin embargo, cuando seanalizan los efectos de cada choque por separado, la diferencia en las magnitudesde las variaciones sectoriales puede ser explicada por choques en la tasa de interésinternacional y en la demanda externa.

CLASIFICACIÓN JEL: F41, O40PALABRAS CLAVE: Crecimiento, crisis financieras, tipo de cambio real

1. INTRODUCCIÓN

En las últimas décadas la economía mexicana atravesó dos grandes crisis.La primera, inició en diciembre de 1994 y provocó una caída de 8.6 % en el pro-ducto interno bruto por trabajador de 1995. Dicha crisis es conocida como lacrisis del tequila y se extendió a varios países latinoamericanos en desarrollo.En particular, la economía mexicana experimentó un aumento en la tasa de in-terés y como se encontraba con altos niveles de deuda externa, los efectos de lacrisis se amplificaron. Por otro lado, la segunda crisis inició en 2008 y ocasionóuna caída de 6.8 % en el producto por trabajador de 2009. Esta segunda crisis esconocida como la gran recesión la cual inició en Estados Unidos y se extendióa varios países hasta convertirse en una crisis global. Ambos periodos de crisisestuvieron caracterizados por una amplia caída en la producción acompañadapor una fuerte recuperación (Gráfica 1). Asimismo, durante las dos crisis el ac-ceso a los mercados financieros se restringió debido a un aumento en la tasa deinterés internacional.

Si bien ambos episodios presentan un comportamiento agregado muy simi-lar, al desagregar los datos se encuentran diferencias importantes entre ambascrisis. En particular, separando la producción de bienes en dos sectores: comer-ciables y no comerciables,1 se puede apreciar dichas diferencias (Gráfica 2).

*Las opiniones expresadas en este artículo son responsabilidad del autor y no necesariamentereflejan la postura del Banco de México.

**Economista de la Dirección General de Investigación Económica del Banco de México y estu-diante de la Maestría en Teoría Económica del ITAM. Agradezco a Carlos Urrutia y Felipe Mezapor sus comentarios que mejoraron y enriquecieron el presente trabajo. Cualquier error u omisiónes responsabilidad del autor. Email: [email protected].

1Para desagregar la producción entre sectores se siguió la convención usada por Kehoe y Ruhl(2009). Los bienes comerciables corresponden al sector agropecuario, minero y manufacturero.Mientras que los bienes no comerciables corresponden al sector de la construcción, electricidady servicios.

2 DESCOMPOSICIÓN DE LAS CRISIS FINANCIERAS EN MÉXICO

FIGURA 1: PIB POR PERSONA EN EDAD DE TRABAJAR

(A) T = 1995. ÍNDICE 1994 = 100 (B) T = 2009. ÍNDICE 2008 = 100

FIGURA 2: PIB POR PERSONA EN EDAD DE TRABAJAR

En 1994-95, la producción de bienes no comerciables experimentó una ma-yor caída en relación a la producción de bienes comerciables, mientras quedurante 2008-09 se observa lo contrario. Esta diferencia en magnitudes es laprincipal motivación de estudiar ambas crisis y analizar cuáles son los diferen-tes choques que llevan a la economía a comportarse de manera diferenciada.

En este trabajo se estudian ambas crisis con un modelo de equilibrio ge-neral dinámico sin incertidumbre en el cual los agentes son sorprendidos pordistintos choques. En específico, se analizan tres choques: a las productivida-des factoriales totales de cada sector (PTF), a la demanda externa y a la tasade interés. Se calibra el modelo con datos de la economía mexicana y se buscadescomponer las variaciones de distintas variables de la economía para po-der identificar las contribuciones a dichas variaciones de los distintos choques.Aunque parecería que en ambas ocasiones la producción de bienes comercia-bles viene disminuyendo desde años anteriores a la crisis, en este trabajo se

SERGIO VILLALVAZO MARTÍN 3

estudia únicamente la dinámica de la economía durante los años en que seobserva la mayor caída en la producción y que vienen seguidos por recupera-ciones importantes. Es decir, los episodios de 1994-95 y 2008-09.

Después de resolver el modelo y analizar los periodos por separado, se en-cuentra que durante ambas crisis la caída en el PIB agregado se explica en sumayoría por la caída en las productividades sectoriales. Sin embargo, cuandose analizan los efectos por separado, en la crisis de 2008-09, la mayor caída enla producción de los bienes comerciables relativa a la de los bienes no comer-ciables es explicada por los choques a la tasa de interés y a la demanda externa.

El trabajo está estructurado de la siguiente manera. La Sección 2 da unabreve descripción de la literatura relacionada. En particular, se describe la lite-ratura de crisis de balanza de pagos (conocidas como crisis de “Sudden Stop”),de la sincronización de ciclos entre economías y del comportamiento de la eco-nomía mexicana durante las crisis. En la Sección 3 se describe el modelo utili-zado y se presenta el sistema de ecuaciones que caracteriza su solución. En laSección 4 se analizan las crisis con el modelo y se realiza una descomposiciónde los distintos choques y sus efectos sobre las variables de interés. La Sección5 concluye.

2. LITERATURA RELACIONADA

En la literatura existen varios documentos que estudian las crisis de tipo“Sudden Stop”. Por ejemplo, Kehoe y Ruhl (2009) analizan con un modelo dedos sectores la apertura a los mercados financieros internacionales de Méxicoen 1990. Encuentran que México, al ser un país en desarrollo y con bajos ni-veles de capital, pasa de ser un país acreedor a ser un país deudor en pocosaños después de la apertura. Posteriormente, estudian la crisis de 1994 intro-duciendo choques inesperados para los agentes y analizan los cambios en lasvariables de interés. En particular, van agregando choques con la intención deestudiar los movimientos de la balanza comercial y el PIB – los mismos que seestudiaron con la apertura financiera. Encuentran que la gran magnitud en lacaída en el PIB sólo se logra explicar cuando se introduce una caída exógenaen la PTF.

Otro ejemplo de estudios de crisis es el presentado por Conesa, Kehoe yRuhl (2007). En este artículo se analiza la depresión de los años noventa queexperimentó Finlandia bajo la óptica del modelo de crecimiento neoclásico. Enparticular, se inicia con el modelo básico de un bien de consumo y se van agre-gando elementos hasta llegar a un modelo de dos sectores con bienes distin-tos de inversión. Los autores encuentran que la fuerte caída en la producciónfue causada principalmente por una caída en la PTF y por un aumento en losimpuestos, tanto al trabajo como al consumo, lo que ocasionó que las horastrabajadas también disminuyeran.

Mendoza (2010) estudia las crisis de “Sudden Stop” y financieras con unenfoque de ciclos económicos reales. De esta forma, incorpora en el modelotres procesos estocásticos: en la PTF, en la tasa de interés, y en el precio del


bien intermedio importado. En su modelo, los agentes tienen restricciones decrédito endógenas que dependen del nivel de capital. Por esta razón, cuando laeconomía está muy endeudada, un choque de magnitud normal en un modelode ciclos económicos provoca una fuerte caída en el capital lo que a su vezprovoca que la restricción de crédito se endurezca, ocasionando una caída enel consumo. En este sentido, la restricción de crédito aumenta la magnitud eintroduce asimetría en los efectos de los choques.

Otra literatura relacionada es la que estudia la sincronización de los ciclosentre dos economías. En particular, Chiquiar y Ramos-Francia (2005) exploranla relación que existe entre el comercio y los ciclos económicos de México yEstados Unidos. Los autores encuentran que después de la entrada en vigor delTratado de Libre Comercio de América del Norte en 1995, los ciclos económicosentre los dos países se sincronizan. Además, también encuentran evidencia deque existe una relación de largo plazo entre las dos economías.

En cuanto al análisis de la economía mexicana con un modelo neoclásicode dos sectores, Meza y Urrutia (2011) estudian el periodo que abarca de 1988a 2002. En dicho periodo, México experimentó una apreciación importante ensu tipo de cambio real. En específico, los autores introducen a su modelo fric-ciones financieras, costos de ajuste en el capital y en la asignación de trabajoentre sectores. Encuentran que principalmente la apreciación del tipo de cam-bio real se puede explicar por una caída en los costos de financiamiento (caídaen la tasa de interés internacional) y por un crecimiento diferenciado entre lasproductividades de cada sector. Asimismo, usando un modelo similar, Pratapy Urrutia (2012) analizan el papel de las fricciones financieras, en particular derestricciones al crédito en el mercado de los bienes intermedios. Los autoresintroducen un choque inesperado en la tasa de interés y encuentran que lasfricciones financieras provocan una caída endógena en la PTF que logra expli-car la mitad de la caída observada en la productividad y el 74 % de la caída enel PIB.

El presente trabajo trata de contribuir a la literatura existente de crisis es-tudiando las consecuencias de los distintos choques que una economía puedeenfrentar. En este sentido, se estudian los choques a las productividades, al cos-to de financiamiento internacional y a la demanda externa. En la literatura seencuentra ya muy documentado como las variaciones en la productividad sonlas principales fuentes que contribuyen en la variación de la producción. Sinembargo, este trabajo aporta un marco en el cual, las diferencias entre las varia-ciones sectoriales provienen de distintas fuentes. Específicamente, los choquesa la tasa de interés y a la demanda externa provocan variaciones distintas quelas producidas por el choque a la productividad, y son capaces de explicar lasdiferencias sectoriales.

3. MODELO

En esta sección se describe el modelo utilizado para analizar a la economíamexicana durante los episodios de crisis. Ya que se busca estudiar el compor-


tamiento diferenciado de las crisis sobre los bienes comerciables y los no co-merciables, el marco natural de dicho estudio es un modelo de dos sectores deuna economía pequeña y abierta. A este se incorpora además un sector externoque consume bienes nacionales para poder analizar los choques en la demandaexterna. En particular, se utiliza un modelo de equilibrio general dinámico concertidumbre de dos sectores con costos de ajuste en el capital y en la asignaciónde trabajo entre sectores.

La economía consta de un agente representativo el cual maximiza su utili-dad sujeto a una restricción presupuestal. El agente tiene una unidad de trabajoinelástica la cual ofrece entre los dos sectores y renta su acervo de capital a lasempresas. El problema del agente es el siguiente:

maxcTt ,cNt ,kt+1,lDt ,lNt ,bt+1

∞∑i=0

βt[

1

φ

(εCρTt + (1 − ε)CρNt

)φρ

]s.a.

PTtCTt + PNtCNt + PTt(Kt+1 − (1 − δ)Kt) + bt+1 = wDt lDt + wNt lNt +

rktKt + (1 + rt)bt − PTtΩK2

(Kt+1−Kt

Kt

)2

− PTtΩL2

(lDt − lDt−1

)2+ Tt

Donde CTt es el consumo de bienes comerciables, CNt es el consumo debienes no comerciables, lDt es la cantidad de trabajo ofrecida en el sector inter-medio local (insumo en la producción del bien comerciable), lNt es la cantidadde trabajo ofrecida en el sector no comerciable, PTt es el precio de los bienescomerciables, PNt es el precio de los bienes no comerciables, Kt es el acervo decapital en términos del bien comerciable, δ es la tasa de depreciación del capi-tal, bt es la cantidad de bonos internacionales que tiene el agente en términosdel bien internacional, Tt son transferencias del gobierno de suma fija expresa-das en términos del bien internacional, Ωk se refiere a la magnitud de los costosde ajuste en el capital y Ωl se refiere a la magnitud de los costos de ajuste delempleo entre sectores.2

La existencia de fricciones en la asignación de trabajo hace posible que lossalarios en equilibrio sean distintos entre sectores, por esta razón wd es el sala-rio en el sector intermedio local y wNt es el salario en el sector no comerciable.En cuanto al precio de renta del capital, rKt sí será el mismo entre sectores y rtcorresponde a la tasa de interés internacional determinada exógenamente. Enla restricción presupuestal, todos los precios son relativos al bien internacionalel cual está normalizado a 1; por esta razón, la tenencia de bonos internacionalno muestra ningún precio en la restricción. Adicionalmente, tanto el capital co-mo los costos de ajuste de ambos insumos están expresados en términos delbien comercial mientras que los precios de los insumos están en términos delbien internacional.

2En Meza y Urrutia (2011), se describe la posible intuición detrás del costo de ajuste en la asigna-ción laboral. Principalmente, se refiere a las distintas habilidades necesarias en el trabajo específicoa cada sector.


El agente recibe utilidad de consumir bienes comerciables y no comercia-bles. Por ello su ingreso, que proviene del salario y la cantidad ofrecida detrabajo en cada sector y del ingreso de rentar el capital, se utiliza para consu-mir bienes, para pagar los costos de ajuste de los insumos y para pagar el costo(beneficio) neto de la tenencia de bonos.

En cuanto a la producción, la economía consta de cuatro bienes: un bien nocomerciable, un bien intermedio local, un bien internacional y un bien de con-sumo comerciable. El bien no comerciable se produce utilizando una funcióntipo Cobb-Douglas y utiliza como insumos capital y trabajo. El problema de laempresa representativa del bien no comerciable en cada periodo es el siguiente:

maxkNt ,lNt

PNtKNt(ANt lNt)(1−αN ) − rKtKNt − wNt lNt

Donde ANtes la productividad del sector no comerciable.El problema de la empresa del bien intermedio local es similar al del bien

no comerciable:

maxkDt ,lDt

PDtKDt(ADt lDt)(1−αD) − rDtKDt − wDt lDt

DondeADt es la productividad del sector intermedio local y PDt es el preciorelativo de los bienes intermedios locales.

La empresa que produce el bien comerciable agrega bienes intermedios lo-cales y bienes internacionales (importados):

maxxDt ,mt

PTtM(µxzDt + (1 − µ)mzt )

1z − PDtxDt − (1 + τt)mt

Donde M es la productividad del sector comerciable final, xDt es la canti-dad de bienes intermedios locales usados, mt es la cantidad de bienes interna-cionales o importaciones usadas y τt son impuestos a la importación de bienes.

Asimismo, existe una demanda externa por bienes comerciables:

xFt = Dt[(1 + τFt)PTt ]−11−z

Donde xFt es la cantidad de bienes comerciables exportados, Dt es el nivelde la demanda externa y τFt son impuestos extranjeros a la importación debienes.

Finalmente, la balanza de pagos se expresa de la siguiente manera:

mt + bt+1 = PTtxFt + (1 + rt)bt

Para resolver el modelo se calculan las condiciones de primer orden delproblema del agente y de las empresas. La definición del equilibrio competitivoy las ecuaciones que lo describen se presentan en el Anexo.


4. ANÁLISIS DE LAS CRISIS

En esta sección se calibran los parámetros y se resuelve el modelo para ob-tener resultados sobre el comportamiento de las variables durante los distintoschoques de cada crisis; se analiza también la dinámica de las secuencias queresuelven el modelo al introducir distintos choques. En particular, un episo-dio de crisis completo consistirá en choques a las productividades, a la tasa deinterés y a la demanda externa. De esta forma, se parte de un estado estacio-nario que es sorprendido por los choques y se simulan secuencias tales que sealcanza nuevamente un estado estacionario en 100 periodos.

4.1 EPISODIO 1994-95

Para analizar la primera crisis, se calibró el modelo suponiendo que en 1994la economía se encontraba en un estado estacionario. La Tabla 1 describe losparámetros utilizados.

TABLA 1: PARÁMETROS DE CALIBRACIÓN PARA CRISIS 1994-95

r β ε δ αD αN z ΩL ΩK0.10 0.909 0.23 0.04 0.45 0.30 0.5 140 40

M AD AN D µ φ ρ τ τF2 3 2 20 0.75 0.5 -1 0.1 0.1

Para la calibración de los parámetros, se siguió principalmente la metodo-logía empleada en Kehoe y Ruhl (2009) y de Meza y Urrutia (2011). Se utilizóuna tasa de interés internacional de 10 %, similar a lo que se obtendría prome-diando la tasa de notas del tesoro de Estados Unidos a tres meses durante ladécada de 1980 y los primeros años de la década de 1990. Se calibró la tasa dedescuento de tal forma que la economía se encuentre en un estado estaciona-rio. En cuanto al parámetro de consumo ε, se utilizó el estimado en Kehoe yRuhl (2009) utilizando la proporción de consumo comerciable respecto al totalde consumo. La tasa de depreciación se calibró utilizando el método de inven-tarios perpetuos con series históricas de la formación fija bruta de capital y delPIB de cuentas nacionales de México. Los coeficientes αD y αN son comunesen la literatura (Meza y Urrutia, 2011); en particular se utiliza un coeficientemayor para los bienes intermedios lo que implica que dichos bienes – que sonutilizados como insumos para la producción del bien comerciable – son másintensivos en el uso de capital.

En cuanto a la elasticidad de sustitución entre bienes importados y el bienintermedio local existe un gran rango de valores usados en la literatura; en esteestudio se fijó en 2 siguiendo a Kehoe y Ruhl (2009). Ruhl (2008) y Yiu (2003)analizan los efectos de los distintos valores de la elasticidad y encuentran quemodelos de ciclos económicos reales requieren una elasticidad pequeña (alre-dedor de 2) para generar la volatilidad en los términos de intercambio y lacorrelación negativa esperada entre los términos de intercambio y la balanza


comercial observada en los datos. Sin embargo, también encuentran que mode-los de crecimiento necesitan grandes elasticidades (alrededor de 10) para poderexplicar los movimientos en las cuentas de comercio observados por cambiosestructurales en las economías (como podría ser la apertura comercial de Mé-xico en 1995). Por ello se estima adecuado emplear un valor consistente conciclos económicos reales.

Las magnitudes de los costos de ajuste en los insumos, ΩK y ΩL se fijaronen niveles similares a los utilizados por Meza y Urrutia (2011). Los parámetrosde productividad sectorial, demanda externa, y tasas de impuestos se fijaron enlos niveles calibrados por Kehoe y Ruhl (2009). Se fijó la elasticidad de sustitu-ción entre bienes comerciables y no comerciables en el consumo en 0.5 (comúnen la literatura) lo que implica una ρ de -1. La elasticidad de sustitución inter-temporal se fijó en 0.5 por lo que el parámetro φ toma el valor de -1. En cuantoal parámetro de comercio µ, se calibró a partir de la matriz de insumo-productopublicada por el INEGI para 1980. Específicamente, se estimó como

µ

1 − µ=

PD1980

1 + τ1980

(m1980

xD1980

)z−1

y se normalizaron los precios a 1. Por último, para el nivel de la deuda deestado estacionario, bss, se utilizó -9 siguiendo a Kehoe y Ruhl (2009).

Una vez calibrados los parámetros, se realizaron simulaciones de la econo-mía. Como ya se mencionó, en el modelo se partió de un estado estacionariopor lo cual las condiciones iniciales se fijaron en los valores del estado estacio-nario. Posteriormente, se introdujeron choques y se consideró una trayectoriaen la que en 100 periodos la economía alcanza nuevamente un estado estacio-nario (siguiendo el método de solución descrito en Conesa et al., 2007).

En el episodio 1994-95, la economía mexicana experimentó un choque per-manente en los aranceles y tres choques temporales en las productividades, enla tasa de interés y en la demanda externa. En los choques temporales la caídaen la productividad factorial total de cada sector se calibró de tal forma queambas productividades cayeran en la misma magnitud y la caída en la produc-ción total igualara la caída en la producción por trabajador observada en losdatos. Para la tasa de interés se utilizó la tasa de las notas del tesoro de EstadosUnidos para tres meses más el cambio en una prima de riesgo para la economíamexicana estimada con el índice EMBI para México. La variación en la deman-da externa se calibró de forma tal que se igualara la variación simulada en lasexportaciones con la variación observada en los datos. Por último, el choquepermanente, consecuencia de la entrada en vigor del Tratado de Libre Comer-cio de América del Norte, se simuló como una eliminación en los aranceles alas importaciones mexicanas. La Tabla 2 resume las magnitudes de los choquessimulados.

Las siguientes gráficas muestran la evolución de las trayectorias simuladaspor el modelo al introducir todos los choques simultáneamente y la trayectoriaobservada en los datos.


Año PTFsectorial

Tasa deinterés

Demandaexterna Aranceles

1994 100 10 % 100 10 %1995 87.6 15.9 % 91.0 0 %1996 100 11.6 % 100 0 %

1997 en adelante 100 10 % 100 0 %TABLA 2: MAGNITUD DE LOS CHOQUES 1994-95

(A) PIB (B) PRODUCCIÓN SECTORIAL

(C) COMERCIO EXTERIOR (D) INVERSIÓN

(E) TIPO DE CAMBIO REAL

FIGURA 3: COMPARACIÓN DATOS Y MODELO. T=1995. ÍNDICE 1994=100.

Por la forma en que se calibró el choque a la productividad sectorial, la tra-yectoria del PIB generado por el modelo muestra la misma caída que el PIBobservado cómo se puede ver en la gráfica 3a. En cuanto a la producción sec-


torial, la gráfica 3b muestra como el modelo es capaz de replicar la diferenciaobservada en los datos de la magnitud de la variación sectorial. En la gráfica3c, las cuentas de comercio exterior que resultan del modelo logran replicarbien los datos observados. El movimiento de las exportaciones está determi-nado por la forma de calibrar el choque de la demanda externa, mientras queel movimiento de las importaciones se determinó endógenamente. Por último,la gráfica 3d y 3e muestran cómo la inversión obtenida del modelo tiene unacaída mayor a la observada en los datos y el tipo de cambio real muestra unaapreciación de menor magnitud que los datos observados.

Los efectos de los choques introducidos simultáneamente logran explicarla diferencia entre las magnitudes de la variación de la producción sectorial.Sin embargo, para analizar las contribuciones y los efectos de cada choque, seobtuvieron las trayectorias que resultan del modelo al introducir un choque ala vez. La Tabla 3a y 3b muestran las variaciones en las variables de interésocasionadas por cada choque.

(A) VARIACIONES PORCENTUALES ANUALES EN VARIABLES DE INTERÉS,1994-95

(B) CONTRIBUCIÓN A LA VARIACIÓN PORCENTUAL ANUAL DE VARIABLES DEINTERÉS, 1994-95

TABLA 3: RESULTADOS PERÍODO 1994-95


En las Tablas 3a y 3b se puede ver que el único choque que no contribuye aque la producción de no comerciables disminuya más que la de comerciableses el choque a la tasa de interés. Sin embargo, el efecto del aumento en la tasade interés es tan grande que es necesario introducir el cambio – estructural ypermanente – que resultó de la entrada en vigor del TLCAN en 1995. El mo-delo hace un buen trabajo en replicar la producción sectorial, el consumo, lainversión y las cuentas externas; sin embargo, en cuanto a la composición delempleo, todos los choques generan una recomposición contraria a lo que seobserva en los datos. Es decir, en el modelo, el empleo en el sector comerciableaumenta relativo al empleo no comercial debido a los choques mientras que enlos datos éste disminuye.

4.2 EPISODIO 2008-09

Como se analizó en la subsección anterior, el modelo realiza un buen tra-bajo en explicar la diferencia en las variaciones durante la crisis de 1994-95. Enesta subsección se utiliza el mismo modelo para analizar el episodio 2008-09.La calibración es similar a la utilizada en el primer episodio excepto que en estecaso se actualizaron algunos datos con la intención de modelar a la economíaen 2008 como si estuviera en un estado estacionario. La tasa de interés inter-nacional se fijó en 6.3 % y se calibró el correspondiente factor de descuento β.Es importante notar que, entre episodios, el factor de descuento es distinto de-bido a que se analizan las crisis desde una perspectiva de estado estacionario.De esta manera, es necesaria la recalibración de dicho parámetro para que laeconomía converja al correspondiente estado estacionario en cada episodio decrisis. Los niveles de los impuestos se fijaron en 0 debido a la plena entrada envigor del TLCAN. Finalmente, utilizando la matriz insumo-producto de INEGImás reciente (base 2008) se recalibró el parámetro de comercio µ. Con ello seobtuvo una menor µ lo cual podría deberse a una mayor integración en lascadenas productivas entre Estados Unidos y México. En el modelo esto se tra-duce en la necesidad de utilizar una mayor proporción de importaciones en laproducción del bien comerciable. La Tabla 4 muestra los parámetros utilizadosen la calibración de la crisis 2008-09.

r β ε δ αD αN z ΩL ΩK0.063 0.941 0.23 0.04 0.45 0.30 0.5 140 40

M AD AN D µ φ ρ τ τF2 3 2 20 0.64 0.5 -1 0 0

TABLA 4: PARÁMETROS DE CALIBRACIÓN PARA CRISIS 2008-09

Nuevamente se realizaron las simulaciones de la economía cómo en el epi-sodio anterior. Durante 2008-09, la economía experimentó tres choques tempo-rales: en las productividades, en la tasa de interés y en la demanda externa. LaTabla 5 describe las magnitudes de dichos choques.


Año PTFsectorial

Tasa deinterés

Demandaexterna

2008 100 6.3 % 1002009 93.9 8.8 % 79.52010 100 7.2 % 100 %

2011 en adelante 100 6.3 % 100 %TABLA 5: MAGNITUD DE LOS CHOQUES 2008-09

De manera análoga a la subsección anterior, las siguientes gráficas mues-tran la evolución de las trayectorias simuladas por el modelo al introducir to-dos los choques simultáneamente y la trayectoria observada en los datos.

(A) PIB (B) PRODUCCIÓN SECTORIAL

(C) COMERCIO EXTERIOR (D) INVERSIÓN

(E) TIPO DE CAMBIO REAL

FIGURA 4: COMPARACIÓN DATOS Y MODELO. T=2009. ÍNDICE 2008=100.


Por la forma en que se calibró el choque a la productividad sectorial, la tra-yectoria del PIB generado por el modelo muestra la misma caída que el PIBobservado cómo se puede ver en la gráfica 4a. En cuanto a la producción sec-torial, la gráfica 4b muestra cómo el modelo es capaz de replicar la diferenciaobservada en los datos de la magnitud de la variación sectorial. En la gráfica 4clas cuentas de comercio exterior que resultan del modelo logran replicar bienlos datos observados. El movimiento de las exportaciones corresponde a la for-ma de calibrar el choque de la demanda externa, mientras que el movimientode las importaciones se determinó endógenamente. Por último, las gráficas 4cy 4d, muestran cómo la inversión obtenida del modelo tiene una caída menora la observada en los datos y el tipo de cambio real muestra una apreciaciónaunque de menor magnitud que los datos observados.

Al igual que en el episodio anterior, los efectos de los choques introduci-dos simultáneamente logran explicar la diferencia entre las magnitudes de lavariación de la producción sectorial. Asimismo, para analizar las contribucio-nes y los efectos de cada choque, se obtuvieron las trayectorias que resultandel modelo al introducir un choque a la vez. Las tablas 6a y 6b muestran lasvariaciones en las variables de interés ocasionadas por cada choque:


(A) VARIACIONES PORCENTUALES ANUALES EN VARIABLES DE INTERÉS,2008-09

(B) CONTRIBUCIÓN A LA VARIACIÓN PORCENTUAL ANUAL DE VARIABLES DEINTERÉS, 2008-09

TABLA 6: RESULTADOS PERÍODO 2008-09

En la tabla 6a se puede ver cómo el choque a las productividades en elmodelo provocaría que las magnitudes en las variaciones sectoriales fuerancontrarias a las observadas. Es decir, que la producción de bienes comercia-bles disminuyera menos que la de bienes no comerciables. Por esta razón, esnecesario introducir los choques a la demanda externa y a la tasa de interés in-ternacional. Al simular simultáneamente el choque en las PTF’s y en la deman-da externa, la solución del modelo implica una expansión de gran magnituden la inversión – un resultado contra intuitivo. La causa de dicho resultado esprecisamente el efecto que tiene la demanda externa sobre la producción local.


La economía destina recursos para satisfacer una demanda externa exógena ycuando ésta disminuye, parte de dichos recursos se liberan en cierta medida ypasan a ser destinados a la inversión. Por otro lado, al simular simultáneamen-te el choque en las PTF’s y en la tasa de interés, la inversión disminuye perolas exportaciones aumentan. La intuición detrás de este resultado es que la em-presa que produce el bien final de consumo agrega importaciones con bieneslocales, y al aumentar la tasa de interés se vuelve más costoso para el agenteendeudarse para pagar las importaciones por lo que decide exportar más bie-nes. De esta forma, los choques que contribuyen a que tanto la inversión y lasexportaciones varíen de la manera esperada y que las variaciones en la produc-ción sectorial tengan las magnitudes observadas son los choques a la demandaexterna y a la tasa de interés. Finalmente, en cuanto al empleo, el modelo no escapaz de replicar los cambios en la composición observada. El modelo prediceque el empleo en el sector comercial aumentará relativo al empleo no comer-cial mientras que en los datos se observa lo contrario. Cambios en la oferta detrabajo, la cual es completamente inelástica, podrían resultar en prediccionesmás cercanas a lo observado en el mercado laboral. Sin embargo, el presentetrabajo se concentra en dar evidencia de las diferencias entre las variaciones dela producción por sector.

5. CONCLUSIONES

En este trabajo se utilizó un modelo de equilibrio general dinámico sin in-certidumbre calibrado con datos de México para analizar el comportamientode ciertas variables durante un episodio de crisis. En particular, se estudiaronlas crisis de 1994-95 y de 2008-09. Los resultados que se obtienen después deresolver el modelo logran replicar las diferencias entre las variaciones de laproducción por sector.

En cuanto a la descomposición de las variaciones, se encontró que duranteambas crisis la caída en el PIB agregado se explica en su mayoría por la caídaen las productividades sectoriales. Sin embargo, cuando se analizan los efectospor separado, en la crisis de 2008-09, la mayor caída en la producción de losbienes comerciables relativa a la de los bienes no comerciables es explicada porlos choques en la tasa de interés internacional y en la demanda externa. Asi-mismo, en ambos episodios, los choques a la demanda externa y a la tasa deinterés tienen efectos considerables sobre la inversión, el tipo de cambio real ylas cuentas externas. En cuanto al empleo, el modelo no es capaz de replicar losmovimientos en la composición sectorial. Una posible extensión al trabajo po-dría incluir cierta movilidad del empleo para poder capturar los movimientosmigratorios entre México y Estados Unidos. Específicamente, durante 1994-95se registró un creciente flujo migratorio mientras que durante 2008-09 la mi-gración entre México y Estados Unidos se estancó.

Este trabajo contribuye al análisis de las crisis y a la descomposición delos distintos choques que han experimentado economías pequeñas y abiertas.En la literatura, todavía existe la incógnita de qué está detrás de los choques


a la productividad y de cómo se relacionan éstos con otros tipos de choques.Posibles futuras líneas de investigación podrían venir en el sentido de aclarardicha incógnita.

REFERENCIAS

Chiquiar, D., y Ramos-Francia, M. (2005). Trade and business-cycle synchro-nization: Evidence from mexican and us manufacturing industries. TheNorth American Journal of Economics and Finance, 16(2), 187–216.

Conesa, J. C., Kehoe, T. J., y Ruhl, K. J. (2007). Modeling great depressions: thedepression in finland in the 1990s.

Instituto Nacional de Estadística Geografía e Información. (1980). Sistema decuentas nacionales: Matriz insumo producto 1980.

Instituto Nacional de Estadística Geografía e Información. (2008). Sistema decuentas nacionales: Matriz insumo producto 2008.

Kehoe, T. J., y Ruhl, K. J. (2009). Sudden stops, sectoral reallocations, and thereal exchange rate. Journal of Development Economics, 89(2), 235–249.

Mendoza, E. G. (2010). Sudden stops, financial crises, and leverage. The Ame-rican Economic Review, 100(5), 1941–1966.

Meza, F., y Urrutia, C. (2011). Financial liberalization, structural change, andreal exchange rate appreciations. Journal of International Economics, 85(2),317–328.

Pratap, S., y Urrutia, C. (2012). Financial frictions and total factor producti-vity: Accounting for the real effects of financial crises. Review of EconomicDynamics, 15(3), 336–358.

Yi, K.-M. (2003). Can vertical specialization explain the growth of world trade?Journal of Political Economy, 111(1), 52–102.


ANEXO

Un equilibrio general competitivo para el modelo presentado en la sección3 consiste en un conjunto de secuencias para las cantidades

CTt , , CNt , lDt , lNt ,Kt+1,KDt+1,KNt+1

,mt, bt+1, xDt , xFt

y para los precios

PTt , PNt , PDt , rKt , wDt , wNt

tales que

I) Dados KD0 > 0,KN0 > 0, PTt , PNt , rKt , wDt y wNt , las secuencias deCTt , CNt , lDt , lNt ,Kt+1 y bt+1 resuelven el problema del agente represen-tativo.

II) En cada periodo t, dados PTt , PNt , PDt , rKt , wDt y wNt , los valores de elDt , lNt ,KDt ,KNt ,mt y xDt resuelven el problema de las tres empresasrepresentativas: productora de bienes no comerciables, bienes intermediosy de bienes comerciables. Adicionalmente, los beneficios de las empresasson iguales a cero.

III) En cada periodo t, el mercado de capital, de empleo, de bienes comercia-bles, de bienes no comerciables, de exportaciones y la balanza de pagos sevacían.

Las ecuaciones que caracterizan la solución del modelo son:Ecuación de Euler

β

(εCρTt+1

+ (1 − ε)CρNt+1

εCρTt + (1 − ε)CρNt

)φρ−1(

CTt+1

CTt

)ρ−1

=PTt+1

PTt(1 + rt+1)(5.1)

Sustitución intratemporal de bienes

ε

1 − ε

(CTtCNt

)ρ−1

=PTtPNt

(5.2)

Equilibrio precio de empleo entre sectores

(1 + rt+1)[wDt − wNt − PTtΩL(lDt − lDt−1)] = −PTt+1

ΩL(lDt+1− lDt) (5.3)

No arbitraje entre activo financiero y capital

rKt+1+PTt+1

[(1 − δ) +

ΩkKt+2

K2t+1

(Kt+2 −Kt+1

Kt+1

)]= PTt(1+rt+1)

[1 +

ΩkKt

(Kt+1

Kt

)](5.4)


Ecuaciones de equilibrio de las empresas representativas:

rKt = αDPDtADt1−αDKaD−1

Dtl1−αDDt

(5.5)

wDt = (1 − αD)PDtADt1−αDKaD

Dtl−αDDt

(5.6)

rKt = αNPNtANt1−αNKaN−1

Ntl1−αNNt

(5.7)

wNt = (1 − αN )PNtANt1−αNKaN

Ntl−αNNt

(5.8)

PDt1 + τt

(mt

xDt

)z−1

=µ

1 − µ(5.9)

PTtM(µxzDt + (1 − µ)mzt )

1z−1µxz−1

Dt= PDt (5.10)

Condiciones de vaciado de mercado

xFt = Dt[(1 + τFt)PTt ]−11−z (5.11)

xDt = KDt(ADt lDt)(1−αD) (5.12)

cNt = KNt(ANt lNt)(1−αN ) (5.13)

CTt + xFt +Kt+1 − (1 − δ)Kt +ΩK2

(Kt+1 −Kt

Kt)2 +

ΩL2

(lDt − lDt−1)

)2

= M(µxzDt + (1 − µ)mzt )

1z (5.14)

mt + bt+1 = PTtxFt + (1 + rt)bt (5.15)

Kt = KDt +KNt (5.16)

1 = LDt + LNt (5.17)

BINARY DECISION MAKING WITH

IMPERFECT INFORMATION∗

DIEGO JAVIER JIMÉNEZ HERNÁNDEZ†

This paper develops a framework where an individual has to make a binary de-cision, but finds it difficult to assess the true state of the world. In the model, statescan be described completely by finite sequences of zeros and ones. By observing par-tial information, the decision maker can reveal the true state of certain bits, whichhe uses to update his prior beliefs about the true state of the world. I describe thedata such a model would generate and show that which data can be characterizedby the model. In particular, the datasets need to abide by an axiom that eliminateschoice reversals. Later I introduce the idea of distance to rationalizability; this mea-sure describes how far a given dataset is from being rationalizable using our model.Finally, the computational process to generate such distance and to recover prefer-ences is shown using a dataset from Mexican labor courts.

JEL CODES: D83, D84, C11, C61.KEYWORDS: binary choice, choice under uncertainty.

Most of the standard economic models of decision making have, as an un-derlying assumption, that information will be captured and taken into con-sideration as it becomes available. These models discard cognition costs andreadily assume that any feasible information can be understood at a very low,negligible, cost. However, this is not always the case. Information can be diffi-cult to assess and therefore hard to understand. A decision maker may find itoptimal not to process every piece of information available if this is the case.1

Take for example an individual assessing whether it will rain or not. Amyriad of factors may alter the weather and considering all of them can be timeconsuming. Instead of observing the whole set of factors, this individual maydecide to observe a subset of those (s)he considers the most important onesand decide according to what this factors say about the state of the world. Forexample, this decision maker may decide to observe the sky to assess if thereare any clouds. If there are no clouds, the decision maker can conjecture thatthere will be no rain. In this situation, the decision maker leaves informationoutside of his/her consideration, and (s)he finds it optimal to do so. By lookingat a subset of information, the consumer may not end up observing the true

∗This article relies on the work produced by Caplin, Gomberg, and Sadka (2014). It is impor-tant to disclaim that the proof for Theorem 2.1 belongs to said authors and is reproduced here forcompleteness.†I am indebted to Fernando Payro and Kin Gutiérrez for their support and suggestions during

the project. I am also grateful to Andrei Gomberg, Joyce Sadka, Diego Domínguez and EnriqueSeira for their suggestions. Furthermore, I am grateful to Brianey Ramirez for her invaluable helpduring the project.

1We will abstract from the fact that if cognition is costly, then optimizing cognition should alsobe costly. This is called an infinite regress problem. For more information on this approach, seeLipman (1991).

20 BINARY DECISION MAKING WITH IMPERFECT INFORMATION

state; nonetheless, this state can be approximated properly by the informationdecided to be observed. Moreover, some information can be more costly thanother. In this case, the individual may decide to look out of the window as thecost of doing that is almost negligible, but not to use an electric heated wireto measure the liquid water content in the air, since the latter measure may becostly, both in terms of cognition and in terms of acquiring the equipment todo so.2

Information may also be costly because there can be a shrinking benefit at-tainable. Take for example a situation where an investor has to decide whetherto place money in a particular company or not. In this situation deciding beforeothers can lead to greater returns, yet it also poses a threat of greater loses. Anaverse decision maker could prefer to sacrifice part of the return to make it cer-tain. Think about a situation where the true state of the return can become lessdifficult to predict as time passes and that the prior beliefs of the best and worstcase scenarios about the return bound its future possible realizations.3 If this isthe case, the market will adjust the return. In the aforementioned situation, thedecision maker may find it optimal to choose with the information available ata particular moment of time. There is a clear trade off between waiting to getinformed and choosing on the go. By waiting, an uninformed individual maybe able to gather more information and take a reasoned decision. However, astime passes, waiting leads to less benefits from the decision taken since, bothby shrinking benefits and because people tend to discount the future.

In this paper, I introduce a bayesian model where a decision maker hasto make a binary decision, but cannot properly identify the true state of theworld. I characterize the dataset that would be generated by such a modeland show how to, when possible, recover the parameters needed to generatethe dataset using the model. After the characterization, I look at a real datasetand measure how far the dataset is from one that can be rationalized usingthis model. This method is applied to a problem that uses data coming formMexican labor courts. However, these procedure can be applied to any binarydecision where we observe the decision and the information available. FinallyI propose an extension to the model that includes a specific cost function and Iderive how would a dataset generated by this model look like.

This paper relates to different literature aspects. The first one studies (ratio-nal) inattention. Usually, the models in this literature incorporate a rationaleto argue why it is optimal not to select some alternatives even when they arefeasible. Simon (1955) was one of the first to introduce rational inattention

2Costly information acquisition may happen because information itself can be difficult to un-derstand. For example, Seira and Elizondo (2014) did a credit card experiment where they testedhow well did seven different personalized messages -which we can call free information- workedfor inducing a change in the cosumer’s credit card debt on already indebted consumers. Easiermessages, such as a peer comparisons, were more effective in reducing debt than other advisingmessages which, even though had more information, where more difficult to understand.

3If this is not the case, then the situation described does not need to hold. On the contrary,there are situations were more information can lead to an increase in uncertainty. For example, thediscovery of a new oil drilling technique can represent an opportunity for greater, riskier returnsinstead of a certain loss of the investment.

DIEGO JIMÉNEZ HERNÁNDEZ 21

to the story. Consumers have in mind an aspirational threshold, which theyuse to decide on sequential alternatives. If this is the case, a dominating al-ternative may come unseen if it is considered after one of the alternatives thatfulfill the aspirational value. Furthermore, Masatlioglu et al. (2012) establishesa decision-making process that enables the authors to study situations whereconsumers may not consider all alternatives. They introduce attention filters,subsets of all feasible alternatives, to the standard choice model. Another ex-ample is Lester et al. (2012), who rationalizes the exclusion of evidence in trials.The authors argue that if jurors face a cognitive cost of processing information,they may end up choosing alternatives that do not maximize common welfare,but that minimize the cost of processing information instead. If this is the case,the judge may find it optimal to exclude some evidence, so that jurors focusonly on the most informative elements.

A paper that closely relates to this one is Caplin and Dean (2014). In this pa-per, the authors develop two axioms that need to hold over a stochastic choicedata set in order to be able to recover the parameters that generated this typeof behavior. Given a cost function, their model yields a set of probabilities overchoices under each state of the world. This paper differs from Caplin and Dean(2014) in the fact that we assume, for the model that incorporates informationacquisition, the cost function varies while they assume a fixed cost functionthroughout their paper.

There is also a strand of literature that has studied information acquisitionempirically. First, Caplin et al. (2011) ran an experiment with undergraduatestudents where they find evidence that consumers behave in à-la-Simon (1955)way. By not looking at all alternatives, the subjects observed forwent importantprofitable opportunities that could have increased their payoffs had they takenthe time to observe them. Second, Gabaix et al. (2006) tested the predictioncapability of a direct cognition model. This model takes every set of searchoperations as if it were the last opportunity to look at alternatives. The authorsargued that the model is capable of predicting simple decisions better than thefully rational model. Moreover, when complex problems are analyzed, so thatthere is no direct solution to a rational model, the direct cognition model isable to predict the pattern of information acquisition, both within each gameand across games. Third, Kling et al. (2012) empirically reject the idea thatcomparison frictions are inconsequential to the decision making by running anexperiment on Medicare prescription drug plans.

This paper provides two contributions. While Caplin, Gomberg, and Sadka(2014) relies heavily on existence theorems, the purpose of this paper is to ex-plictly look for a set parameters that rationalize our choice process and the sizeof the set of possible solutions. This paper develops an algorithm which isused to explictly state a solution. The second contribution of this paper is thatit aims to elicit a second model which encompasses the first model for furtherwork. The model proposed can be applied both in a theoretical and an empir-ical approach. The rest of the paper comes as following, Section 1 introducesthe model and the notation. It also introduces restrictions that the data shouldabide by in order for solution to exists. Section 2 characterizes the recovery of


parameters and, if no such recovery is possible, defines the distance to rational-izability as a measure of how far the data is from one that is rationalizable. Thissection also introduces a measure of the size of the set of all possible solutionswhen the data is rationalizable. Section 3 shows the application of the modelto a real dataset generated from Mexican labor courts. Finally, Section 4 pro-poses a model that will be explored for further work and Section 5 discussesthe policy implications and concludes the paper.

1. A MODEL FOR INFORMATION IN A CONTEXT OF BINARYDECISION-MAKING

1.1 BASIC MODEL

Following Caplin et al. (2014), the decision maker (DM) is a judge that has totake a binary decision -in this case a verdict-, (g) or (i).4 The utility from thisdecision is state dependent. A state is a sequence of information bits, eachof which can take the value of 0 or 1. In this sense, each of the N states ofthe world is completely characterized by M binary bits of information, withωji ∈ 0, 1 denoting the value of bit j on state i. With this structure, we can

denote each state ωi ∈ Ω as a vector of dimension M so that ωi = (ω1i , . . . ω

Mi ).5

The state space Ω = (ω1, . . . , ωN ) is a finite measurable state space. Let G and Idenote a partition of the state space (ie. G ∩ I = ∅ and G ∪ I = Ω). In the casefor judges, G denotes the set of those states where the defendant is guilty andI denotes those states where the defendant is innocent.

Let ω∗ = (w∗1 , . . . , ω∗N ) denote the true state of the world. Suppose that the

decision maker has a prior belief β ∈ ∆#(Ω) = ∆N which assigns probabilitiesto each possible state. In this case, we will denote β(G) as the probability thatthe judge assigns to being in a state where the defendant is guilty. Suppose that,for each realization of the choice procedure, there is a set of evidence E ⊂ Ω.6

Evidence will reveal the true state for a certain number of bits. Given evidenceE, the judge updates his prior following Bayes’ Law. We will denote β(Y |E)to be the probability that the judge assigns to the partition Y ∈ G, I givenevidence E (ie. β(Y |E) = β(E ∩ Y )/β(Y )). Suppose that the actual utilityof the DM is given by Ug

G = U iI = 0, Ug

I = −q and U iG = −(1 − q) where

q ∈ (0, 1) denotes the aversion of deciding g when the true state is I . Thesuperscript denotes the judge decision and the subscript denotes the partitionfor the true state of the world.7 Therefore, it would be optimal for the judge to

4To impose clarity, I will denote the decision maker using a male pronoun throughout the restof this exposition.

5This structure places a clear restriction in the cardinality of the state space since N ≤ 2M . Thestate space can only be, at most, all of the possible combinations of bits. We will assume this re-striction is fulfilled with equality to keep the model in simple terms. However, this generalizationis straightforward by limiting prior beliefs to exclude those states that ex-ante are deemed to beimpossible.

6Evidence can be thought also as a number of bits for which we know their true state, and canbe used to reduce the number of possible states that the decision maker considers to be possible.

7Both UgI and U i

G can be thought of as the type I and II errors that the decision maker tries


rule guilty (g) if β(G|E) > q and innocent (i) if β(G|E) < q. I will not considerthe possibility of indifference from now on. In this case, any mixed strategywill give the same expected utility to the DM.8

The structure imposed in this model allows the DM to know some informa-tion about the state of the world by observing the evidence. We will assumethat the evidence reflects partial information about the state of the world. Iwill denote E ( P(Ω) to be the set of all possible observed evidences. Giventhe structure of the evidence, we will have that for E ∈ E we can write E ∈0, 1, xM . In this case, E ∈ E having coordinate j equal to x represents that bitj is not present in the evidence. Example 1 is used to explain how will evidencelook like, and, given E, which states of the world the decision maker does notdistinguish.

Example 1. Suppose thatM = 2 so thatN = 4. We will have that Ω = 0, 0, 0, 1, 1, 0, 1, 1.All possible observed evidences will be x, x, x, 1, x, 0, 0, x and 1, x. In thiscase, x, x = Ω; 1, x = 1, 0, 1, 1; and so on.

As mentioned above, the DM has to decide on his verdict. Let f : E → g, idenote his decision ruling, which depends on the evidence the decision makerobserves. In order to maximize utility, this ruling must be consistent with whatwas mentioned above, so that we have:

f(E) =

g if β(G|E) > q

i if β(G|E) < q

1.2 DATA GENERATED

The structure we will talk about is a (single) DM that we observe for severaldifferent sets of evidence. All that we will need to observe is the set LO =(Ei, f(Ei))

Ii=1, that we will call the legal output. What we do not observe is

the true (complete) state ω∗, the prior belief over the states of the world, β,the guilty set in the mind of the decision maker, G, and his conviction bias,q. There will be two important objectives with the dataset. First, we will lookon conditions that need to hold so that the dataset can be rationalized by atriplet (β, q,G). Second, we will consider the case where information is notrationalizable by this model and generate a distance to rationalizability.9

to minimize. This interpretation, although not necessarily completely formal, provides solid eco-nomic intuition into the model.

8A reason to ignore this case is that any game where indifference is the main driver is notinteresting as all of what we observe may be following from indifference. A way to approacha problem where the decision maker is seen to observe different alternatives for the same exactgame is one where the DM chooses a probability for each action rather than just one action. Formore information, see Caplin and Dean (2014).

9In this case, distance to rationalizability will not mean that the data is not rationalizable by anytype of model. This distance will only elicit how far is the data from being rationalizable using ourmodel.


Definition 1.1. The legal output LO is rationalizable by (β, q,G) if for all i ∈ I suchthat f(Ei) = g we have that β(G|Ei) > q and for all i ∈ I such that f(Ei) = i wehave β(G|Ei) < q.

The first natural question that arises is whether the structure of the modelimposes conditions on the database generated. The answer to this question ispositive. If we observe a database generated by this method, it should be thecase that any information that turns out to be irrelevant to the decision makerneeds to keep the decision consistent. To elucidate the former sentence, wepresent Axiom 1.1. It turns out that this axiom will be a necessary condition onany dataset generated by this choice procedure.

Axiom 1.1 (Additivity). We will say that E and E′ satisfy additivity if f(E) = g,f(E′) = g and E ∩ E′ = ∅ imply f(E ∪ E′) = g or f(E) = i, f(E′) = i andE ∩ E′ = ∅ imply f(E ∪ E′) = i.

In economic terms, the aforementioned axiom enforces consistency of deci-sions along different amounts of information. In particular, additivity impliesthat whenever two non-intersecting partitions of information arise in the sameresult, the union between these sets of information cannot reverse the verdict.By Proposition 1.1, the link between rationalizability and the previous Axiomis provided.

Proposition 1.1. The legal outputLO = (Ei, f(Ei))Ii=1 is rationalizable by (β, q,G)

if for E ∈ LO and E′ ∈ LO such that E ∩E′ = ∅ and f(E) = f(E′) we have that ifE ∪ E′ ∈ LO then f(E) = f(E ∪ E′).

Proof. See Appendix A.

Now, think about the dataset presented in Table 1. This example has 4bits and 4 observations. Since the number of bits is 4, there will be 24 = 16possible states of the world. The states of the world will be denoted ω1 =0, 0, 0, 0, ω2 = 0, 0, 0, 1, ω3 = 0, 0, 1, 0, . . . , ω16 = 1, 1, 1, 1.

TABLE 1: MOCK DATABASES FROM THE MODEL

(A) EXAMPLE 1

Evidence Veredict

xx10 g0x11 gxx0x i1x1x i

(B) EXAMPLE 2

Evidence Veredict

xx10 gxx11 gxx1x i1x1x g

In general, we will use the notation ωi = bin(i − 1) for i = 1, . . . , 2M . Inthis case, bin(i − 1) denotes the binomial expansion of the number i − 1. By


using this approach, we have an unequivocal relation between the number ofthe states and the realization of bits in that state that will be exploited whenimplementing the algorithms.10

2. AN ALGORITHM ON THE RECOVERY OF PARAMETERS

We will now work on a characterization theorem for this choice procedure.Think about a dataset LO = (Ei, f(Ei))

Ii=1 like the ones presented at the Table

1. Caplin, Gomberg, and Sadka (2014) shows that a stronger condition thanour 1.1 additivity axiom, which they call strong additivity, is the sufficient con-dition for rationalizability of the database. What this paper does is to take aslightly different approach. We will map the evidence and the veredicts fromthe way that we currently have it to a I × 2M matrix A that gathers all of theinformation required for the recovery of the parameters. Once mapped to A,we will argue that any vector that maps this matrix to the positive orthant willcharacterize the parameters necessary for rationalizability.

We have I is the number of observations in our legal output and M thenumber of bits. Fix evidence E and take the indicator function 1E(ω), which isequal to one if ω ∈ E and zero otherwise. We will define a(E) = 1 if f(E) = gand a(E) = −1 if f(E) = i. With this into consideration, the matrix A will bedefined as:

AI×2M =

a(E1) · 1E1

(ω1) a(E1) · 1E1(ω2) · · · a(E1) · 1E1

(ω2M )a(E2) · 1E2

(ω1) a(E2) · 1E2(ω2) · · · a(E2) · 1E2

(ω2M )...

.... . .

...a(EI) · 1EI

(ω1) a(EI) · 1EI(ω2) · · · a(EI) · 1EI

(ω2M )

(2.1)

By definition, this matrix will be composed by zeros, ones and minus ones.Furthermore, as already said, it will be the case that if there is a vector x ∈ R2M

such that A · x > 0, then the parameters (β, q,G) will be completely character-ized by x. Using Caplin, Gomberg, and Sadka (2014), I generate the followingtheorem.

Theorem 2.1. For a given legal output LO = (Ei, f(Ei))Ii=1 we generate matrix

AI×2M . If there exists a vector x ∈ R2M

such that A · x > 0, then there exists(β, G, q) that rationalizes the legal output.

Proof. See Appendix A.

By the Theorem 2.1 all that we need to show now is that such x exists.Moreover, it is straightforward to show that this will be a sufficient conditiontoo, so that negating the existence will also imply that the data is not rational-izable. This mathematical condition has a very strong, but simple, economic

10This solution, although unnecessary for the theoretical description, will be very useful on thecomputational approach. The unequivocal relation will be used to construct the matrix A pre-sented in (2.1) in a simple way.


explanation. Whenever A is such that no such x exists, the dataset is experi-encing choice reversals. Now, we will look for conditions on the existence ofthe solution. For this purpose, we will use Theorem 2.2 –which is a direct con-sequence of Fishburn’s (1970) Theorem of the Alternative (Theorem 4.2, page46)– to guarantee such existence.

Theorem 2.2. If z1, z2, . . . , zI ∈ R2M

, then either there is an x ∈ R2M

such thatx · zi > 0 for all k = 1, . . . , I or there are non-negative numbers r1, . . . , rI not all ofwhich equal zero such that

∑Ii=1 rkz

ij = 0 for j = 1, . . . , 2M .

2.1 EXISTENCE OF A SOLUTION AND DISTANCE TO RATIONALIZABILITY

Theorems like 2.2 show us that all that we will need to do is to falsify the secondcondition in order to ensure that the first condition holds. Notice that the firstcondition of this theorem is precisely the existence that we need. Take matrixA and define as zi the i−th row of this matrix. To guarantee existence, one cansolve a maximization problem like the following. Take any r > 0 and compute:

maxr∈RI

‖r‖2 (2.2)

s.t. r ·A = 0

0 ≤ ri ≤ r for all i = 1, . . . , I

Notice that the vector 0 is always feasible so that the problem is non-empty.Moreover, the problem has a solution since it maximizes a continuous functionon a non-empty compact set. For a fixed r, we will now define v(A) = v(A, r)as the value function of this problem. If v(A) > 0 then the second conditionon Theorem 2.2 is fulfilled and therefore the data is not rationalizable. On theother hand, if v(A) = 0 then we end up with the first case of our theoremand existence of a solution is guaranteed. Furthermore, it is straightforward toshow that this result is independent of the choice of r as long as it is strictlygreater than zero.

As mentioned before, v(A) = 0 will be a sufficient condition so that theredoes not exist (β,G, q) that rationalize the dataset. Nonetheless, when thishappens, we can look explicitly on the data to know which observations arethose that produce a choice reversal. Definition 2.1 will introduce the idea of adistance to rationalizability.

Definition 2.1. Define the set LOI as the set of all possible legal outputs with I casefiles. For a legal output LO = (Ei, f(Ei))

Ii=1, define d : LOI → 0, 1, . . . , I as:

d(LO) = I −maxJ

[#J ⊂ 1, . . . , I : (Ej , f(Ej))j∈J is rationalizable]

d will be called distance to rationalizability of the legal output LO.

It is straightforward to see that if the legal output LO is rationalizable, thend(LO) = 0. On the other hand, if the legal output is not rationalizable, d can be


thought as the least number of observations that need to be removed from ourdataset so that the remaining solutions are rationalizable. The implementationof this problem will be as the following. We will try to maximize the cardi-nality of J ⊂ 1, . . . , I. We can start looking at all of those subsets that havecardinality I − 1, then continue with all of those with cardinality I − 2, and soon. This process will obligue us to check 2I possible combinations. Althoughdifficult, the finiteness of the number of combinations will guarantee the exis-tence of d. The code to implement the distance to rationalizability is availableon the Appendix.

2.2 FINDING A SOLUTION EXPLICITLY

In order to compute the solution to our rationalizability problem, we can definea maximization problem that yields an x such that A ·x > 0. For this purpose, Iwill define two different problems. Each of them will take a different approach.The reason behind defining two problems and not only one is that their com-putational implementation can lead to different types of solutions. While thefirst maximization problem is quicker in computational terms when used withlarge-scale datasets, it is very sensitive to the initial conditions on the problem.The second maximization problem will lead to more robust solutions that areless dependent on the initial iteration, but will take as much as ten times longerto converge than the first problem.

Define the first problem as:

maxx∈R2M

‖x‖2 (2.3)

s.t. A · x > 0

− x ≤ xm ≤ x for all m = 1, . . . , 2M

The second problem will be defined as:

maxx∈R2M

min

i∈1,2,...,Ix · zi

(2.4)

s.t. A · x > 0

− x ≤ xm ≤ x for all m = 1, . . . , 2M

Either of this problem’s solution will rationalize our parameters. We canrun (2.3) first and determine whether the solution is admissible for our grounds.What may happen with this solution is that there could be the case that a lotof the components of vector x can be zero. When the algorithm produces aninterior solution on x, this solution will be a very simple and elegant solution.Yet, this algorithm can also try to go to a corner solution. If this is the case,then using (2.4) can yield a much richer solution. The difference in the objec-tive functions comes at a cost.11 The code to implement the problems (2.2), (2.3)

11While the former can be implemented as a quadratic programming problem that is quick tosolve, the latter will be implemented as a constrained maximization that is costly in computingterms.


and (2.4) are shown in the Appendix.

2.3 TIGHTNESS OF OUR SOLUTION

As mentioned before, there is nothing that guarantees the existence of a uniquesolution to A · x > 0. In fact, it is easy to show that if there exists an x suchthat A · x > 0, then there will exist infinitely many of them. However, we canthink of a notion of the size of the set of solutions. First of all, define the setS = S(A) = x ∈ R2M

: A ·x > 0 as the set of all possible admissible solutionsof our problem. It is straightforward to show that S is a linear convex cone thatdoes not include the origin. We will define S = S(A) = x ∈ S : ‖x‖2 = 1 asall admissible solutions that intersect with the unitary hypersphere in R2M

.

Definition 2.2 (Size of the solution). We will define the size of the solution as theproportion of the (unitary) hypersphere that intersects with the set S in terms of thesize of the whole hypersphere, so that:

µ(S) =λ(S)

λ(x ∈ R2M : ‖x‖2 = 1)

where λ is the Lebesgue measure.

It will be straightforward to show that 0 ≤ µ(S) ≤ 1. Again, the code toimplement the tightness of the solution in MATLAB is shown in the Appendix.

2.4 USING OUR MOCK EXAMPLES TO DESCRIBE THE PROCESS OF RECOVERY

Now think about our mock datasets from Table 1. We will have that both ofthem can be mapped directly into two matrices. For space constraints, we onlypresent the matrix from the first example at Table 2a. It turns out that thisdataset is indeed rationalizable, which is shown by the parameters on Table2b. We also document one example of the parameters that can be used to ra-tionalize this data (that arises from the information on 2b) on Table 2c. Finally,the tightness of our solution will be equal to 0.041.


TABLE 2: A SOLUTION FOR EXAMPLE 1

(A) MATRIX A DERIVED FROM THE DATASET OF EXAMPLE 1

A =

0 0 1 0 0 0 1 0 0 0 1 0 0 0 1 00 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0−1 −1 0 0 −1 −1 0 0 −1 −1 0 0 −1 −1 0 0

0 0 0 0 0 0 0 0 0 0 −1 −1 0 0 −1 −1

(B) SOLUTION USING THE ALGORITHM FOR EXAMPLE 1

x =

−0.035−0.035

0.1200.120−0.034−0.035

0.1200.120−0.035−0.035−0.000−0.120−0.035−0.035

0.000−0.120

A · x =

0.23980.23980.27940.2398

(C) PARAMETERS THAT RATIONALIZE THE EXAMPLE 1

G =(

0 0 1 1 0 0 1 1 0 0 0 0 0 0 1 0)

q = 1/2

β =(

0.04 0.04 0.12 0.12 0.04 0.04 0.12 0.12 0.04 0.03 0 0.12 0.04 0.04 0 0.12)

For the second example, we can observe that the data will not be rational-izable. This will come from the fact that there is one choice reversal betweenobservations 1 + 2 and observation 3. In fact, r = (1, 1, 1, 0) would suffice thisclaim. However, we can look at what is the smallest amount of case files thatwe have to take out of the matrix so that the data becomes rationalizable. Inthis case, the distance to rationalizability will be 1, as removing the third ob-servation would lead us with a rationalizable subset.


3. AN APPLICATION OF THE MODEL TO REAL DATA

3.1 THE DATASET

I will use a dataset that comes from a labor court in Mexico City. This is a laborcourt that applies the federal labor law applicable to all of the private sectorworkers in Mexico. The information is captured from the documents producedby the court. We are able to compile the dataset in this way because, eventhough trials are mostly oral, judges decide on a closed record. They receivethe closed case file with the transcription of all the hearings and the evidencepresented on those hearings. After reading the evidence, judges motivate theirdecisions and give their verdict.12 In this dataset, we observe the motivation(and with that, we observe the state of some bits or their lack of existence) andtheir verdict.

We observe 27 observations of the same judge –our decision maker– with10 bits of evidence each. In our dataset, the judge is trying to determine ifthere was an employer-employee relationship. If this is the case, the judgewould later proceed to determine if its termination was fair or unfair. Fairfiring does not lead to any severance pay, while unfair firing means that therewill be severance pay.13 For this exercise, we will use ten bits and one outcome.Table 3a describes the bits of evidence used and Table 3b describes the verdictthat was tested.

Something to take into consideration is that all of these cases have, as acommon characteristic, that the employer argued that (s)he never hired the‘employee’, so that none of the plaintiffs should have a working contract. Thisis why the verdict on Table 3b becomes so important; to determine whether onespecific firing was fair or unfair, we need to determine whether there was or nota employer-employee relation first. Another thing to take into consideration isthat the judge’s incentives are aligned to carefully revise the information. Ifthere is an appeal granted on a case, the judges are obliged to revise the casewithin 72 hours. So judges are better doing their job correctly at the first timeinstead of having to revise the complete case again in this very short period.This mechanism aligns the judges’ incentives to capture the complete evidenceso that they reduce the number of appeals.

Now, I will properly describe the bits on Table 3a. The bit #1 states if theemployer tried to argue that the case has been outdated by law. In this case,any case older than its statute of limitations should go in favor of the employer.This bit is interesting since observing 1xx . . . x could suffice to say that therewas no employer-employee relationship (even if there was). The bits #2 and#3 answer whether the employee requested a deposition. Depositions are in-stances where someone is called to court to answer some questions. Bit #3

12By law, it should never be the case that some information is left out by the judge, as this wouldguarantee enough grounds for an appeal.

13As a next step, it could be interesting to see whether this model can also predict the amount ofseverance paid conditional on the probability of guiltiness. One hypothesis could be that the moreprobable that the firing was out of the law increases the amount of severance. Unfortunately, atthe moment of this paper, this type of data has not been compiled.


becomes important when considering that, if the employer did not attend thedeposition, everything the employee said is taken to be true. Bit #4 is straight-forward. It answers if there are witnesses that validate the employee’s story.Witnesses may be able to prove that, even without a working contract, theplaintiff indeed was working for the employer. Bits #5 and #6 are documentswhich are in favor of the employee. Notice the similarity with bit #10; exceptfor the fact that if the employer submits a payslip, it would be to argue thatthere is no record of the employee being paid. Finally, bits #7, #8 and #9 are thecounterparts of #2, #3 and #4, respectively.

TABLE 3: EXPLANATION OF THE BITS OF EVIDENCE AND THE VERDICT FROM THELABOR COURTS DATASET

(A) BITS OF EVIDENCE USED

# bit Explanation of the bit

1 Did the employer try to argue that case was outdated by the statute of limitations?2 Was there a deposition requested (and that took place) by the employee?3 Did the employer attended the deposition requested by the employee?4 Are there witnesses in favor of the employee?5 Did the employee submit as evidence a payslip?6 Did the employee submit as evidence a contract?7 Was there a deposition requested (and that took place) by the employer?8 Did the employee attended the deposition requested by the employer?9 Are there witnesses in favor of the employer?

10 Did the employer submit as evidence a payslip?

(B) VERDICT USED

# outcome Explanation of the outcome

1 Was the employer able to prove there was no employer-employee relationship?

3.2 RESULTS

It turns out that our outcome can be rationalized using our model. However,this may come from the fact that we only have a small number of observations(27) and a very big number of possible states (1024). In this case, the set ofpossible solutions for a given situation is quite big. Nevertheless, the valuefor the tightness of our solution is statistically zero. This may come from thenumber of dimensions.

Table 4 presents the cases that we use to recover the parameters, their ver-dicts and the probability of guiltiness found using the recovered parameters.


The cases are presented as a sequence of bits where the i−th bit correspondsexactly to the bit found on Table 3a. If the answer to the question presented isyes, then the bit is equal to one. If the answer to the question presented is no,then the bit is equal to zero. If there is no record of this question, then the bit isequal to x.

Notice that there are a few cases where the probability is close to 1/2. Eventhough this seems to be the case, numerically we can assure the inequality isfulfilled strictly in all of these cases. However, the values are so close to 1/2that the value rounded up to three digits is exactly 0.500.

TABLE 4: MODEL RESULTS ON THE LABOR COURT DATASET

# Case file Evidence Real verdict Estimated probability of g(1 = G) (β(G|E))

1 00100x10x1 1 0.5092 0010xx01xx 1 0.6503 1100xxxxxx 0 0.0854 00100x110x 1 0.6365 001xxx10xx 1 0.5906 0010xx01xx 1 0.6507 0xxx0x01xx 1 0.5278 0010xx010x 1 0.5009 0010xx01xx 1 0.650

10 0xxxxxxxxx 1 0.50011 001xxx01xx 1 0.57212 001x0x01x0 1 0.50013 0xxxxx10xx 1 0.52514 0010xx010x 1 0.50015 00100x0100 0 0.50016 1110xx01xx 0 0.05217 0010xx010x 0 0.50018 0010x0010x 1 0.50019 0xxxxxxxxx 1 0.50020 111011xxxx 0 0.05021 001xxx01x0 1 0.54022 00100001xx 1 0.84423 0xxxxxxxxx 0 0.50024 0000xx00xx 1 0.50025 0xx0xx01xx 1 0.57126 001xxx01xx 1 0.57227 0010xxxxxx 1 0.533


4. FURTHER WORK: A MODEL WITH INFORMATION ACQUISITION

We now propose an idea to generalize this model. As in the past, supposethat the evidence E ⊂ Ω is fixed for the decision maker. Without applying anyeffort, the decision maker will not be able to observe any information regardingthe true state ω∗ and, if enough effort is applied, he will at most be able toobserve E. However, this effort will come at a cost that will be thoroughlyexplained later on.14 By applying costly effort, the judge may observe certainset of signals ME at a cost. We will call ME as the motivating evidence, whichwill be such that E ⊂ ME ⊂ Ω. The decision maker will use this motivatingevidence to maximize his expected utility. To give structure to the decisionproblem, motivational evidence will come in the form of bits. For each bit, thejudge either observes the value of that bit if he pays the cost or not observe thebit at all (in which case the result will be denoted by x as it was before).

To introduce the idea of costly information acquisition, we will define a costfunction for the evidence. This cost function will represent the cost the DMfaces to acquire certain amount of bits and will be crucial in the optimizationproblem. The cost function is constructed by using M constants (c1, . . . , cM )where each ci ∈ (0, 1] to define c : P(1, . . . ,M) → (0,M ] as c(J) =

∑j∈J cj

for each J ⊂ 1, . . . ,M. This structure ensures that c is both strictly monotoneand additive. Evidence not being present in the file would be equivalent tothinking that, for reasons exogenous to the outside observer, the cost functionwill have some non-attainable bits that vary in each draw.15 I will denote thenon-attainable bits by ψ ⊂ 1, . . . ,M as the set of non-attainable bits in a costfunction realization. We bound the cost of each bit, ci by 1 as any strictly greatercost will not be identifiable. Furthermore, any cost bigger or equal to 1 leadsto that bit never getting observed disregarding the cost of all of the rest of thebits.

Example 2. Suppose that M = 3 so that N = 8. If ψ = 1 then the first bit isnon-attainable. In this case, the decision maker must choose whether to observe thesecond bit only, the third bit only, the second and third bits together or to observe nobits at all.

The problem the DM faces is to maximize his expected utility by choos-ing motivating evidence (ME) and the decision ruling that accompanies the

14The cost of acquiring information to reveal the true state of the world can be thought of aseither a cognition cost, or an impatience cost. Cognition costs are important if information is noteasy to understand. On the other hand, impatience costs can represent the idea that if the DMis only allowed to consume after certain time, the utility from that consumption will be smallercompared to the utility this DM can have by consuming today. Both types of costs allow us tothink that it may be better not to acquire a bit of information if the bits planned to be acquired willnot be useful at all.

15For example, this may happen if some of the information that should be present is in realityabsent. Think for example in a situation where the judge has to give a verdict and one of thedocuments with evidence has gone missing from the file. If the judge applied enough effort tolook for the document he can find the document and use the information on it. This is a reason onwhy can there be an exogenous variation on cost.


motivating evidence f(ME) given the non-attainable bits ψ. The set of nonattainable bits will be those that are not present in the evidence E.

maxME,f(E)

E(U)− c(ME) (4.1)

s.t. ψ ⊂ 1, . . . ,M

Existence of a solution in this problem comes from the fact that the cardi-nality of the state space is finite.16 However, the problem is computationallydifficult for a human as the cardinality of the problem grows exponentiallywith the number of bits. This means that when the space state has M bits, theDM has to compare over 2M alternatives.

Example 3. Suppose that the state space is composed only of two bits and both of themare feasible. The individual must choose whether to observe the first bit, the second bit,both bits or no bits at all. Think for example, that the individual is considering choosingto observe the first bit. In this situation, he needs to compute the probability of the firstbit being a 0 -and the action he would follow if this is the case- and the same if the bitsays that the true state is equal to 1. For each of this situations, the individual willbayesianly update his prior belief and decide according to that.

4.1 DATA GENERATED

We will define an extended legal file as a triplet (E, f(ME), ψ). Notice thatthere is a condition on ψ, as it cannot include any bits not present at E. Byexogenously varying the set of non attainable bits, the outside observer canobtain a set of legal files. These legal files will contain the information thatthe DM used to pronounce the verdict. From these files, our primary aim willbe to look under what conditions can the parameters (β, q,G, c) be recovered.The dataset generated will be called the court output, and will be denoted by(Ek, f(ME)k, ψk)Kk=1; in this case, the court output consists of K legal files.Finally, we will say that parameters (β, q,G, c) recover the court output if theycan be used to generate the complete court output.

5. CONCLUSIONS

There are two direct implications that arise from the model presented in Section1. Suppose that all of the information that the judge used to take the decision ispresent in our database. If this is the case, no judge should be able to systemat-ically say guilty or innocent with the exact same amount of evidence. This typeof inconsistencies from a judge –or any decision maker in general— should becarefully examined. In the judge’s example, this could mean that the judgemay be failing to compile all of the evidence that is being taken into consider-ation; in the Mexican case this is sufficient grounds for an appeal. Even worse,

16The individual needs to compute a finite number of expected values to choose from. Finitenessensures the existence of a (global) maximum.


the judge could be using information that he does not want to compile on theevidence (like a brivery, for example).

The second direct implication is that, for information to be rationalizable,there should be no choice reversals. If, caeteris paribus, changing the value of aspecific bit from 0 to 1 does not change the judge’s decision, then not observingthat bit at all should not change the decision either. This is exactly what ourTheorem 2.2 proposes. Think about it this way: if there is a choice reversal,then there will be a linear independent number of inequalities in matrix A. Inthis situation, it is most likely that the decision maker is choosing both g and iwith the same amount of information and this should not happen.

Finally, the last contribution of this paper is to explicitly work on the com-putational aspects of the existence of a solution. While existence is itself ofgreat importance, this paper combines a high-degree of difficulty theoreticalmodel and empirical results. This allows us to talk about the size of our solu-tion and distance to rationalizability as proper elements, rather than observingthem just as mere abstract objects.

REFERENCES

BLACKWELL, D. (1951): Comparison of Experiments, Berkeley, California: Uni-versity of California Press, 93–102, Proceedings of the Second Berkeley Sym-posium on Mathematical Statistics and Probability.

CAPLIN, A. AND M. DEAN (2014): “Revealed Preference, Rational Inattention,and Costly Information Acquisition,” NBER Working Papers 19876, NationalBureau of Economic Research, Inc.

CAPLIN, A., M. DEAN, AND D. MARTIN (2011): “Search and Satisficing,”American Economic Review, 101, pp. 2899–2922.

CAPLIN, A., A. GOMBERG, AND J. SADKA (2014): “Legal Evidence and Ratio-nal Inattention,” Unpublished manuscript.

FISHBURN, P. (1970): Utility theory for decision making, Publications in opera-tions research, Wiley.

GABAIX, X., D. LAIBSON, G. MOLOCHE, AND S. WEINBERG (2006): “CostlyInformation Acquisition: Experimental Analysis of a Boundedly RationalModel,” American Economic Review, 96, 1043–1068.

KLING, J. R., S. MULLAINATHAN, E. SHAFIR, L. C. VERMEULEN, AND M. V.WROBEL (2012): “Comparison Friction: Experimental Evidence from Medi-care drug plans,” Quarterly Journal of Economics, 127, 199–235.

LESTER, B., N. PERSICO, AND L. VISSCHERS (2012): “Information Acquisitionand the Exclusion of Evidence in Trials,” Journal of Law, Economics, and Orga-nization, 28, 163–182.


LIPMAN, B. L. (1991): “How to Decide How to Decide How to...: ModelingLimited Rationality,” Econometrica, 59, pp. 1105–1125.

MASATLIOGLU, Y., D. NAKAJIMA, AND E. Y. OZBAY (2012): “Revealed Atten-tion,” American Economic Review, 102, 2183–2205.

SEIRA, E. AND A. ELIZONDO (2014): “Are Information Disclosure MandatesEffective? Evidence from the Credit Card Market,” Unpublished manuscript.

SIMON, H. A. (1955): “A Behavioral Model of Rational Choice,” Quarterly Jour-nal of Economics, 69, pp. 99–118.

a


APPENDIX∗

A. PROOFS

Proposition 1.1. The legal outputLO = (Ei, f(Ei))Ii=1 is rationalizable by (β, q,G)

if for E ∈ LO and E′ ∈ LO such that E ∩E′ = ∅ and f(E) = f(E′) we have that ifE ∪ E′ ∈ LO then f(E) = f(E ∪ E′).

Proof. Suppose that (β, q,G) rationalizes LO. We will only show the case thatf(E) = f(E′) = g. Then we have that:

β(G ∩ E)

β(E)> q ⇔ β(G ∩ E) > qβ(E) (A.1)

β(G ∩ E′)β(E′)

> q ⇔ β(G ∩ E′) > qβ(E′) (A.2)

Now suppose thatE∪E′ ∈ LO, then it must be the case that the decision rulingis such that:

β(G|E ∪ E′) =β(G ∩ (E∪E′))

β(E∪E′)=β((G ∩ E)∪(G ∩ E′))

β(E∪E′)=β(G ∩ E) + β(G ∩ E′)

β(E) + β(E′)

β(G|E ∪ E′) > q

(β(E) + β(E′)

β(E) + β(E′)

)= q

And therefore f(E ∪ E′) = g.

Theorem 2.1. For a given legal output LO = (Ei, f(Ei))Ii=1 we generate matrix

AI×2M . If there exists a vector x ∈ R2M

such that A · x > 0, then there exists(β, G, q) that rationalizes the legal output.

Proof. By hypothesis, there exists a vector x ∈ R2M

such that A · x > 0. Weknow that such vector should be such that ‖x‖1 > 0. From now on, we willassume without loss of generality that ‖x‖1 = 1. For each j = 1, . . . , 2M , wewill define β(ωj) = |xj | ≥ 0, so that by construction:

2M∑j=1

β(ωj) = 1

Now define the guilty partition by G(ωj) = 1 if xj > 0 and G(ωj) = 0 if xj ≤ 0.Finally, we will set q = 1/2. We will know show that (β, G, q) rationalize thelegal output. Fix i ∈ 1, . . . , I, then we have that:

2M∑j=1

a(Ei) · 1Ei(ωj)xj > 0 ⇐⇒ a(Ei)

∑j:wj∈Ei

xj > 0 (A.3)

∗Matlab code implemented can be found in the online version. To download it go tohttp://www.gacetadeeconomia.itam.mx


Moreover, by our construction we have that:∑j:wj∈Ei

xj =∑

j:wj∈Ei

xj>0

xj +∑

j:wj∈Ei

xj≤0

xj =∑

j:wj∈Ei

G(ωj)=1

β(ωj)−∑

j:wj∈Ei

G(ωj)=0

β(ωj) = 2∑

j:wj∈Ei

G(ωj)=1

β(ωj)−∑

j:wj∈Ei

β(ωj)

(A.4)

Combining the information from (A.3) and (A.4) we know that:

a(Ei) ·

2∑

j:wj∈Ei

G(ωj)=1

β(ωj)−∑

j:wj∈Ei

β(ωj)

> 0 ⇐⇒ a(Ei) ·(

2 · β(G ∩ Ei)− β(Ei))> 0

(A.5)

So we will have two cases, either a(Ei) = 1 or a(Ei) = −1. However, from(A.5) notice that we will have:

f(Ei) = g ⇒ a(Ei) = 1⇒ β(G|Ei) > q & f(Ei) = i⇒ a(Ei) = −1⇒ β(G|Ei) < q

So we conclude that (β, G, q) rationalizes the legal output.

B. MATLAB CODE IMPLEMENTED

B.1 DATA PREPROCESSING AND MOCK DATASETS USED

1 %% DATA PREPROCESSING OF THE REAL DATABASE2 clc, clear3 data = xlsread('base_preprocesada.xls');4 verdict = data(:,end−1:end−1);5 verdict(:,2) = 1−verdict(:,2);6 [n,m] = size(data(:,1:end−3));7 data_filtered = zeros(n,1);8 j=1;9 for i=1:m

10 if min(isnan(data(:,i))) ~= 111 data_filtered(:,j) = data(:,i);12 j = j+1;13 end14 end15 data = data_filtered(:,1:end);1617 %% MOCK DATABASES18 clc, clear1920 data = [NaN,NaN,1,0; 0,NaN,1,1; NaN,NaN,0,NaN; 1,NaN,1,NaN] % DATASET 121 verdict = [1 1 0 0]'2223 data = [NaN,NaN,1,0; NaN,NaN,1,1; NaN,NaN,1,NaN; 1,0,0,NaN] % DATASET 224 verdict = [1 1 0 1]'

B.2 MAIN SCRIPT

1 %% FINDING A SOLUTION AND THE SIZE OF THE SET OF SOLUTIONS2 [num_obs,M] = size(data);3 num_simul = 10000;4 for i = 1:25 disp(['Verdict ',num2str(i)])


6 count_states = generate_matrix_A( data, verdict(:,i));7 rationalizable = check_rationalizability(count_states)8 if rationalizable == 19 [x, Ax] = prob_sol(count_states,M,1);

10 result_prob = create_prob( x, count_states, num_obs )11 size = size_solutions( count_states,M,num_simul)12 end13 end1415 %% DISTANCE TO RATIONALIZABILITY16 distance = distance_rat(data,verdict)

B.3 GENERATE MATRIX A

1 function [ count_states ] = generate_matrix_A( data, verdict )2 [num_obs,M] = size(data);3 states = dec2bin(0:2^M−1);4 states_mat = states == '1';5 count_states = ones(num_obs,2^M);6 for i=1:num_obs7 check = 1−isnan(data(i,:));8 data_check = data(i,~isnan(data(i,:)));9 if isempty(data_check) ~= 1

10 for j=1:2^M11 mat_check = states_mat(j,:);12 mat_check = mat_check(check==1);13 count_states(i,j) = min(data_check == mat_check)==1;14 end15 end16 end17 count_states(verdict == 0,:) = −1*count_states(verdict == 0,:);18 end

B.4 CHECK IF THE LEGAL OUTPUT IS RATIONALIZABLE

1 function [ rationalizable ] = check_rationalizability(count_states, fun)23 [num_obs,M] = size(count_states);4 if fun == 15 opts = optimset('Algorithm','active−set','Display', 'off');6 [r,fval] = quadprog(−(1/2)*eye(num_obs), zeros(num_obs,1), ...7 [],[],count_states', zeros(M,1), ...8 zeros(num_obs,1),100*ones(num_obs,1),zeros(num_obs,1),opts);9 rationalizable = (− fval < .0001 & min(r) >= 0);

10 rationalizable = 1 − rationalizable;11 end12 if fun == 213 num_bits = log2(M);14 options=optimset('Display', 'off');15 [x,~,exitflag]=linprog(−ones(num_obs,1),[],[],count_states',...16 zeros(1,2^num_bits),zeros(num_obs,1),10000*ones(num_obs,1),[],options);17 %[max(abs(x'*count_states)), min(x), exitflag]18 rationalizable = (exitflag == −2);19 end20 end

B.5 PROVIDE A SOLUTION IF DATA IS RATIONALIZABLE

1 function [ x, Ax ] = prob_sol( count_states,M,i )2 opts = optimset('Algorithm','active−set','Display','off');3 [num_obs,~] = size(count_states);4 switch i5 case 16 [x,~] = quadprog(−(1/2)*eye(2^M), zeros(2^M,1),...7 −count_states,−1*ones(num_obs,1),[],[],...8 −100*ones(2^M,1),100*ones(2^M,1),rand(2^M,1),opts);9 case 2

10 fun_aux = @(A,x) − min(A*x);11 fun_sol = @(x) fun_aux(count_states,x);12 [x,fval] = fmincon(fun_sol,rand(2^M,1),−count_states,...13 zeros(num_obs,1),[],[],−100*ones(2^M,1),100*ones(2^M,1),[],opts);


14 end15 Ax = count_states*x;16 if i == 1 & min(Ax) < 017 disp('The algorithm did not find a feasible starting point, switching...')18 [x,Ax] = prob_sol(count_states,M,2);19 end20 end

B.6 SIZE OF THE SET OF SOLUTIONS

1 function [ size ] = size_solutions( count_states,M,num_simul )2 simul = 2*rand(2^M,num_simul)−1;3 simul = simul./sqrt(ones(2^M,1)*sum(simul.^2));4 size = sum(min(count_states*simul)>0)/num_simul;5 end

B.7 DISTANCE TO RATIONALIZABILITY

1 function [ distance ] = distance_rat( data, verdict )2 [num_obs,M] = size(data);3 count_states = generate_matrix_A(data, verdict);4 rationalizable = 0;5 i = num_obs;6 while rationalizable == 07 posibles = nchoosek(1:num_obs,i);8 [num_tries,~] = size(posibles);9 num_try = 1;

10 while rationalizable == 0 && num_try <= num_tries11 probar = count_states(posibles(num_try,:),:);12 rationalizable = check_rationalizability(probar);13 num_try = num_try + 1;14 end15 i = i − 1;16 end17 distance = num_obs − (i+1);18 end

B.8 RECOVERING PARAMETERS GIVEN RATIONALIZABILITY

1 function [ result_prob ] = create_prob( x, count_states, num_obs )2 inform = x;3 G = (inform > 0);4 beta = abs(inform)/sum(abs(inform));5 mat_info = abs(count_states);6 beta_guilt = mat_info.*(ones(num_obs,1)*G')*beta;7 beta_e = mat_info*beta;8 result_prob = beta_guilt./beta_e;9 end

A SURVEY ON THE WEAK EVIDENCE OF

NON-FUNDAMENTALNESS IN ECONOMIC

APPLICATIONS*

EDUARDO LAGUNA-MÜGGENBURG** IGNACIO LOBATO***

Time series analysis with economic data has unduly relied on the use of causaland invertible models, especially when it comes to dealing with autoregressive mo-ving average (ARMA) models. Procedures that deal with non-fundamental modelshave slowly developed under the claim that causality and invertibility should notbe taken for granted. It is believed that the conclusions of a model may be highly de-pendent on these assumptions. Empirical evidence of these non-standard featuresis not vast, casting doubt upon their economic importance or on the usefulness ofnew identification procedures. In this survey, we present some representative exam-ples of non-fundamentalness found in economic literature but argue that these donot make a compelling case to affirm with certainty that non-fundamentalness isempirically important. Although there are some hints that these features are in factrelevant, empirics need to be performed more carefully to dissipate any doubt.

JEL CODES: C10, C22, C32, C53KEY WORDS: Time Series, Non-Fundamentalness, Empirical Survey

Standard time series analysis with economic data has excessively relied onthe use of causal and invertible (fundamental) models, in particular on auto-regressive moving average (ARMA) models. The traditional justification hasbeen both theoretical and practical. Causal and invertible ARMA models areidentified from second order properties of the data, so typical estimation pro-cedures can be employed. Innovations in these models are given the standardinterpretation of forecasting errors. In recent years this standard frameworkhas been questioned in at least two scenarios. First, in economic models withrational expectations where the economic agents have larger information setsthan the ones available to the econometricians, see Hansen and Sargent (1980)or Leeper et al. (2013). Second, when the economic agents do not possess ho-mogenous information, see for instance, Kasa et al. (2006) or Rondina (2007).For a recent survey see Alessi et. al (2011).

Despite the theoretical advancements, statistical analysis with non-fundamentalmodels (i.e. models that fail to be causal or invertible) has not developed ac-cordingly. The main problem associated with these models is that second orderproperties of time series cannot be used to identify the parameters of interest.Then, procedures that rely on the second order properties of the series, that is,autocorrelations or spectral density, are of no direct help. In particular, since forGaussian series the parameters of non-invertible/non-causal models cannot be

*We thank financial support from CONACYT, reference number 151264, and from AsociaciónMexicana de Cultura

**ITAM/Stanford University. Email: [email protected]***ITAM. Email: [email protected]

[email protected]

[email protected]

42 NON-FUNDAMENTAL TIME SERIES: SURVEY

identified, the statistical procedures associated to the Gaussianity assumptionhave to be put aside or have to be modified.

The statistical literature has developed two main approaches: either assu-ming a non-Gaussian specification for the innovation distribution or using theinformation contained in higher order moments. Andrews et al. (2007a) andBreidt et al. (2001) follow the first approach and assume a Laplace distribution.Similarly, an approximate maximum likelihood procedure for non Gaussianprocesses is examined by Lii and Rosenblatt (1992). In the second approach,the same authors proposed the use of higher order spectral tools (Lii and Ro-senblatt, 1982). Lobato and Velasco (2013) consider the efficient combination ofthe information contained in higher order moments also in the frequency do-main. In the time domain, Ramsey and Montenegro (1992) propose the use oftools based on higher order moments. For the invertible and for the noniverti-ble case, Ghysels et al. (2003) and Gospodinov and Ng (2013), respectively, usethe simulated method of moments.

This article raises the question on whether there is enough empirical evi-dence of these non-standard features. It is not uncommon that authors presentempirical applications along with their proposed methods for dealing withnon-invertibility or non-causality. However, it is not always obvious that thechosen specification provides the best model for the data. Economic theory ar-guments can make a compelling case for employing models detached from thestandard. Nevertheless, the absence of serious empirical applications suggeststhat there might be some bias influencing the authors to reinforce their modelswith forced empirical examples. The applications are not restricted to a certainsubject but rather cover a wide variety of economic data, e.g. macroeconomicsand finance.

The article is structured as follows. Section 2 provides basic time series de-finitions including some remarks on the multivariate case (Alessi et al., 2011);Section 3 contains the critical survey on current applications described in theliterature and final remarks are placed in Section 4.

1. DEFINITIONS

1.1 UNIVARIATE SERIES

The following standard definitions are intended to serve as a theoreticalbenchmark of non-causal and non-invertible models (Brockwell and Davis,1991).

Definition 1. An stochastic process Xt is said to be causal or a causal function ofεt, the innovations, if the former can be written as a function of all past values of thelatter.

Definition 2. If the innovations are the ones that can be written in terms of the pastvalues of Xt, then the process is said to be invertible.

LAGUNA-MÜGGENBURG & LOBATO 43

In the context of linear models, an ARMA(p,q) process defined by φ(L)Xt =θ(L)εt

1 will be causal if there exists a sequence of constants ψj such that:

∞∑j=0

| ψj |<∞ (1.1)

Xt =

∞∑j=0

ψjεt−j t = 0,±1, ... (1.2)

The process will be called invertible if there exists a sequence of constantsπj such that:

∞∑j=0

| πj |<∞ (1.3)

εt =

∞∑j=0

πjXt−j t = 0,±1, ... (1.4)

If the polynomials φ(·) and θ(·) have no common zeros, then Xt is causalif and only if all the roots of φ(z) are outside the unit circle and εt is invertibleif and only if all the roots of θ(z) are outside the unit circle. Even though apriori nothing precludes a process to be non-causal or non-invertible, whendealing with AR processes it is common practice to assume causality to finda covariance stationary representation. However, this assumption is usuallymade in an arbitrary way.

Deciding which is the correct specification is not always straightforward,for instance neither the first nor second moments can be used to discrimina-te between invertible and noninvertible representations of a moving average(MA(q)) process. This can be fully appreciated by setting q=1. The MA(1) de-fined by yt = µ + θut−1 + ut where the innovations ut are white noise (ut ∼WN(0, σ2)) has the same mean and autocovariances as y∗t = µ+ θ−1u∗t−1 + u∗twhere u∗t ∼WN(0, θ2σ2). If the white noises are also Gaussian then there is anidentification problem usually solved by imposing | θ |< 1. For non-Gaussianprocesses higher moments can be observed but often the same restriction ismade on the coefficient, imposing invertibility. In general, for a MA(q) thereare 2q−r alternative representations2 associated with the roots and their inver-ses. Only one of these has all of them outside the unit circle. The innovationsassociated with this representation are called fundamental innovations for Yt (Ha-milton, 1994).

Before proceeding to the multivariate case, we consider it is important topresent another concept that will be useful to discuss some of the applicationsdescribed below. The concept corresponds to a linear model that can reflect

1Where φ(L) and θ(L) represent polynomials of the lag function.2The number of models comes from the possible q roots of the MA model minus half the num-

ber of complex roots.


characteristics of non-linear processes, the all-pass model. An all-pass modelis a weak white noise ARMA model in which all roots of the autoregressivepolynomial are the reciprocals of the non-invertible roots of the moving avera-ge polynomial. They generate non-independent white noise series in the non-Gaussian case. For example an all-pass model of first order — AP(1)— is a veryparticular causal non-invertible ARMA(1,1):

yt − φyt−1 = µ+ ut − φ−1ut−1 | φ |< 1 (1.5)

For a possibly non-invertible polynomial θ(L) the process defined by yt =θ(L)et with et ∼ iid(0, σ2), can be written as yt = θ∗(L)εt where θ∗(L) is in-vertible and is obtained by inverting the roots of θ(L) that are inside the unitcircle. Thus εt = θ(L)θ∗(L)−1et follows an all-pass model, weak white noise.Following the approach of Andrews et al. (2007b) and Breidt et al. (2001) a non-invertible model can be estimated by first estimating the invertible model andthen extracting the white noise residuals to fit an all-pass model using Maxi-mum Likelihood, Least Absolute Deviation or some other method. One shouldbear in mind that this “heuristic” approach does not address global identifica-tion and the two-step procedure implies that roots are estimated twice, whichis inefficient. Davis and co-authors do not justify the use of residuals in thesecond step.

In the past it was usual to rule out non-funamentalness without much con-sideration for reasons of convenience. Even more, economic theory does notalways provide enough information to discriminate between the 2q−r repre-sentations a process can have. However, this does not imply that eliciting cau-sality or invertibility should be regarded as irrelevant. For instance, Lanne andSaikkonen (2013) explain that the usual method to test hypothesis in the in-terest rate model with rational expectations is actually quite dependent on theassumption of fundamentalness. This means that unless there is strong eviden-ce of causality, the results derived from this model are not robust.

1.2 MULTIVARIATE SERIES

The multivariate case is substantially more complicated and thus it is notyet fully developed and as we will see later there is not a unified approach tostudy these models. With that in mind, we present here the characterization ofnon-fundamental models put forward in Alessi et al. (2011) for a n-dimensionalcovariance-stationary zero-mean process Xt of observable variables and an m-dimensional unobservable vector process Zt. n ≥ m is a necessary conditionfor Zt ∈ spanXt−k, k ≥ 0, that is for Zt to be fundamental with respect toXt, because if n < m the number of shocks to be recovered exceeds the amountof series observed. Alessi et al. (2011) focus on square systems (n = m) andprovide the following characterization:

Proposition 1. Given a covariance-stationary vector Xt, the representation Xt =C(L)Zt is fundamental if:


1. Zt is a white noise vector.

2. C(L) has no poles, singularities in a complex function, of modulus less or equalthan unity, i.e. it has no poles inside the unit circle.

3. detC(y) has all roots outside the unit circle, i.e. detC(y) 6= 0 ∀y ∈ C s.t. | y |<1.

The inverse representation of Xt = C(L)Zt is D(L)Xt = Zt. If D(L) is aone-sided polynomial, then Zt is fundamental and can be approximated by afinite vector autoregression (VAR) (Alessi et al., 2011).

2. EMPIRICAL APPLICATIONS IN THE LITERATURE

2.1 NON-INVERTIBLE EXAMPLES

UNEMPLOYMENT

For the monthly time series of unemployment in the United States3 — fromJanuary 1948 to October 1997 — Huang and Pawitan (2000) apply the followingmethod: First they estimate the invertible representation of the process andthen search among 2q−r-equivalent models the one that minimizes a weightedsum of the absolute value of errors.

They propose a mixed seasonal model ARIMA(0, 1, 5)×(0, 0, 2)12 after loo-king at the ACF. The standard invertible estimates, using Gaussian likelihood,are:

For the regular component: θ = (0,0163,−0,1844,−0,1329,−0,1235,−0,1834)

For the seasonal component: Θ = (0,2578, 0,2265)

Innovation variance estimate: σ2 = 0,0417.

The standard errors of the estimates are all around 0.04. It should be notedthat for this example there are 32 second-order equivalent models, since thereare two pairs of complex roots for the regular component. The authors considerthe heavy tailed nature of the residuals to be indication of non-invertibilty andusing their modified loss function, the winning model Huang and Pawitanselect is one with pure invertible non-seasonal roots and pure non-invertibleseasonal roots. The estimates are:

For the regular component: θ = (0,0163,−0,1844,−0,1329,−0,1235,−0,1834)

For the seasonal component: Θ = (−1,1832, 4,4150)

Innovation variance estimate:σ = 0,0342.

3The information is public and free on the Federal Reserve Bank of St. Louis database:http://res earch.stlouisfed.org/fred2/data/UNRATE.txt.


The confidence in the final model was measured by a bootstrap evaluation.Huang and Pawitan generated 500 times series using the estimates above. Foreach series, the innovation process is a bootstrap sample from the final residualseries. After having reconstructed the series from these innovations they fittedan ARIMA(0, 1, 5)×(0, 0, 2)12 model for each series and chose the best one fromthe 25 possible models. The authors state they did not recompute the invertibleroots and deemed it acceptable since the standard errors where very small.

Huang and Pawitan state that 90 % of the 500 bootstrapped series produ-ced the same root configuration as the final model above. Nevertheless thisis no surprise given that the residuals used for the evaluation came from theestimated model they had proposed. For instance, consider a simple MA(1)model xt = εt + θεt−1 — just as the one used in Section 2 to show the pro-blem of identification. The value of θ determines if the process is invertible ornot; the estimated residuals clearly depend on the estimated parameter. Theseries for the residuals of θ∗ is different from the series of residuals when 1/θ∗

is used. This produces a bias towards the originally estimated configuration,diminishing the reliability of this procedure to measure the confidence of theestimation.

COMMODITY PRICES

Gospodinov and Ng (2013) present a minimum distance estimation met-hod for moving average models that does not impose invertibility to identifythe parameters. Using two second order and three third order cumulants theyidentify the parameters of a MA(1) model whose roots can be either outside,inside or in the unit circle. They also propose a simulation-based minimumdistance (SMD) estimator for more general moving average models.

They exemplify the use of their SMD estimator with the series for dailycommodity prices of the nearest futures contract for 22 commodities4 fromMarch 1983 to July 2008.5 They estimate an ARMA(1,1) model for the pricechange of each of the series. It is important to notice that, since price changesare practically white noise, a model with only one parameter would not pro-vide a significant result. An ARMA(1,1) allows for multiple solutions since thelack of identifiability generates canceling roots in the AR and MA components.The authors are aware of this problem and explain that this arises preciselyfrom imposing invertibility. If the true process is uncorrelated (i.e. it is whitenoise) but presents higher order dependence, then an all-pass non-invertiblemodel can be estimated to sidestep the canceling roots problem and obtainidentification. Of course, estimating such model imposes restrictions on theparameters that must be taken into account.

Comparing the results from their SMD method to the Gaussian MaximumLikelihood procedure they conclude that a non-invertible MA component ap-pears to be driving the price change process. They take the fact that the esti-mated parameters are of similar magnitude and sign across commodities as an

4From 6 different commodity groups.5The data was obtained from the Commodity Research Bureau.


CUADRO 1: SMD AND GAUSSIAN ML ESTIMATES OF ARMA(1,1) FROM GOSPODINOVAND NG (2013)

sample moments Gaussian ML SMDCommodities skewness kurtosis α θ α θ

Heating oil -0.207 7.394 0.653 -0.742 0.501 -1.531(0.239) (0.217) (0.096) (0.076)

Soybean oil 0.018 5.440 -0.893 0.814 0.635 -1.595(0.106) (0.124) (0.065) (0.108)

indication of correct identification (as an example, the estimation for heatingoil and soybean oil are provided in Table 1). Certainly the pattern exhibited bythe estimated coefficients seems to rule out multiple solutions as in the inverti-ble case. Nevertheless the model appears to be a first order all-pass and a wordof caution should be put forward. First, if the model is indeed an all-pass thenthe SMD should consider the restriction that the MA root to be the reciprocalof the AR one. Second, it is unreasonable to preform a Gaussian ML estimationsince, precisely by the presence of canceling roots, we know the latter modelcould be misspecified. It seems unfair to compare the estimations the authorspropose with the ones of an incorrectly specified model. A proper comparisoncould be with a Conditional Heteroskedastic Autorregresive model or somenon-linear model. This should be done before exploring the implications thatnon-invertibility of price changes has for impulse response analysis and fore-casting.

CAPITAL

Lobato and Velasco (2013) propose a procedure that achieves identificationon the location of the roots of the ARMA polynomials for non-Gaussian timeseries using higher order moments with frequency domain techniques. Thisprocedure eliminates the need to conduct non-invertibility and non-causalitytests beforehand and results in a consistent and asymptotically normal estima-tor. Although this particular application has not been incorporated in the finalversion of that paper it is the work of one of the authors and we present it herefor completeness.

Following Leeper et al. (2013), we believe fiscal foresight provides a plausi-ble story on why capital movement should have at least some forward lookingcomponents. The idea that agents can anticipate at least some shocks is not li-mited to fiscal situations; in fact the presence of government is not needed forthis behavior to appear. In a simple neoclassical model, anticipated positiveproductivity shocks generate an increase in the capital stock before the shockmaterializes. In the end, it does not really matter which kind of shock is moreplausibly foreseen (fiscal, financial, terms of trade; since in the simplest modelTotal Factor Productivity absorbs them all) to conjecture that capital dynamicsshould present a non-causal behavior. We focus on capital and not other varia-


bles such as consumption because capital is the state variable in the dynamicprograming problem. Additionally capital is the variable

We used annual data for the U.S. capital stock from 1948 to 2010 constructedby Kehoe and Meza (2011). After taking a look at the time series, and since themodel considers an economy in stationary state, we filtered the capital withan AR(2) model. Afterwards, we estimated a MA(1) using the methodology ofLobato and Velasco (2013). For the U.S. capital, the estimate of the coefficientis θ=2.52 which corresponds to a non-invertible moving average model. Weselected these lags to have a parsimonious model but this is by no means asettled question and is only presented to signal posible areas of future research;we do not want the somewhat negative tone of the survey to discourage furtherwork.

2.2 NON-CAUSAL EXAMPLES

MICROSOFT STOCK

Breidt et al. (2001) use a two-step procedure with all-pass models to fit non-causal models. The steps are: 1) estimate a sth order AR model and obtain theresiduals; 2) select the order r of the model and fit a purely non causal rth orderall-pass model. The appropriate causal AR(q) polynomial can be identified bycanceling the roots in the causal AR(s) polynomial which correspond to theinverses of the roots in the purely non causal AR(r) polynomial. They apply thismethod to the volumes of Microsoft (MSFT) stock traded over 754 transactiondays from June 3rd, 1996 to May 27th, 1999.6

Since they want to focus on estimation and not on the order selection pro-blem an AR(1) is fitted with Gaussian ML, which yields:

φ(c)nc = 0,5834 (0,0296) (2.1)

The resulting residuals do not show much evidence of correlation, but thesquare and absolute value of the residuals does have significant lag one autoco-rrelations.7 They conclude that a AR(1) model with i.i.d noise is not appropriatefor the data. This is in the same spirit as the procedure proposed by Chen et al.(2012) where independence of innovations is taken as characteristic of a funda-mental process. Fitting a purely non-causal all-pass model to the residuals, thenew estimate:

φnc = 1,7522 (0,0989) (2.2)

This time there are no signs of autocorrelation. However inference derivedform this two-step procedure is questionable, since they estimate the same pa-rameter twice. In addition, the series used in the second step come from theresiduals of the first one and standard errors should take into account the fact.The properties of the procedure proposed in this paper are unknown.

6This information is public and can be easily obtained; for example athttp://finance.yahoo.com/ q/hp?s=MSFT.

7The asymptotic p-value is less than 0.001.


Breidt et al. (2001) also fitted trading volumes for two small companies inthe same sector and found that causal AR models described adequately the dy-namics. They explain it is possible that, because of its size, Microsoft’s actionsare widely anticipated by the market, so the effect of shocks precedes their arri-val and thus a non-causal model seems appropriate.

WAL-MART STOCK

For the volumes of Wal-Mart stock traded daily on the New York StockExchange from December 1, 2003 to December 31, 2004 (Xt), Andrews et al.(2007a) propose an AR(2) process with non-Gaussian noise (Zt). The ML esti-mation leads to the following non causal AR(2) model:

(1− 0,7380L)(1 + 2,8146L)Xt = (1 + 2,0766L− 2,0772L2)Xt = Zt (2.3)

With one root inside and one outside the unit circle. Analyzing the residualsthey conclude that this model satisfactorily fits the series in contrast with thecausal AR(2) — which they estimate as well. Both the absolute values and squa-res of the residuals of the causal model present large one-lag correlations andhence one can not conclude they are independent.

The validation method seems arbitrary, a bootstrap procedure is used to ar-gue in favor of the non-causal model. The causal model is rejected by lookingat the ACF of the absolute values and squares of the residuals, since large one-lag correlations imply that these series are not i.i.d.. Also arbitrary is the timerange of the studied series; it is not clear why the series are so short. More em-pirical evidence is needed. This and the previous example hint an interestingbehavior on stock volume for large firms, but more robust analyses should beconducted. It would be desirable that an ARMA model, instead of just an ARmodel, would be fitted to this data and the one in the previous example to ha-ve a more general comparison between causal and non-causal models. It couldbe the case that the AR parameter is capturing some of the moving averagebehavior and thus appearing to be non-causal when it is not.

2.3 MULTIVARIATE NON-FUNDAMENTAL EXAMPLES

Lanne and Saikkonen (2013) consider that given the wide use of VARs ineconomics and finance, and the fact that the conclusions of some models de-pend on the assumption of causality, non causal VARs should be used to vali-date statistical analysis based on causal models. Thus, they generalize a modelpreviously developed by themselves for the univariate case (Lanne and Saik-konen, 2013). Their model is composed by two matrix polynomials of the back-ward shift operator, Π(B) and Φ(B−1), that relate an n-dimensional stochasticprocess yt and a sequence of i.i.d. random vectors εt.

Π(B)Φ(B−1)yt = εt (2.4)


Both matrix polynomials have their zeros outside the unit circle. If at leastone coefficient of the polynomial Φ is different from zero, the process is noncausal; Lanne and Saikkonen (2013) call it purely non causal if all the coeffi-cients of Π are zero. Note that the conventional causal model is obtained whenall the coefficients from Φ are zero. We do not wish to proceed further with thismodel since it would take us far from scope of the survey and we refer anyinterested reader to the original paper. However, we believe the most generalaspects of the model were worth describing to clarify some of the unconven-tional notation used by these authors in their empirical example. They definea VAR(r,s) to be a process with a Π polynomial of degree r and a Φ polynomialof degree s.

INTEREST RATE

To exemplify their model, Lanne and Saikkonen (2013) study the expecta-tions hypothesis of the term structure of interest rates. This hypothesis esta-blishes that long term interest rate is a weighted sum of present and expectedfuture short-term rates and imposes testable restrictions on the VAR’s parame-ters. The dynamics captured by a causal VAR indicate the expected changes inthe short rate drive the term structure. However, Lanne and Saikkonen (2013)believe that the methods used in economics to prove the validity of the dependhighly on the assumption of causality. They use data from the monthly yieldsof U.S. zero-coupon bonds from January 1967 to April 1996.8 The demeanedchange in the six-month interest rate is represented by ∆rt and the spread bet-ween the five-year and the six-month interest rates by St.

Lanne and Saikkonen (2013) specify a non causal VAR and apply the follo-wing steps: First, they fit a conventional causal VAR model by LS or Gauss MLand determine its order. Then, they check its residuals for Gaussianity, if de-viations are detected then it makes sense to proceed to non causal models. Thisis because it is impossible to distinguish between causality and noncausalityunder Gaussianity. All causal and non causal models of the selected order areestimated and the one that maximizes the log-likelihood function is selected.Finally, error autocorrelation and conditional heteroskedasticity are checked.

The AIC and BIC select Gaussian VAR(3) and VAR(2) models, only the firstone has serially uncorrelated errors. However, it presents conditional heteros-kedasticity and considerable deviations form normality. Lanne and Saikkonencheck for Gaussianity with Q-Q plots and the Shapiro-Wilk test. A multivariatet-distribution is selected; the estimate of the degrees-of-freedom parameter is4.085 (1.210). An alternative for normality testing in time series can be foundin Lobato and Velasco (2004); where the possibility of serial dependence is ad-dressed.

All four third-order models are estimated, and a VAR(2,1)-t model maximi-zes the log-likelihood. It is also the only one with no autocorrelation or heteros-kedasticity. The estimated coefficients seem to have an interpretation contrary

8It is available in Duffee’s website: http://www.econ2.jhu.edu/people/Duffee/affine.htm.


to the expectations hypothesis. An expected increase in the short-term rate hasno effect on the spread and an expected future increase on the spread tends todecrease the short-term rate and increase the spread.

Using the same data, Davis and Song (2012) study non causality with anapproach that generalizes the univariate definitions discussed earlier. They ac-tually point out a weakness in Lanne and Saikkonen’s model. They note thatthe polynomial multiplication may not be commutative since all the coeffi-cients are matrices. This means that citing the same data to Π(B)Φ(B−1)yt = εtand Φ∗(B−1)Π∗(B)yt = εt may produce completely different coefficient esti-mates. Not only that, according to Davis and Song (2012) a conventional VAR,Φ(B)Xt = Zt, cannot always be represented by the model in equation 4.4. Theresults for the log-likelihood function values of the best fit using a VAR of order3 are replicated in Table 2. The estimation for the degrees of freedom is 2.806(0.715), indicating a heavy tailed distribution.

CUADRO 2: LOG-LIKELIHOOD FOR EACH VAR(3) MODEL

Causal Gaussian Causal Non-Gaussian Purely NC Mixed-256.624 -237.345 -235.481 -229.054

Davis and Song (2012) explain one should be very cautious with their esti-mators because there is a violation of some of the regularity conditions. The ARpolynomial det(IΦ1z−Φ2z

2−Φ3z3) has only one root inside the unit circle with

the other five outside. Thus, they argue, it is impossible to have a representa-tion like Lanne and Saikkonen’s model, that is, to separate the polynomial intoa causal component and a purely non causal component. Nevertheless, theyconclude the non-causal model is more appropriate than the causal one afteranalyzing the residuals’s ACF.

FISCAL FORESIGHT

Davis and Song (2012) fit a trivariate VAR model for the demeaned differen-ces of U.S. GDP, total government expenditure and total government revenue(all in real per capita terms). It is quarterly data from 1955 to 2000 previouslyused by Mountford and Uhlig. (2009). Davis and Song consider an earlier ver-sion of Lanne and Saikkonen’s paper in which this data was used and a VAR(2)was suggested based on the the fact that AIC and BIC selected VAR(3) andVAR(2), respectively. It is important to notice that the choice between Akaike’sand Shwarz’s criteria can lead to different conclusions. Escanciano and Lobato(2009) argue that the adequacy of using AIC or BIC depends critically on theorder of serial autocorrelation; a less arbitrary method would be to let the dataautomatically select the criterion. Some simple and robust automatic tests forthe correct specification of the lag order of a vector autoregression are presen-ted by Escanciano et al. (2013).

Davis and Song (2012) compare the log-likelihood function values of thebest fit of causal Gaussian VAR(2), causal non-Gaussian VAR(2), purely non


causal VAR(2) and the mixed VAR(2). These can be found in Table 3. Onceagain the fat-tailed distribution is indicated by the estimate for the degreesof freedom, λ = 5,751 (1.399). The estimates for the AR polynomial det(I −Φ1z−Φ2z

2) indicate that it has three roots inside the unit circle and three rootsoutside. Davis and Song (2012) note that for the multivariate case there can bemultiple ways to decompose I−Φ1z−Φ2z

2 = (I−Ψ1z)(I−Ψ2z). Observing theACF’s of the residual and of the squares of the residuals, Davis and Song (2012)conclude that the non causal model does a better job, since its square residualsdo not show serial correlation whereas the ones from the causal do. This meansthat multiple representations can correspond to the same VAR model.

CUADRO 3: LOG-LIKELIHOOD FOR EACH VAR(2) MODEL

Causal Gaussian Causal Non-Gaussian Purely NC Mixed-819.230 -802.865 -800.355 -791.270

Davis and Song (2012) provide very good insights for the development ofnon-fundamental processes in a multivariate context; especially the multiplerepresentation problem that arises at the decomposition of the matrix model.The empirical evidence they find in favor of non-causality might seem overw-helming but one should consider that their analysis is centered on linear mo-dels with i.i.d. innovations so all the dependence is linear. Perhaps the evidencein favor of non-causality is just a reflection of the inadequacy of the linear as-sumption. Non-causal models can present certain characteristics of non-linearmodels; for example, the squares of the series exhibit positive autocorrelation.Of great concern is the fact that testing for non-causality and choosing a modelamong the many alternatives turns out to be very sensible to the specification.For instance, the use of maximum quasi-likelihood might provide more robustestimates than gaussian maximum likelihood.

3. FINAL REMARKS

The results displayed in the previous section should be taken with somedegree of skepticism since the statistical properties of the employed procedu-res are not clearly stated, and sometimes they are even unknown. Evidence ofthe prevalence of these nonstandard features is far from being accumulatingin real economic data. In particular, the main problem with many of the pro-cedures employed in the previous section is the two-step approach, where theestimation error of the first stage is not taken into account in the second sta-ge. This renders inefficient estimation, and dubious statistical inference sincestandard errors are not properly estimated. For example, in Microsoft’s stockanalysis the autorregresive parameter is estimated in a first stage and its inver-se is independently estimated in a second step. This illustrates that more effortis needed in providing rigorous statistical procedures that perform inferencein just one step. These have the additional advantage of avoiding the need of


invertibility or causality tests, such as the one in Chen, Choi and Escanciano(2012).

In this article we have found that the empirical evidence regarding non-fundamen-talness in economics is rather weak. Nonetheless, the criticism ma-de over many of the empirical examples does not invalidate the theory presen-ted in the papers. Recent years have witnessed an immense development ofthe subject in economic as well as econometric theory. As econometric theoryfills the void exposed by economic theory with multiple new procedures, theempirical side seems to be lagging behind. A more thorough search for validapplications is imperative to preclude the developments in this area of econo-metrics to lose momentum.

REFERENCIAS

Alessi, L., Barigozzi, M., and Capasso, M. (2011). Non-fundamentalness instructural econometric models: A review. International Statistical Review,79:16–47.

Andrews, B., Calder, M., and Davis, R. (2007a). Maximum likelihood estima-tion for α-stable autoregressive processes. Annals of Statistics, 37:1946-1982.

Andrews, B., Davis, R. A., and Breidt, F. J. (2007b). Rank-based estimation forall-pass time series models. Annals of Statistics, 35(2):884–869.

Breidt, F. J., Davis, R. A., and Trinidade, A. A. (2001). Least absolute deviationestimation for all-pass time series models. Annals of Statistics, 29(4):919–946.

Brockwell, P. J. and Davis, R. A. (1991). Time Series: Theory and Methods. SpringerVerlag, New York.

Chen, B., Choi, J., and Escanciano, J. (2012). Testing for fundamental vectormoving average representations. Working Paper, Indiana University.

Davis, R. A. and Song, L. (2012). Noncausal vector AR processes with applica-tion to economic time series. Working Paper, Columbia University.

Escanciano, J. and Lobato, I. (2009). An automatic portmanteau test for serialcorrelation. Journal of Econometrics, 151(2):140–149.

Escanciano, J., Lobato, I. N., and Zhu, L. (2013). Automatic specification testingfor vector autoregressions and multivariate nonlinear time series models.Journal of Business and Economic Statistics, 31(4):426–437.

Ghysels, E., Khalaf, L., and Vodounou, C. (2003). Simulation based inferencein moving average models. Annales d’Economie et de Statistique, 69:85–99.

Gospodinov, N. and Ng, S. (2013). Minimum distance estimation of possiblynon-invertible moving average models. Unpublished.

Hamilton, J. D. (1994). Time Series Analysis. Princeton University, USA.


Huang, J. and Pawitan, Y. (2000). Quasi-likelihood estimation of non-invertiblemoving average processes. Scandinavian Journal of Statistics, 27:689–702.

Kasa, K., Walker, T., and Whiteman, C. (2006). Asset pricing with higher-orderbeliefs. Working Paper, Indiana University.

Kehoe, T. J. and Meza, F. (2011). Catch-up growth followed by stagnation:Mexico 1950-2010. Latin American Journal of Economics, 48:227–268.

Lanne, M. and Saikkonen, P. (2013). Noncausal vector autoregression. Econo-metric Theory, 29(3):447–481.

Leeper, E. M., Walker, T. B., and Yang, S.-C. S. (2013). Fiscal foresight and infor-mation flows. Econometrica, 81(3):1115–1145.

Lii, K. and Rosenblatt, M. (1982). Deconvolution and estimation of transferfunction phase and coefficients for non-gaussian linear processes. Annals ofStatistics, 10(4):1195–1208.

Lii, K. and Rosenblatt, M. (1992). An approximate maximum likelihood es-timation for non-gaussian non-minimum phase moving average processes.Journal of Multivariate Analysis, 43(2):272–299.

Lobato, I. N. and Velasco, C. (2004). A simple test of normality for time series.Econometric Theory, 20(04):671–689.

Lobato, I. N. and Velasco, C. (2013). Consistent and efficient estimation of lineartime series models. Working Paper, Universidad Carlos III de Madrid.

Mountford, A. and Uhlig., H. (2009). What are the effects of fiscal policyshocks? Journal of Applied Econometrics, 24:960–992.

Ramsey, J. and Montenegro, A. (1992). Identification and estimation of non-invertible non-gaussian MA(q) processes. Journal of Econometrics, 54:301–320.

Rondina, G. (2007). Incomplete information and informative pricing: Theoryand application. Working Paper, University of Wisconsin, Madison.

THE EFFECTS OF THE ACCESS TO TRANSPORTATION NETWORKSON ECONOMIC PERFORMANCE. EVIDENCE FROM MEXICAN

MUNICIPALITIES.

JOSÉ MARÍA RODRÍGUEZ VALADEZ

What is the effect of access to transportation networks on economic performance?This work addresses this relation by studying how access, measured as the short-est distance to the historical transportation network, has affected long run economicperformance across Mexican municipalities. The effects of public infrastructure havebeen studied mainly using a production function approach.This study directs atten-tion to the effect of having access to transportation infrastructure in a different sense.Using a simple model of trade and factor mobility as the theoretical framework, therelation between access and economic performance is explained by how proximityto transportation networks affects differently trade costs of connected and not con-nected regions, ultimately affecting gains from trade. The hypothesis is that distanceto the network is negatively related with economic performance. In order to test thishypothesis, an access measure was constructed. This measure consists on the closestdistance from each Mexican municipality to the network. There is a clear endoge-nous bias treat caused by reverse causality between economic performance and theplacement of transportation infrastructure.In order to avoid this problem and fol-lowing Banerjee et al. (2012), the access measure is constructed taking as referencethe Mexican historical transportation network. The results show that both elasticityof total gross production per capita to distance to the network and elasticity of thenumber of economic units to distance to the network are negative and significant.That is, municipalities that are closer to the transportation network display a bettereconomic performance than those located in further places.

KEY WORDS: Publicly provided goods, regional development, economic inte-gration

1. INTRODUCTION

What is the effect of access to transportation networks on economic perfor-mance? This work addresses this relation by studying how access, measuredas the shortest distance to the historical transportation network, has affectedlong run economic performance across Mexican municipalities.

The effects of public capital on economic outcomes has been studied in eco-nomic literature from different approaches. Most of them have considered pub-lic capital as a factor affecting directly an aggregate production function. Ac-cording to this approach, public capital is considered as a direct input in thisaggregate production function or as a factor affecting aggregate productivity.(Aschauer, 1989; Costa et al., 2006; Holtz-Eakin and Schwartz, 1995, amongothers) The results from these studies have been heterogeneous, finding anelasticity of aggregate product to public capital ranging from highly positivelevels to almost null effect. Both econometric techniques used and location andtime samples taken across studies seem to affect the estimates of this elasticity.

This work assesses the relationship of one specific type of public capital,transportation networks, in a different sense. This study directs attention to theeffect of having access to the transportation network on economic performance.

56 THE EFFECTS OF THE ACCESS TO TRANSPORTATION NETWORKS

Using a simple model of trade and factor mobility as the theoretical frameworkbehind the econometric estimations, the relation between access and economicperformance is explained by how proximity to transportation networks affectsdifferently trade costs of connected and not connected regions, ultimately af-fecting the gains from trade. This fact is explained in the sense that distance tothe transportation network is added to the cost of the tradeable good producedin each region.

The hypothesis that motivates this work is that distance to transportationnetwork is negatively related with economic performance. In order to test thishypothesis, an access measure needed to be constructed. This measure consistson the closest distance from each Mexican municipality to the transportationnetwork. There is a clear endogenous bias threat caused by reverse causalityin the relation between economic performance and the decision of placementof transportation infrastructure. In order to avoid this problem and followingBanerjee et al. (2012), the access measure is constructed, taking as reference theMexican historical transportation network.

This empirical strategy represents a series of advantages. First, the shapeof the historical transportation routes corresponds to the routes shaped by thefirst modern transportation infrastructure in Mexico, which are the railroadsconstructed in the early twentieth century. The shape of the main contempo-rary transportation network, say, the highway system, highly resembles thatfrom the routes shaped by railroads. This fact allows to overcome the endoge-nous bias between placement of transportation network and economic perfor-mance, because the access measurement is defined independently from theeconomic performance of the regions. Second, since this transportation net-work was constructed more than a hundred years ago, this fact may allow totrack the long run effects of being a better connected region at the momentwhen the transportation network was constructed. Finally, this measure maycapture further development of transportation infrastructure, since railroadsare now far from being the main mode of transportation in Mexico, and havebeen substituted by further transportation infrastructure development, whichfollowed the paths drawn by this historical network.

The access measure is constructed following a simple algorithm. Taking asreference the railroads constructed before 1910, which represent the develop-ment of modern transportation infrastructure to Mexico, this access measureis computed as the shortest distance to any municipality where the railroadroutes passed through. To locate the municipalities that gives shape to a route,a straight line linking the connected cities is drawn, allowing to locate the mu-nicipalities within each route. After computing the distances to all routes, theminimum distance to any route is taken as the access measurement.

To test the hypothesis of the relation between access, measured as the short-est distance to the historical transportation network, and economic perfor-mance, an OLS estimation with state and year fixed effects is carried out. Theresults show that both elasticity of total gross production per capita to distanceto the network and elasticity of the number of economic units to distance to thenetwork are negative and statistically significant. That is, municipalities that

JOSÉ MARÍA RODRÍGUEZ VALADEZ 57

are closer to the transportation network display a better economic performancethan those located in further places.

The present work is organized as follows. In Section 2, a literature reviewof the study of the effects of public capital on economic activity is presented. Inthis section some different approaches found in economic literature to assessthe effects of public capital and transportation infrastructure on economic per-formance are discussed. In Section 3, a brief historical context of the birth ofmodern transportation infrastructure in Mexico is presented. In Section 4, thetheoretical framework is developed, which consists of a simple model of tradeand factor mobility. Section 5 presents the econometric specification used toestimate the elasticities of economic performance to the access measure. Thetwo economic performance measures analyzed are per capita total gross pro-duction and the number of economic units. Section 6 shows the results of theeconometric exercise and explains the estimated values for the correspondingelasticities. Section 7 offers some conclusions about the effects of transportationinfrastructure on economic performance.

2. LITERATURE REVIEW

The present work is related to the literature on the economic effects of publiccapital. We understand public capital as large capital intensive natural monopolies,such as highways, other transportation facilities, water and sewer lines, and commu-nication systems (Gramlich, 1994).

Generally, in this section the effects of public capital on economic perfor-mance will be reviewed. This relation has been understood mainly from aproduction function approach. This approach considers public capital a factoraffecting an aggregate production function in two ways, directly, as an input,or indirectly, through the effects that it may have on Total Factor Productivity.Some other body of study has understood the link between economic perfor-mance and transportation infrastructure different from the typical productionfunction approach. There are several linkages through which transportation in-frastructure does affect economic performance such as reduction of trade costsand changes in local markets demand for factors such as skilled labor, amongothers. These different approaches are discussed in this section.

2.1 INFRASTRUCTURE AND THE PRODUCTION FUNCTION APPROACH

The effects of public capital on economic performance have been studied fromdifferent approaches. A great amount of the work made on this topic has beendone from a Production Function Approach (Bom and Ligthart, 2008). Accordingto this approach, public capital plays an important role on the levels of privateaggregate output.

This linkage can be understood in two ways. First, public capital can bethought as an input of an aggregate production function. The idea behind thisapproach is that the contribution of public capital on aggregate private output


is proportional to the stock of public capital. The other way in which pub-lic capital can affect the production levels is as an input affecting multifactorproductivity. This approach is also known as a Growth Accounting Approach.

In this sense, Straub (2008) offers a survey of existing macro level literatureof the relationship between infrastructure and aggregate output. This worksets a general framework that helps to understand the linkages between in-frastructure, output levels and output growth. Discussion on the effect of in-frastructure and growth debate can be framed by starting with a general formof an aggregate production function:

Q = AF (K,L, I(KI)) (1)

where Q is the aggregate output level, A is the total factor productivity F (.) isthe production function which depends on private capitalK, aggregated hoursworked L, public capital KI and an intermediate goods function I(.). Straublists the reasons for introducing the infrastructure capital through an interme-diate product channel instead of a direct input. First, if we introduce KI asa direct input, we assume that services produced by this public-like good areproportional to the stock of infrastructure in a non-rival and non-excludableway. However, this is only partially true since infrastructure is increasinglymediated through market mechanisms and has characteristics of a standardprivate good. This is the reason why its effect could be understood through-out the production of specific services first. Straub argues that another reasonto include an intermediate good mediating infrastructure is that there existssome evidence that costs and prices of infrastructure services do not reflect thefundamentals of the activity, so these goods are not remunerated according toits marginal productivity. According to this, if the unit cost of infrastructure isnot market-determined, it raises the question of the validity of including it asa factor in the production function, as firms are not able to make decisions onthe cost or the amount of infrastructure capital they use. This concern had ledto work where infrastructure is considered as a factor affecting total factor pro-ductivity, because it could influence productivity by lowering costs or througheconomies of scale.1 An adequate framework to represent this relationshipwould be

Q = A(θ,KI)F (K,L, I(KI)) (2)

In this setup, public capital affects aggregate output in two senses. First, as afactor affecting intermediate inputs and second, affecting levels of aggregateproductivity A.

Overall, Straub recognizes that authors have come up with everything fromhighly positive output elasticity values to zero or even negative ones. Heargues that results are not always comparable due to differences in samplesand time periods under study, or because of the econometric techniques used.

1An example of a study using this framework is Cantos et al. (2005). In this study, authorsaddress the relation between public capital and Spanish regional output through the effects ofinfrastructure on TFP.


Nonetheless, this work represents a useful setting for analysing existing litera-ture concerning the topic.2

The first major work that opened a guideline to the study of the effects ofinfrastructure, was Aschauer (1989). Aschauer’s work assesses the relationshipbetween aggregate productivity and the stock and flow of government spend-ing. Using yearly US data about productivity and different categories of publiccapital stocks for years between 1949 and 1985, the author finds a positive andsignificant estimate of 24 percentage points for the productivity elasticity to theaggregate core infrastructure, which includes highways, mass transit, airports,electrical and gas facilities, water and sewers. Aschauer’s results show thatcore infrastructure has the most explanatory power for productivity. The mainconclusion of Aschauer’s article, and one of the reasons of why it gave directionfor future research about the effects of infrastructure on economic performance,is that it suggests that it should be attributed a significant weight to public in-vestment decisions, specifically to the stock of non-military structures such aspublic infrastructure, when assessing the role that the government plays in thecourse of economic performance and productivity improvement.

Aschauer work paved the way for a series of studies that attempted to un-derstand the effects of public infrastructure on aggregate productivity and out-put levels. Hereinafter, I shall examine the main results of some of the worksaddressing the contribution that infrastructure has on output and productivityfrom a production function approach. Eberts (1986) estimates the componentsof public capital stock for 38 metropolitan areas in the years 1958-1981. Thisstudy represents the first attempt to use time series to estimate the effect ofpublic capital stock on regional manufacturing production. In this study, pub-lic capital stock is introduced as an input in a trans-log production function.Eberts finds an elasticity of public capital, measured as water treatment, wa-ter supply, and highways and roads, of 15.5 percent. In another work, Nadiriand Mamuneas (1994) examine the effect of public infrastructure on the coststructure and productivity performance of US manufacturing industries. Here,the authors find that public infrastructure and R&D capitals have a significantpositive effect on productivity, which accounts for 6.8 percent and 9.5 percentrespectively. However, effects on cost structure vary across industries, as wellas the contribution to growth of labor productivity, which also varies acrossindustries and across time. This work results interesting as it tries to identifydifferentiated effects of public capital across industries, arguing that differentkinds of economic activity may require the services delivered by public capitalin different ways.

Some other studies addressing the relationship between infrastructure andeconomic outcomes from a production function approach are Costa et al. (2006),which calculate direct estimates of the association between public capital andeconomic sector output with measures of public capital at the state level. Theresults of this study bring heterogeneous evidence of the effects of infrastruc-

2A similar work that categorize different methodologies used for assessing the impact of trans-portation projects on development is that from Weisbrod (2008).


ture depending on the sector and the state.3

By contrast, there is a set of studies finding no evidence that support the re-lationship between infrastructure and economic development. In this regard,Holtz-Eakin and Schwartz (1995) carried out a study to find the relation be-tween infrastructure and productivity growth. The authors base their study ona neoclassical growth model framework where they explicitly introduce publiccapital in the production function. Using a panel data of the 48 contiguous USstates for the time period between 1971 and 1986, they got an estimated effect ofan average of 1.02 percent across states of public infrastructure on productivitygrowth.4

An interesting analysis of the effects of infrastructure is the one from Bomand Ligthart (2008). The reason this article outstands in the literature comesfrom the fact that it makes a meta-analysis of studies that treats public capitalas an input in the production function. Using data of 76 important studies ofthe effect of public capital on production levels, the authors implement theirstudy to find a global result coming from these heterogeneous individual stud-ies. Their benchmark model is estimated using Weighted Least Squares withrandom effects, finding an aggregate private output elasticity of public capitalof 8.6 percent.

Another body of research work argues the optimal stock and usage of in-frastructure goods in developing countries. In this sense, some authors arguethat developing countries exhibit a lag in the stocks of infrastructure servicessuch as Hulten (1996). Hulten develops an infrastructure efficiency model,which is used to justify that low and middle income countries that use infras-tructure inefficiently pay a growth penalty in the form of a smaller benefitsfrom infrastructure.5 In this sense, we can find a series of research on the in-frastructure gap that is exhibited by Latin America compared to countries thatwere in a similar economic development status some years ago (Calderon andServén, 2003; Easterly and Servén, 2003; Sánchez, 2010). Finally, Estache (2006)provides a survey of recent literature regarding the main current infrastructurepolicy issues in developing countries, finding that one of the main obstacles forcreating an adequate infrastructure policy are data gaps that make evaluationand monitoring a difficult task. Estache concludes saying that the debate on therelevance of infrastructure planning and policy is an essential factor for devel-opment, because lack of adequate infrastructure services slowdown nationaland regional growth.6

3Estimated elasticity values in this work vary across states and across economic sectors. Publiccapital elasticity values range for the manufacturing sector from negative 69 percent to positive43 percent, for non-agriculture sector from negative 32 percent to positive 66 percent and for allsectors from negative 54 percent to positive 82 percent.

4Another work that find little evidence of the importance of publicly provided infrastructureis Barro (1991) where he finds an estimated coefficient of 0.13 of infrastructure on growth butinsignificantly different from zero.

5Peterson (1990) carries out a similar analysis, focusing on the trends of public infrastructureto find that it has been undersupplied. The author argues that infrastructure issues should beanalysed from a political economy approach instead of a pure economic one.

6Analyses on optimal provision of infrastructure services have not been limited to developing


2.2 INFRASTRUCTURE AND OTHER LINKS WITH ECONOMIC PERFORMANCE

Overall, the work described in this section has focused on the effects of in-frastructure on economic development from a production function approach.However, there exists another set of literature that has focused on the effectsof public capital, such as transportation infrastructure, on economic outcomesfrom different approaches. Several authors have analyzed the linkages be-tween access to transportation networks with market integration, reduction ontrade costs, and trade promotion. These relations, in the end, can have a posi-tive effect on economic performance by promoting trade between regions andcountries, because access to transportation infrastructure may ease to boostregions comparative advantages or by changing the dynamics of local labormarkets.

Some studies have focused on how a proper provision of transportationinfrastructure has a positive effect over economic performance through otherchannels different from directly introducing a public capital input in a produc-tion function set up. A group of these studies focuses on how an appropriatetransportation network may reduce transport costs, and ultimately represent apromotion of trade. In this sense, there exits evidence showing that increasedtrade is correlated with higher productivity levels (Alcalá and Ciccone, 2004).

Trade costs are determined by different factors. Trade costs are all costsincurred in getting a good to a final user other than the marginal cost of pro-ducing the good itself. These trade costs include policy barriers, informationcosts, contract enforcement costs and transport costs (Anderson and Van Win-coop, 2004). Transportation infrastructure may be thought as a factor that, ifcorrectly and efficiently provided, may reduce transportation costs associatedwith trade activities.7

The work from Herrendorf et al. (2009) helps to illustrate this fact. Herren-dorf et al. (2009) argue that the high reduction in transportation costs causedby the construction of railways, promoted the settlement of the Midwest ofthe United States, and led to convergence of regional per capita incomes andan increase in GDP per capita in affected regions. Following the same logic,Hummels (2007) argues that international trade grew at an average of 5.9 per-cent per year, and that one of the prominent explanations of this increase is ahuge decline in transportation costs. The decline of transportation costs can beexplained by a decline in shipment costs and technological change.

countries. In this sense, Ihori and Kondo (2000) investigate the effect of disaggregate public capitalprovision in the Japanese economy. Using a set of optimality conditions based on Euler equations,they use GMM to estimate productivity levels of different measures of public capital. Their re-sults show that productivity levels across different public capital sectors are relatively high anddivergent across them, arguing that allocation of public works is not optimal yet in Japan. Themain sector where this provision is not optimal yet is specifically in railways and communicationinfrastructure.

7 Glaeser and Kohlhase (2003) explain that in the last century, transportation costs have de-clined by over 90 percent, arguing that transportation of goods is almost free now compared withpast levels. Even though this conclusion may be considered for certain countries that have anappropriate transportation infrastructure.


Furthermore, Shepherd and Wilson (2006) develop a study based on gravity-model simulations and find that improvements in road quality in strategictrade countries in Europe should represent an improvement in interregionaltrade and represent greater gains in trade than policy actions such as reduc-tions in tariffs or custom procedures. A final example of the importance ofreducing transportation costs is the work from Storeygard (2012). Using an in-novative data set,8 Storeygard finds how economic activity presents a negativeelasticity of 20 percent to transportation costs, measured as distance to mainport cities in Sub-Saharan Africa.

A different research tries to find effects that access to transportation net-works has over economic outcomes such as poverty alleviation. In this re-gard, Dercon et al. (2009) estimate how improvements in road quality and in-creased access to agricultural extension services have an effect on consumptionand poverty in Ethiopia. Using household surveys and a GMM InstrumentalVariable estimator, Dercon finds that increased access to these forms of infras-tructure reduces poverty by 9.8 percentage points and increases consumptiongrowth by 7.1 percent. Likewise, more research has been done on the roleof transportation infrastructure on African poverty alleviation. For example,Buys et al. (2010) argue that poor economic integration as well as isolation fromregional and international markets are one of the identified factors behind lev-els of poverty in Sub-Saharan Africa. Specifically, they argue that a poor trans-port infrastructure is among the deterrents to trade expansion. For that reason,they focus on how upgrading continental road network might be worth seri-ous economic consideration. For this purpose, they use a gravity model frame-work to find that overland trade among Sub-Saharan African might expanddramatically with a majority of the benefits focused on the rural poor popula-tion. In a similar sense, Jacoby (2000) tries to explain the size and distributionof road projects using household level data and a non-parametric approach.The author finds that providing extensive road access to market would confersubstantial benefits on average, and mostly go to poor households. Neverthe-less, he finds that benefits would not be large enough to appreciably reduceincome inequality.

In addition to the reviewed literature of the effects of transportation in-frastructure on poverty alleviation, a different body of studies have focusedon how this type of public capital does affect labor market structures. In thissense, the work from Atack et al. (2008) offers an explanation to the rise ofthe factory in the US in the mid-nineteenth century. These authors argue thatdivision of labor was limited by the extent of the market. Taking as causal fac-tor the transportation revolution brought by rail roads arrival, authors explainhow manufacturing went from artisan to factory production. To prove this,they use manufacturing data at the establishment level linked to county leveldata on rail access, and they carry out a difference in differences analysis andan instrumental variable approach. They find that rail access, defined as a rail

8Storeygard(2012) uses satellite data lights as a proxy for city economic activity and the shortestdistance to active cities and oil prices as a measure of transport costs.


road passing through the boundaries of a county, had a positive and significanteffect on the size of the firm.

In a similar sense, Michaels (2008) assesses the effects of reduced trade bar-riers over labor market structure. In order to do so, the author takes the con-struction of the US interstate highway system as an experiment. This highwaysystem was designed to connect cities and border crossings and also to serveto national defense, but as an unintended consequence it crossed many ruralcounties. Consequently, these counties experienced an increase in trade relatedactivities, which led to an increase in the relative demand for skilled manufac-turing workers in skill-abundant counties, and it decreased elsewhere, consis-tent with Hecksher-Ollin model of trade predictions.

One of the studies that goes further in the understanding of benefits fromtransportation infrastructure projects is Donaldson (2010). An important moti-vation of this article is that in 2007, twenty percent of World Bank funds weredestined to transport infrastructure projects. However, Donaldson et al arguethat there is still a lack of empirical understanding for the extent in which thiskind of projects reduce transaction costs. In this work, Donaldson recollectsdata from colonial India where the Raj, the rail road network in India, wasconstructed.

Guided by a Ricardian general equilibrium model, Donaldson finds outthat rail roads in India produced mainly four results: (i) a reduction in tradecosts and reduction in interregional price gaps (ii) an increase in interregionaland international trade, (iii)an increase in real income levels and (iv) a suffi-cient statistic for the effect of rail roads on welfare in the model accounts forvirtually all of the observed reduced form impact of rail roads in real income.The idea behind these results is that rail road network did improve the tradingactivities and were translated into welfare gains, and these welfare gains camepredominantly because this new transportation infrastructure allowed regionsto get gains from trade. The way in which Donaldson assesses the issue causedby potential correlation between project placement and unobserved changesin the local economic environment is the following. He estimates the effect ofover 40,000 km of rail road lines that reached advanced stages of surveying butwere never actually built. These control lines never display spurious effects.

2.3 ACCESS TO TRANSPORTATION NETWORKS

This section will describe the research from Banerjee, Duflo and Qian (2012).In their work, the authors investigate the effects that access to transport net-works has over regional economic outcomes in China. In order to overcomethe potential bias caused by an endogenous relation between transport net-works and economic outcomes, the authors track the historical road networksand construct a distance measure to these lines as their explicative variable.9

9The birth of modern transporation infrastructure in China took place in the late nineteenthcentury and early twentieth century, with the construction of the railroad system, which werebuilt to promote Western economic and military interests in China, to connect ports with historicalcities and to connect Colonial cities outside China (Banerjee et al., 2012, p. 16,17).


The main question in this work is whether areas that are quasi-randomlyassigned to have a better access to the transportation network consequentlyhave better economic outcomes in the long run. The logic of this relation isthat it is expected that a better access improves the income levels of the averageregion, due to a promotion of economic activity.

There are three important points to consider in the relation established bythe authors. First, this study is focused on the long run effects that access totransportation networks has over economic performance. This is, more thanfocusing on trade and price changes caused by proximity, what authors areconsidering is how location patterns are affected on the long run. Second, theirstudy pays attention on the effects that access has in the average location, notthe effects that access has on connected cities. The interesting access effectsraise in the average locality that quasi-randomly experiences the effects of beingcloser or distant to the transportation network. Finally, effects of access maybe different according to what extent does a location already has access whenthey construct a new transportation facility. In other words, there are expecteddiminishing returns from an infrastructure project.

As a theoretical framework, the authors develop a model of trade and factormobility10 where they explicitly introduce a distance parameter. This param-eter is used to model how does access to transport networks affects economicperformance. This model offers three main results. First, output per capitais higher and inequality lower in the better connected areas. Second, mod-elling trade opening as an increase in the exportable goods’ price, would riseincome levels everywhere in proportion and hence have no differential growtheffect across regions. And third, extending the model to incorporate agricul-ture, trade opening is likely to benefit better connected areas in terms of growthrates.11

Finally, authors investigate empirically the effects that access has over dif-ferent economic measures. Their economic outcome variables of interest areGDP per capita, annual GDP per capita growth, median village household in-come, Gini index, firm location and aggregate firm outputs, all of them at thecounty level. Their results show that GDP levels, number of firms12 , firm prof-its, and Gini index are negatively correlated with distance to transportationnetworks. These results are consistent with their model’s predictions.

The theoretical framework proposed by Banerjee et al. (2012) as well as theeconometric strategy are used in the present work. The differences betweenBanerjee et al. (2012) and the present work are first, in this work I am going tofocus on the complete railroad routes that were constructed before 1910 whilethe base work uses a limited set of straight lines, in the beginning, for then

10This model is presented in detail in section 4.11These model results hold under the assumption that capital is more mobile than goods.12In this sense Kawamura (2001) provides an study where he assess whether the location pat-

terns with respect to transportation facilities have changed in time. Using two time period sampleshe finds that business in 16 Chicago counties have tended to move closer to freeway ramp. He ar-gues that urban core areas are more concerned about access to transportation facilities, some torail stations while other kind of business to freeway ramps. His findings indicate a shift in theimportance of transportation access for business location choice.


using an extended set of lines. Second, instead of focusing on variables suchas county GDP and other measures such as firm profits and Gini index, inthis work I focus on municipality total gross production levels and growth, aswell as number of economic units, both of them coming from firm data at themunicipality level.

3. HISTORICAL CONTEXT: THE MEXICAN RAIL ROAD NETWORK

This section presents a brief context of the birth of modern transportation in-frastructure in Mexico, the conditions of transport networks in Mexico beforethe arrival of the railroad system, the delay of Mexico in this regard, the logicbehind the lines built, and some data related to this infrastructure project.

The present study wants to answer if access to transportation networks hasany positive effect on economic outcomes. In Banerjee, Duflo and Qian (2012),the authors address this question by identifying the Chinese railroad construc-tion as the arrival of a modern transportation network, which integrated theChinese regional markets for the first time. This railroad project took place inthe late nineteenth century, and it represents a useful way to understand howaccess to this first transportation network affected economic performance. Theconstruction of this network took place several decades before the time periodof analysis of the authors, since they investigate effects of access in a period offast economic growth in China, in the second half of the twentieth century. Thisfact represents a series of methodological advantages for their study. These ad-vantages will be discussed later.

Since the purpose of the of the present work is to investigate how doesaccess affects economic performance of Mexican municipalities, attention willbe focused on the construction of the Mexican railroad network which tookpart in the late nineteenth century and the early twentieth century. The Mex-ican case has a lot of similarities with the Chinese one in their transportationnetwork experiences. One of these similarities refers to the previous lack ofan integrated transportation infrastructure that both China and Mexico had.This situation had consequences like the lack of an integrated national marketdue to the non-existence of connection between regions. One of the reasonsthat explain this situation was the absence of domestic savings, resulting ina scarcity of resources that hindered capital accumulation and productive in-vestement. The Mexican economy presented features of its old regime suchas a slow population growth, a majority of people living in rural areas, andan archaic transport system that promoted the national market fragmentationand that hindered the labor specialization across regions. Road transportationcosts were so high that they inhibited long distance trade. The road conditionswere deplorable and this situation could be understood due to the mentionedlack of investment, and a steady erosion caused by environmental conditions.(Kuntz Ficker, 2010)

Mexico experienced a delay in railroad development in two senses. First,it had a lag compared with other Latin American countries such as Argentina,


Brasil and Chile, which experienced the arrival of the transport revolution twodecades before. Second, Mexico development plans did not focus much atten-tion to the construction of an integrated transportation system since it grantedits first rail construction concession in 1837, but it was until 1873 when the firstrailroad route was constructed.

The first railroad to be constructed was the one connecting Mexico and Ve-racruz. However, this railroad did not change the lack of integration at all,since the use of this route was expensive and it covered a very limited region.This situation started to change with the arrival of the Porfiriato, in 1880, sincethe current government had a developmnet plan that included the construc-tion of a national railroad system. Mexico’s financial capabilities to performthe construction of this infrastructure project were limited. This fact forced thegovernment to allow foreign investment inflows to fund the project. Duringthe period between 1880 and 1910 nearly 20,000 kilometers of railroads wereconstructed, positioning Mexico’s rail network as the second largest in LatinAmerica. (Riguzzi, 1996)

One of the direct and immediate consequences of railroads constructionwas an increase in trade. Trade volumes grew between 1880 and 1910 at ayearly rate of more than 10 percentage points. Another effect of the rail ex-pansion was a clear reduction in transportation costs, a tendency to commodifyagricultural products, as well as a concentration of the manufacturing activi-ties in some set of cities. Overall, one of the main consequences of the arrivalof this railroad network was the reshaping of the economic space at the local,state, region and national levels. Places that were closer to this railroad net-work experienced a change in their relative economic importance since theywere inserted into the trade dynamics brought by the railroads. In an oppo-site sense, places that were further away experienced the arrival of the railroadnetwork as a negative shock, since local productive factors tended to move toplaces closer to the network. (Kuntz Ficker, 1995)

According to Coatsworth et al. (1984), effects of the arrival of railroads oneconomic performance, from a social savings approach13, represent a saving ofresources in the freight sector of between 24.6 percent and 38.5 percent approx-imately (Coatsworth et al., 1984, p.87) and a negligible effect in the passengertransport sector of 1 percent and 0.33 percent of 1910 Mexico’s GDP, for thefirst class and lower classes sectors, respectively, (Coatsworth et al., 1984, p.63)

13The concept of social savings has a key importance in the literature related to the economic ben-efits from transportation projects, specifically used in the analysis of railroads. Social savings to thesaved resources by the society due to the construction of this kind of infrastructure, compared withthe situation where no project was constructed at all (Coatsworth et al., 1984, p. 17). Transporta-tion projects have related social savings because it reduces transportation costs, making it cheaperto transport both people and goods. In this sense, there exists some works that estimate the socialsavings generated by railroads, such as the important study carried out by Fogel (1962) where heargues that US railroads generated savings of around 2.7 percent of US GNP in the early nineteenthcentury. A more recent version of this study, which makes use of a newfangled methodology isthe one from Pérez Cervantes (2013), where he estimates social savings from US railroads, usinga model of optimal railroad location, that social savings from this railroads represented nearly 7.9percent points of US GDP, and a 10.3 percent points from a set of suggested optimal railroad lines.


Moreover, if we take into account the social savings in the passenger trans-portation, the gains from railroads were mostly concentrated in the high in-come levels. This fact can be explained because low income population contin-ued to travel by carts, since railroads still represented an expensive alternativefor mobility.14

Overall, the construction of railroads during the period between 1880 and1910 represented the arrival of the first modern transportation infrastructure tothe Mexican territory. Notwithstanding these railroads were constructed as anattempt to integrate Mexican markets, the shape of these railroad routes did notfollow economic objectives but mostly were constructed in a random fashion.Integration of the Mexican market came as an unintended consequence of theconstruction of this network, but it is important to say that the objective thatcould be tracked in the shape of the constructed routes was this attempt tostablish links with the American market. (Riguzzi, 1996, p. 71)

4. THEORETICAL FRAMEWORK: A MODEL OF TRADE AND FACTOR MOBILITY

According to Banerjee, Duflo and Qian (2012) transportation infrastructure canbe advantageous for economic development due to multiple reasons. First, itplausibly reduces trade costs and promotes market integration. Second, it pro-motes factor mobility. Third, it is easier to take advantage of opportunities forinvestment in human capital. And fourth, there are some intangible benefits, asfreer movement of people and goods that may promote the flow of new ideasand information about new technologies.

In this section a simple model of trade and factor mobility is presentedwhere distance affects mobility of both factors of production and goods ex-ports. This model shows how does access to infrastructure affects economicoutcomes, depending on how does it affects factors and goods mobility.15

4.1 MODEL ASSUMPTIONS

There are M +N + 1 regions in this economy: M distant regions, N connectedregions and 1 big city or metropolis. Each region produces one good exclu-sively for export which could be the same as or different from the good thatit imports, and another good which it consumes. These goods could be eitheridentical or differentiated.

They key assumption of this model is that the relative price of the exportablegood in terms of the importable one, in the world market, is the same, p. How-ever, distance to the market adds to the cost. We model this by assuming thatthis transportation cost is increasing in distance from the market, such that theprice received by the exporters is p in the metropolis, p(1−d1) in the connectedregion and p(1− d2) in the distant regions, where d2 > d1.

14This fact may be understood as low mobility of labor, which will be one of the assumptions ofthe trade and factor mobility model presented later.

15This model is developed in Banerjee et al. (2012).


A population of firms carries out the region’s production. In order to pro-duce, firms uses two inputs: labour and capital. Output of the exportable goodis given by

Q = AKαL1−α(K)β (3)

where K is the average level of K in firms in that region, allowing in this wayfor spillovers from allocation. Spillovers can not be so large since diminishingreturns are kept (i.e. α+ β < 1). Levels of A could vary across regions in orderto capture how the flow of ideas may be different across regions.

One of the main assumptions behind this model is related with factor mo-bility. It is assumed that labor does not move: the city has an endowment oflabor ofL∗while all the other regions have an endowment ofL′. However, cap-ital can move but facing a cost. It is assumed that, in equilibrium, movement ofcapital is from the regions to the city. Hence, if the rental rate of capital in thecity is r, the opportunity cost of capital in the connected regions is r(1 − ρd1)and in the distant region is (1 − ρd2). This fact shows that the further regionsface a higher cost of sending capital to the city, so these regions may be willingto accept a lower return on capital if it is invested in the same region. In thissense, connected regions have more incentives to send their capital to the citycompared to the distant regions, since their opportunity cost is higher, that isr(1 − ρd1) > r(1 − ρd2). In the case when ρ equals to zero, capital is perfectlymobile and each region will send all its capital to the city. As ρ increases, the in-centives for the regions to keep part of their capital increases. When ρ is equalto

(1d2

)the distant regions have no incentives to send capital to the city, the

same for the connected regions if ρ is equal to(

1d1

).

4.2 PROBLEM SOLUTION

For a firm located at a distance d, the profit maximization is defined as

maxL,K

Π = PQ− wL− r(1− ρd)K

⇒ maxL,K

p(1− d)AKαL1−α(K)β − wL− r(1− ρd)K

Solving for L and K leads to the following first order conditions

w = p(1− d)A(1− α)

(K

L

)α(K)β (4)

r(1− ρd) = p(1− d)Aα

(L

K

)1−α

(K)β (5)

where w is the wage rate in that type of region, L is the labor endowment, Kis the equilibrium amount of capital invested in a firm in that region and d isthe corresponding distance variable, which takes a value of d = 0 for the city,d = d1 for connected regions and d = d2 for distant regions. Capital marketclears according to the following condition

MKD +NKC +KM = K (6)


where KD,KC , KM are the average amount of capital used per firm in eachregion region (where D stands for distant region, C is for connected and M isfor the metropolis or city). Using the fact that K = K and L = L′ for regionsdifferent from the city, the capital demand condition can be manipulated asfollows,

K1−α−β =

(p(1− d)

r(1− ρd)

)Aα(L′)1−α (7)

which tells that whether the distant regions or the connected have more capitalper firm depends on whether the ratio (1−d)

(1−ρd) is increasing or decreasing in d.16

If ρ > 1, the ratio (1−d)(1−ρd) is increasing in d, then the distant region will actually

have more capital per worker compared to the connected region since d1 < d2.(i.e. K0 < K1 if K0 is the equilibrium amount of capital in the connectedregion and K1 the equilibrium amount of capital in the distant region) Thiscase takes place when capital is less mobile than goods and hence, more distantregions have less incentives to move their capital to the city. On the other hand,when capital is more mobile than goods, that is, when ρ < 1 the ratio (1−d)

(1−ρd) isdecreasing in distance. This means that better connected regions have a higheramount of capital compared with less connected ones since d1 < d2 and K isdecreasing with distance.(i.e. K0 > K1)

The wage-rental ratio will be taken as a measure of inequality, followingtraditional trade models. When this ratio is equal to one, there is no inequality,and as this ratio decreases, there exists a higher inequality since returns to laborare lower compared to returns to capital. 17

w

r(1− ρd)=

(1− α)

α

(K

L′

)(8)

This equation shows that inequality is higher (the left-hand side is lower) wher-ever K is lower (the right-hand side). That is, in the case where capital is lessmobile than goods,(ρ > 1 andK0 < K1) the further region is able to retain moreof its capital, and this is translated into less inequality in this region, since(

w

r(1− ρd0)

)0

=(1− α)

α

(K0

L′

)<

(1− α)

α

(K1

L′

)=

(w

r(1− ρd1)

)1

(9)

In the case where capital is more mobile than goods (ρ < 1 and K0 > K1) theincentives to send capital to the city are higher, so the further regions movemore of their capital, and this is translated into less inequality in the betterconnected regions, since(

w

r(1− ρd0)

)0

=(1− α)

α

(K0

L′

)>

(1− α)

α

(K1

L′

)=

(w

r(1− ρd1)

)1

(10)

16This ratio represent the relative mobility of goods compared with capital mobility. When ρ > 1capital is more mobile than goods, and the opposite when ρ < 1.

17This measure of inequality is taken from Banerjee et al. (2012), and eventhough there couldbe very egalitarian societies with low return to labor, this measure could be understood as aninequality between different factor-intensive sectors.


Lastly, we can analyze the output per capita levels

y = p(1− d)A

(1

L′

)α(K)α+β (11)

which can be rewritten as

y = p(1− d)A

((1

L′

)α(p(1− d)

r(1− ρd)

)Aα(L′)1−α

) α+β1−α−β

(12)

In the case where ρ < 1 , y is decreasing in d. In the case when ρ > 1, wemay observe the reverse result, since further regions retain most of their capitalendowment.

4.3 MODEL PREDICTION

There is a main result than can be extracted from this model.Output per capita and inequalityIf capital is more mobile than goods, output per capita will be higher and in-

equality lower in the better connected region. In the opposite case, when goodsare more mobile than capital, the more distant region will have less inequalityand the difference in per capita output between regions will be smaller.

5. EMPIRICAL STRATEGY

The main motivation of this work is to investigate if access to transportationnetworks has had any effect on Mexican economic outcomes. According tothe model previously described, it is expected that regions with more access totransportation networks have had a better economic performance, comparedto the further ones. Regions that are closer to the transportation network maytake advantage of trade opportunities at a lower cost, compared to those thatdo not have these facilities. Since trading is cheaper as long as the region islocated near the transportation network, we expect to find a positive relationbetween access to the transportation network and economic performance. Thisrelation can be stated as follows.

H0: Distance to transportation networks is negatively related with economic per-formance

5.1 DATA

For testing the previous hypothesis, several sources were consulted.Economic Measures. There will be used two measurements of economic per-

formance in this study. One of them is the Total Gross Production in thousandsof pesos (current currency) and the second is the number of Economic Units.both of them at the municipality level. These measurements are taken fromMexican Economic Census18.

18Censos Económicos for years 1999, 2004 and 2009, - Instituto Nacional de Estadística y Geografía(INEGI).


The Total Gross Production measure represents the value of all goods andservices coming from economic activities as a result of the operations of theeconomic units, including the marketing margin of resold goods by the firms.It includes the production that was not released to the market because it wasin some production process or awaiting for customers, and the production offixed assets for the usage of the economic unit. This value is reported at pro-ducer current prices.19 It could also be defined as the amount that the pro-ducer could earn from the purchaser minus the value added tax, charged tothe buyer.20

This economic performance measure will be transformed to per capita terms.In order to do this, population levels are needed for the corresponding years.Since Mexican Population Census and Counts21 were risen in 1995, 2000, 2005,and 2010, levels for the needed years are estimated by computing the averageyearly population growth rate between 1995-2000, 2000-2005 and 2005-2010.These growth rates are used to compute the values for 1999, 2004 and 2009.

The second economic performance measure, Economic Units refers to thenumber of economic units that are located in each municipality, and that reporta measurement of production. This units are entities that produce any kind ofgoods or services.

Distance Measures. To construct the distance measures, a simple algorithmthat links the geographical coordinates of each Mexican municipality with theregion of interest is used. There are four distance measures used in this work,namely, the shortest distance to the transportation network, the shortest dis-tance to connected cities,22 shortest distance to coastline and shortest distanceto the US border. The algorithm used to construct this measures is described indetail in Appendix D.

The explicative variable of interest is the access to transportation network,which is measured as the closest distance to any municipality where an his-

19This economic measure is not in real but in nominal(current) terms. Even though, this pro-duction measure values are reported at producer prices. It can be expected that producer pricesare less affected to inter-region price variance, compared to the case where they were reported atconsumer prices. Inter-regional producer prices can be attributable to inter-region trade costs morethan local factors affecting them. To construct a constant(real) measure of regional gross produc-tion, it would be necessary a deflator at the municipality level. This exercise is out of the scope ofthe present work.

20These data comes at the economic sector level for each municipality. There are some economicsubsectors that do not have reported information in this Census, such as agriculture and forestryand communication services. In order to have consistent measure of Total Gross Production acrossyears, the retrieved data contains subsectors that are available for all the time periods consulted.

21Censos Poblacionales (2000, 2010) and Conteos Poblacionales (1995, 2005) - Instituto Nacional deEstadística y Geografía (INEGI) retrieved from Sistema Nacional de Información Municipal, (SNIM) ofthe Instituto Nacional para el Federalismo y Desarrollo Municipal (INAFED).

22Connected cities are defined as all cities which were connected by historical transportationroutes. For example, there is a route connecting Mexico and Veracruz. This route may pass throughseveral municipalities, but the connected cities, in this case, are Mexico and Veracruz. The reasonto include this variable is because, by construction, a municipality that is near a connected city iscloser to the transportation network. We will control for both distances to avoid getting biasedestimates for the access to transportation network elasticity.


torical transportation route passed.23 As it has been previously discussed, thetheoretical justification for using this variable as the measurement of a munic-ipality access to the transportation network is the following. The routes takenas reference are historical transportation routes. There is a potential endoge-nous relation caused by a reverse causality problem in the linkage betweeneconomic outcomes and the location of transportation infrastructure. This po-tential threat arises because it can be argued that transportation infrastructureis constructed to connect rich and developed cities. Due to this reason, wetake the routes from the rail roads constructed in the early twentieth centuryas the shape that gave birth to modern transportation infrastructure in Mexico.Although nowadays railroads are far from being the main mode of transporta-tion, the shape of the transportation routes outlined by this first network arehighly kept by contemporaneous highways routes (See Appendix C). To putthis clearer, if it is thought that there is an endogenous relation between eco-nomic performance and the location of transportation facilities it can be arguedthat location of contemporaneous highways may follow the logic of the need toconnect contemporary prosperous regions, but it can not be argued that con-temporaneous levels of prosperity determined the location of rail roads con-structed a century ago. Following this logic, the routes outlined by railroadroutes are taken as the framework that transportation network followed fromits beginning in order to construct the main independent variable and over-come the possible endogenous relation previously described.24

Other control variables. There are some other variables that will be used inthis analysis and that are related to municipality specific characteristics. Oneof them is the road length. This variable accounts for the kilometres of roadsconstructed in each municipality. Since the objective of this study is to see howtransportation infrastructure affects economic performance, another importantmeasure of transportation infrastructure is the road length. Since we are alsocontrolling for the municipality’s area, controlling for road length will give anapproximate estimate of how road density affects economic performance. Itis more interesting to include this variable instead of including the distancemeasure to a road, and this is because access, measured as shortest distance, toa road facility should have a really low variance across observations.25

23The complete methodology for the construction of the access variable can be found in Ap-pendix D.

24There exists a possibility that this measure may be capturing the combination of reverse causal-ity and persistence: initially rich regions had better access to railway and initially rich regions arerich today. A typical check to carry out this possibility may be to check if the constructed measureof access to transportation infrastructure is not correlated with pre-railway period variables. Thatwould require to look for the data more than 100 years ago at the municipality level, but that checkis not carried out in the present work.

25For a complete set of summary statistics and the distribution of the access variable see Ap-pendix A.


5.2 ECONOMETRIC SPECIFICATION

Following Banerjee et al. (2012), the main estimating equation is specified asfollows

Ymst = βDistancems + ΦXms + ηs + γt + εmst (13)

where Ymst is the measure of economic performance in municipality m, states, and year t; Distancems is the measure of access, Xms is a vector of controlvariables; ηs and γt are state and year fixed effects26, respectively and εmst isan error term. All of the variables are in natural logarithm form.

According to the theoretical framework, it is expected that the estimated pa-rameter for Distance to be negatively related (i.e. β < 0) with the correspond-ing economic performance measurement, that is, per capita total gross produc-tion, (Banerjee et al., 2012) and economic units (Banerjee et al., 2012; Kawamura,2001).

Estimation results are presented in the following section.

6. RESULTS

In this section the econometric results of the elasticities of economic perfor-mance measures to the access to transportation networks measurement will bediscussed.27 Table 1 and Table 2 report estimated elasticities of total gross pro-duction per capita for municipalities to access. Table 3 and Table 4 report theestimated elasticities for economic units to the access variable. The differencebetween the two pairs of estimations is that Table 2 and Table 3 contains esti-mates of a restricted sample where the number of observations is steady acrossspecifications. The first column reports the estimates including the linear andquadratic form of the access variable as well as state and year fixed effects.Across specifications, control variables are added.

The set of control variables included are the distance to the closest con-nected city (CONNECTED), the closest distance to the coast (COAST), the clos-est distance to the US border (BORDER), as well as the municipality’s area(AREA) , road length (ROAD) and altitude (ALTITUDE). All variables are innatural logarithm form, and control for both linear and quadratic form.

6.1 TOTAL GROSS PRODUCTION AND ACCESS

The estimates for the access variable are significant at the 99 percent level for allspecifications and are of the expected sign. That is, as our hypothesis stated,distance to the network is negatively related with economic performance, inthis case with per capita levels of gross production. The estimates have some

26Eventhough it is considered a fixed effects for state, there could exist a ommited variable bias.In particular, it would be interesting to investigate whether better connected localities have his-torical characteristics such as population that is more prone to trade, that could explain part ofthe results. It would be interesting to develop further research in this issue, having appropiateinformation at the municipality level.

27For a complete description of the variables considered in the models, see Appendix A.


variation across specification, having an estimated elasticity of negative 6.14percent in the complete specification. The quadratic term is as well significantat the 99 percent level. These estimates show that as distance to the trans-portation network increases, the levels of production decrease in a decreasingfashion. To understand the magnitude of the estimates, a comparison across re-gions can be established. According to the distribution of the access variable,28

the 75th location is 2.1 times further than the median location so, using the es-timated values for the access elasticities it can be argued that the difference ofoutput between these regions is 17.86 percent. That is, the 75th location has alower output of 17.86 percent compared with the 50th location, and this differ-ence can be explained only by the access to the transportation network. In thissense, comparing the 25th location with the 75th one, which is 4.8 times further,the difference in output levels attributable to access is of 53.51 percent.29

Similarly, results show that distance to a connected city has a significantand negative effect of 20.2 percent. This means, being closer to a connectedcity explains better economic performance. An effect in the same direction isfound for distance to the border. Results show a negative relation betweendistance to the border and production of 5 percent. These results explain thatconnectivity to different regions does affect levels of production, but the effectof being closer to a connected city is the one with the greatest effect. This resultdenote that it is more important to be close to a trade pole, such as a connectedcity where local production can be trade or even exported, than directly beingclose to the border.

6.2 ECONOMIC UNITS AND ACCESS

The number of economic units is negatively related to the distance to the net-work. The estimated elasticity of economic units to distance to the network isnegative 7.02 percent. This effect also shows a decreasing fashion. Both lin-ear and quadratic terms are significant at the 99 percent level. To illustrate themagnitude of these effects, a similar exercise as previous section will be done.In terms of distance, the 75th location is 2.1 times further than the median one.According to this difference, it is expected that the 75th location has 19.25 lesseconomic units compared with the 50th one, attributable to the access to thenetwork. In this same sense, the 75th location is 4.8 times further than the

28See Appendix A.29Differences in output are computed according to the following marginal effects:

∆Y = β1

(∆Distance

Distance0

)+ 2β2ln(Distance0)

(∆Distance

Distance0

)where β1 is the estimate of the linear effect of the Distance variable and β2 is the quadratic formestimate.

In a similar sense, the point elasticity can be calculated according to the following expression

εY,A =∆Y

∆A

A

Y=

%∆Y

%∆A= (β1 + 2β2lnA)


25th one, leading to 58 percent less economic units in the further region andattributable to the access measure.


TAB

LE

1:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

PR

OD

UC

TIO

NL

EV

EL

S:C

OM

PL

ET

ESA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(T

otal

Gro

ssPr

oduc

tion

(per

capi

ta))

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

-.127

9***

-.086

0***

-.078

7***

-.078

0***

-.076

3***

-.071

8***

-.061

3***

-.061

4***

(.012

2)(.0

120)

(.012

2)(.0

121)

(.012

1)(.0

135)

(.012

4)(.0

136)

DIS

TAN

CE2

-.021

6***

-.011

4***

-.013

7***

-.013

8***

-.014

0***

-.015

9***

-.010

9***

-.013

3***

(.003

5)(.0

035)

(.003

6)(.0

035)

(.003

4)(.0

040)

(.003

4)(.0

039)

CO

NN

ECT

ED-.2

173*

**-.2

238*

**-.2

229*

**-.2

372*

**-.1

838*

**-.2

492*

**-.2

022*

**(.0

288)

(.028

2)(.0

283)

(.027

7)(.0

331)

(.027

8)(.0

333)

CO

NN

ECT

ED2

-.023

1***

-.024

2***

-.024

2***

-.026

5***

-.020

3***

-.028

5***

-.023

4***

(.003

6)(.0

036)

(.003

6)(.0

036)

(.004

4)(.0

036)

(.004

5)C

OA

ST-.1

377*

**-.1

376*

**-.1

233*

**-.1

106*

**-.0

097

.012

3(.0

146)

(.014

3)(.0

150)

(.018

6)(.0

218)

(.023

9)C

OA

ST2

-.000

8-.0

012

-.000

3-.0

021

.011

1***

.010

6**

(.004

0)(.0

040)

(.004

0)(.0

053)

(.004

2)(.0

048)

BOR

DER

-.072

7***

-.062

7***

-.070

5***

-.046

8**

-.054

2**

(.025

7)(.0

238)

(.025

0)(.0

224)

(.023

5)BO

RD

ER2

.003

3.0

065

.004

7.0

083

.005

9(.0

074)

(.007

2)(.0

073)

(.006

8)(.0

068)

AR

EA-.3

582*

**-.3

333*

-.429

7***

-.453

0***

(.099

9)(.1

706)

(.098

4)(.1

687)

AR

EA2

.044

6***

.027

3*.0

499*

**.0

354*

*(.0

091)

(.014

4)(.0

089)

(.014

1)R

OA

D.0

941

.054

0(.1

567)

(.152

8)R

OA

D2

.015

2.0

221

(.018

5)(.0

181)

ALT

ITU

DE

-.913

5***

-.916

1***

(.177

1)(.1

991)

ALT

ITU

DE2

.057

1***

.056

8***

(.015

9)(.0

190)

Obs

erva

tion

s69

5969

5969

5169

5169

5140

2469

5140

24

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


TAB

LE

2:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

PR

OD

UC

TIO

NL

EV

EL

S:R

EST

RIC

TE

DSA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(T

otal

Gro

ssPr

oduc

tion

(per

capi

ta))

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

-.110

2***

-.084

0***

-.076

3***

-.075

5***

-.076

4***

-.071

8***

-.066

5***

-.061

4***

(.013

2)(.0

131)

(.013

7)(.0

135)

(.013

4)(.0

135)

(.013

5)(.0

136)

DIS

TAN

CE2

-.019

8***

-.013

4***

-.015

1***

-.015

2***

-.016

3***

-.015

9***

-.013

6***

-.013

3***

(.003

8)(.0

040)

(.004

1)(.0

040)

(.003

9)(.0

040)

(.003

9)(.0

039)

CO

NN

ECT

ED-.1

653*

**-.1

642*

**-.1

619*

**-.1

851*

**-.1

838*

**-.2

032*

**-.2

022*

**(.0

359)

(.034

2)(.0

342)

(.033

7)(.0

331)

(.033

9)(.0

333)

CO

NN

ECT

ED2

-.017

3***

-.017

4***

-.017

1***

-.020

5***

-.020

3***

-.023

6***

-.023

4***

(.004

7)(.0

046)

(.004

5)(.0

045)

(.004

4)(.0

046)

(.004

5)C

OA

ST-.1

345*

**-.1

340*

**-.1

177*

**-.1

106*

**.0

013

.012

3(.0

176)

(.017

1)(.0

184)

(.018

6)(.0

238)

(.023

9)C

OA

ST2

-.002

9-.0

037

-.002

4-.0

021

.009

6*.0

106*

*(.0

054)

(.005

4)(.0

053)

(.005

3)(.0

049)

(.004

8)BO

RD

ER-.0

758*

**-.0

668*

**-.0

705*

**-.0

504*

*-.0

542*

*(.0

257)

(.024

2)(.0

250)

(.022

6)(.0

235)

BOR

DER

2.0

044

.007

0.0

047

.008

2.0

059

(.007

4)(.0

072)

(.007

3)(.0

068)

(.006

8)A

REA

-.222

8-.3

333*

-.353

6**

-.453

0***

(.151

5)(.1

706)

(.149

2)(.1

687)

AR

EA2

.032

2**

.027

3*.0

425*

**.0

354*

*(.0

125)

(.014

4)(.0

123)

(.014

1)R

OA

D.0

941

.054

0(.1

567)

(.152

8)R

OA

D2

.015

2.0

221

(.018

5)(.0

181)

ALT

ITU

DE

-.941

1***

-.916

1***

(.196

4)(.1

991)

ALT

ITU

DE2

.060

5***

.056

8***

(.018

8)(.0

190)

Obs

erva

tion

s40

2440

2440

2440

2440

2440

2440

2440

24

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


TAB

LE

3:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

EC

ON

OM

ICU

NIT

S:C

OM

PL

ET

ESA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(E

cono

mic

Uni

ts)

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

-.129

4***

-.095

0***

-.088

6***

-.087

9***

-.088

6***

-.075

3***

-.080

9***

-.069

9***

(.015

9)(.0

146)

(.014

0)(.0

139)

(.013

5)(.0

141)

(.013

5)(.0

140)

DIS

TAN

CE2

-.014

0***

-.007

3*-.0

094*

*-.0

097*

*-.0

137*

**-.0

150*

**-.0

118*

**-.0

135*

**(.0

044)

(.004

2)(.0

043)

(.004

2)(.0

040)

(.004

8)(.0

040)

(.004

8)C

ON

NEC

TED

-.169

0***

-.177

9***

-.177

0***

-.246

4***

-.203

6***

-.251

0***

-.212

6***

(.032

0)(.0

311)

(.031

2)(.0

295)

(.036

5)(.0

295)

(.036

6)C

ON

NEC

TED

2-.0

099*

*-.0

112*

**-.0

111*

**-.0

204*

**-.0

160*

**-.0

213*

**-.0

176*

**(.0

039)

(.003

9)(.0

039)

(.003

7)(.0

048)

(.003

8)(.0

048)

CO

AST

-.136

5***

-.135

9***

-.102

3***

-.086

4***

-.052

3**

-.025

5(.0

186)

(.018

2)(.0

181)

(.021

0)(.0

233)

(.025

3)C

OA

ST2

-.001

7-.0

024

.001

6-.0

005

.006

0.0

048

(.005

2)(.0

053)

(.005

1)(.0

064)

(.005

5)(.0

065)

BOR

DER

-.102

4***

-.088

6***

-.113

7***

-.082

6***

-.106

6***

(.031

5)(.0

282)

(.030

8)(.0

276)

(.029

9)BO

RD

ER2

.014

3.0

193*

*.0

133

.019

4**

.013

5(.0

094)

(.008

8)(.0

091)

(.008

7)(.0

089)

AR

EA.2

263*

.586

9***

.184

4.5

137*

*(.1

156)

(.197

5)(.1

154)

(.199

9)A

REA

2.0

191*

-.041

8**

.022

9**

-.036

0**

(.011

0)(.0

168)

(.010

9)(.0

169)

RO

AD

-.803

2***

-.793

7***

(.211

7)(.2

105)

RO

AD

2.1

442*

**.1

444*

**(.0

236)

(.023

6)A

LTIT

UD

E-.5

956*

**-.6

098*

**(.1

766)

(.197

4)A

LTIT

UD

E2.0

445*

**.0

448*

*(.0

160)

(.018

5)O

bser

vati

ons

5617

5617

5609

5609

5609

3658

5609

3658

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


TAB

LE

4:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

EC

ON

OM

ICU

NIT

S:R

EST

RIC

TE

DSA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(E

cono

mic

Uni

ts)

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

-.113

9***

-.084

3***

-.078

0***

-.077

2***

-.084

3***

-.075

4***

-.079

4***

-.070

2***

(.017

3)(.0

156)

(.015

0)(.0

148)

(.014

5)(.0

140)

(.014

3)(.0

138)

DIS

TAN

CE2

-.015

8***

-.009

6*-.0

114*

*-.0

117*

*-.0

159*

**-.0

153*

**-.0

144*

**-.0

138*

**(.0

051)

(.005

0)(.0

050)

(.004

9)(.0

049)

(.004

8)(.0

049)

(.004

8)C

ON

NEC

TED

-.157

1***

-.155

2***

-.154

1***

-.205

2***

-.208

9***

-.213

9***

-.218

0***

(.041

3)(.0

397)

(.039

6)(.0

378)

(.036

0)(.0

378)

(.036

2)C

ON

NEC

TED

2-.0

088*

-.008

7*-.0

085*

-.015

2***

-.016

5***

-.016

6***

-.018

0***

(.005

0)(.0

050)

(.005

0)(.0

048)

(.004

7)(.0

049)

(.004

8)C

OA

ST-.1

316*

**-.1

309*

**-.0

989*

**-.0

799*

**-.0

483*

-.021

1(.0

215)

(.020

5)(.0

215)

(.020

6)(.0

267)

(.025

0)C

OA

ST2

-.000

5-.0

019

.002

4.0

009

.005

9.0

062

(.007

1)(.0

071)

(.006

8)(.0

065)

(.007

0)(.0

065)

BOR

DER

-.103

1***

-.090

2***

-.112

5***

-.084

7***

-.105

1***

(.031

9)(.0

286)

(.030

7)(.0

277)

(.029

8)BO

RD

ER2

.015

1.0

205*

*.0

144

.020

4**

.014

7*(.0

096)

(.009

0)(.0

091)

(.008

8)(.0

089)

AR

EA.2

658

.557

5***

.219

1.4

930*

*(.1

657)

(.196

7)(.1

702)

(.199

1)A

REA

2.0

133

-.038

9**

.017

8-.0

339*

*(.0

146)

(.016

8)(.0

149)

(.016

9)R

OA

D-.7

787*

**-.7

757*

**(.2

136)

(.212

1)R

OA

D2

.142

0***

.142

8***

(.023

8)(.0

237)

ALT

ITU

DE

-.617

3***

-.563

9***

(.198

1)(.1

966)

ALT

ITU

DE2

.048

9***

.040

5**

(.018

7)(.0

184)

Obs

erva

tion

s36

4536

4536

4536

4536

4536

4536

4536

45

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


6.3 PRODUCTION GROWTH RATE AND ACCESS

In order to check if whether the effect of transportation cost has faded awayor not, a regression taking the growth rate of the municipality total gross pro-duction growth rate as dependent variable was run. As the previous exercises,regressions are run for a complete and a restricted sample, in order to have thesame number of observations across specifications. These regressions controlfor the same set of control variables as the previous ones, as well as for thelagged value of the growth rate.

The estimated coefficient of the distance to the transportation network vari-able report a nearly zero and non significant effect on the growth rate. Thisresult may represent the fact that the differences in access to transportationnetwork does not affect the current growth experiences, so the importance ofdifferential in trade costs across the periods of study does not affect the currentdynamics of regional growth. Even though differences in levels of productioncan be explained by the access to the transportation network, current accesshas not affected the contemporary growth path.


TAB

LE

5:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

PR

OD

UC

TIO

NG

RO

WT

H:C

OM

PL

ET

ESA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(T

otal

Gro

ssPr

oduc

tion

(per

capi

ta))t-L

n(To

talG

ross

Prod

ucti

on(p

erca

pita

))t−

1

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

.006

0.0

092

.009

6.0

090

.008

9.0

080

.007

7.0

075

(.006

1)(.0

068)

(.007

0)(.0

069)

(.006

9)(.0

087)

(.007

1)(.0

088)

DIS

TAN

CE2

.000

3.0

008

.000

7.0

005

.000

5.0

019

.000

6.0

018

(.001

8)(.0

019)

(.001

9)(.0

019)

(.001

9)(.0

023)

(.002

0)(.0

024)

GR

OW

TH_1

-.400

6***

-.399

7***

-.399

6***

-.399

4***

-.399

1***

-.307

9***

-.399

0***

-.307

5***

(.037

1)(.0

372)

(.037

2)(.0

370)

(.037

1)(.0

563)

(.037

2)(.0

563)

CO

NN

ECT

ED-.0

134

-.013

2-.0

121

-.010

7-.0

135

-.008

2-.0

127

(.015

5)(.0

153)

(.015

2)(.0

154)

(.019

4)(.0

156)

(.019

7)C

ON

NEC

TED

2-.0

004

-.000

4-.0

001

.000

1.0

002

.000

5.0

003

(.002

2)(.0

021)

(.002

1)(.0

022)

(.002

7)(.0

022)

(.002

8)C

OA

ST-.0

055

-.002

5-.0

037

-.003

4-.0

255*

*-.0

104

(.007

2)(.0

073)

(.007

7)(.0

105)

(.010

8)(.0

152)

CO

AST

2.0

004

.000

6.0

005

-.001

0-.0

029

-.001

5(.0

019)

(.001

8)(.0

018)

(.002

7)(.0

021)

(.003

2)BO

RD

ER.0

363*

**.0

356*

**.0

370*

**.0

319*

*.0

359*

**(.0

124)

(.012

3)(.0

126)

(.012

9)(.0

130)

BOR

DER

2.0

096*

**.0

094*

**.0

097*

**.0

080*

*.0

096*

**(.0

035)

(.003

5)(.0

036)

(.003

6)(.0

037)

AR

EA.0

183

-.132

5.0

318

-.128

1(.0

667)

(.122

0)(.0

669)

(.122

7)A

REA

2-.0

026

.011

6-.0

034

.011

1(.0

062)

(.010

6)(.0

062)

(.010

7)R

OA

D.0

674

.064

2(.1

252)

(.126

3)R

OA

D2

-.009

8-.0

093

(.014

3)(.0

145)

ALT

ITU

DE

-.062

8.0

878

(.111

4)(.1

544)

ALT

ITU

DE2

.012

1-.0

067

(.010

1)(.0

141)

Obs

erva

tion

s20

8920

8920

8720

8720

8712

1220

8712

12

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


TAB

LE

6:D

ISTA

NC

ET

OT

RA

NSP

OR

TAT

ION

NE

TW

OR

KA

ND

PR

OD

UC

TIO

NG

RO

WT

H:R

EST

RIC

TE

DSA

MP

LE

Dep

ende

ntV

aria

ble:

Ln(T

otal

Gro

ssPr

oduc

tion

(per

capi

ta))t-L

n(To

talG

ross

Prod

ucti

on(p

erca

pita

))t−

1

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

DIS

TAN

CE

.004

8.0

084

.008

3.0

079

.008

2.0

080

.007

7.0

075

(.007

5)(.0

083)

(.008

6)(.0

086)

(.008

6)(.0

087)

(.008

7)(.0

088)

DIS

TAN

CE2

.001

5.0

021

.001

9.0

017

.002

0.0

019

.001

9.0

018

(.002

3)(.0

022)

(.002

2)(.0

023)

(.002

3)(.0

023)

(.002

3)(.0

024)

GR

OW

TH_1

-.312

4***

-.310

2***

-.310

3***

-.310

1***

-.308

3***

-.307

9***

-.307

9***

-.307

5***

(.056

1)(.0

564)

(.056

4)(.0

559)

(.056

2)(.0

563)

(.056

1)(.0

563)

CO

NN

ECT

ED-.0

127

-.012

0-.0

123

-.013

4-.0

135

-.012

6-.0

127

(.020

2)(.0

198)

(.019

5)(.0

195)

(.019

4)(.0

197)

(.019

7)C

ON

NEC

TED

2.0

002

.000

3.0

004

.000

1.0

002

.000

2.0

003

(.002

8)(.0

028)

(.002

7)(.0

027)

(.002

7)(.0

027)

(.002

8)C

OA

ST-.0

068

-.003

6-.0

026

-.003

4-.0

097

-.010

4(.0

097)

(.009

7)(.0

105)

(.010

5)(.0

154)

(.015

2)C

OA

ST2

-.001

2-.0

010

-.001

1-.0

010

-.001

5-.0

015

(.002

7)(.0

027)

(.002

7)(.0

027)

(.003

2)(.0

032)

BOR

DER

.035

5***

.035

8***

.037

0***

.034

7***

.035

9***

(.012

2)(.0

122)

(.012

6)(.0

125)

(.013

0)BO

RD

ER2

.009

3**

.009

5***

.009

7***

.009

4**

.009

6***

(.003

6)(.0

036)

(.003

6)(.0

037)

(.003

7)A

REA

-.112

4-.1

325

-.108

9-.1

281

(.110

0)(.1

220)

(.110

4)(.1

227)

AR

EA2

.008

7.0

116

.008

3.0

111

(.009

2)(.0

106)

(.009

3)(.0

107)

RO

AD

.067

4.0

642

(.125

2)(.1

263)

RO

AD

2-.0

098

-.009

3(.0

143)

(.014

5)A

LTIT

UD

E.0

967

.087

8(.1

524)

(.154

4)A

LTIT

UD

E2-.0

076

-.006

7(.0

140)

(.014

1)O

bser

vati

ons

1212

1212

1212

1212

1212

1212

1212

1212

*p<

0.1

0,*

*p<

0.0

5,*

**p<

0.0

1

Stan

dard

erro

rsin

pare

nthe

ses.

Stan

dard

erro

rscl

uste

red

atth

em

unic

ipal

ity

leve

l.A

llre

gres

sion

sco

ntro

lfor

year

and

stat

efix

edef

fect

s.


6.4 DISCUSSION

In this section the estimation of the effects of access to transportation infras-tructure on two measures of economic performance, namely, municipalities’total gross production and economic units have been presented. As expected,a negative relation between these variables was found. This fact shows thatmunicipalities that are closer to the transportation network display a bettereconomic performance.

The access to a transportation network does represent an advantageouscondition for economic performance. This relation has been explained froma trade costs perspective. In this work is evidence of the benefits of beingcloser to the network. An important part to take into account is that this ef-fect corresponds to the effects in the average location. According to the his-torical context, municipalities have beenquasi-randomly located with respect tothis network, so the estimations show how differences in access may explaindifferences in economic performance.

The results found in this work are consistent with the elasticities found inthe Chinese case, where authors find an elasticity of county per capita GDP toaccess of negative 6.72 percent and an elasticity of number of firms to access ofnegative 8.4 percent.

7. CONCLUSION

The relation between public capital and economic performance has been stud-ied from different approaches in economic literature. A big fraction of the re-search studies this relation from a production function approach, arguing thatpublic infrastructure can be understood as an input affecting an aggregate pro-duction function, directly as an input, or indirectly affecting total factor pro-ductivity.

In this work, the relation between a specific kind of public capital, such astransportation infrastructure, is addressed from a different approach. Takingas reference a model of factor mobility and trade, the effects of transportationinfrastructure are understood through the access that this type of public capitaloffers to regions and reduce trade costs. That is, it is expected that places thatare closer to the network display better economic performance.

The variable of interest in this study is the access to the transportation net-work measurement. This measure was constructed taking the shortest distancefrom any Mexican municipality to the historical transportation network andthe shape of this network highly resembles nowadays main transportation net-work. Using the historical transportation network in order to construct the ac-cess measurement represents a series of methodological advantages because itallows to overcome the endogenous relation between economic performanceand the placement of transportation infrastructure.

The results of this work support the hypothesis that access to transporta-tion network is positively related with economic performance for the Mexicancase. The estimations show that the two measurements of economic perfor-


mance (i.e. per capita gross production and economic units) are negatively re-lated with the distance to the network. That is, municipalities that are closer tothe network have experienced a better economic performance compared withthose further away.

This work shows a different way to assess the effects of public capital be-yond the direct effects that could be attributed from a production function ap-proach. Although the relation of access to the network with economic per-formance has a modest magnitude, the effects are significant. Moreover thedifferences of access to the network, that are quasi-randomly determined, showhow placement of this kind of facilities may have a differentiated effect acrossregions.

This work is a contribution to further the understanding of the effects thatsome types of public capital, such as transportation infrastructure, may haveover economic outcomes. An important conclusion of this work is that pub-lic capital may have positive effects over economic outcomes, but these effectsmay be differentiated according to the way in which this services are delivered.As it has been shown in this work, the transportation network has a differen-tiated effect on regions depending on (i)how well connected those regions are,and (ii)their extent of access to the network. Provision of infrastructure hasdifferent effects depending on the way that infrastructure is delivered. An im-portant recommendation that can be extracted from this work is that regardingthe provision of public capital, it is important to consider not only the amountprovided, but also where that capital is located. As this work has shown, re-gions that were quasi randomly located nearer to the transportation network,had a better economic performance in the long run. Following this fact, it isimportant when analyzing the provision of public infrastructure, to considerhow this provision may affect regional economic dynamics.


REFERENCES

Alcalá, F. and Ciccone, A. (2004), ‘Trade and Productivity’, The Quarterly Journalof Economics 119(2), 613–646.

Anderson, J. E. and Van Wincoop, E. (2004), ‘Trade Costs’, Journal of EconomicLiterature 42(3), 691–751.

Aschauer, D. A. (1989), ‘Is Public Expenditure Productive?’, Journal of MonetaryEconomics 23(2), 177–200.

Atack, J., Haines, M. R. and Margo, R. A. (2008), ‘Railroads and the Rise ofthe Factory: Evidence for the United States, 1850-70’, NBER Working PaperNo.14410.

Banerjee, A., Duflo, E. and Qian, N. (2012), ‘On the road: Access to Transporta-tion Infrastructure and Economic Growth in China’, NBER Working PaperNo.17897.

Barro, R. J. (1991), ‘Economic Growth in a Cross Section of Countries’, The Quar-terly Journal of Economics 106(2), 407–443.

Bom, P. and Ligthart, J. (2008), ‘How Productive is Public Capital? A Meta-Analysis’, Working Paper .

Buys, P., Deichmann, U. and Wheeler, D. (2010), ‘Road Network Upgradingand Overland Trade Expansion in Sub-Saharan Africa’, Journal of AfricanEconomies 19(3), 399–432.

Calderon, C. and Servén, L. (2003), ‘The Output Cost of Latin America’s In-frastructure gap’, The Limits of Stabilization: Infrastructure, Public Deficits, andGrowth in Latin America pp. 95–118.

Cantos, P., Gumbau-Albert, M. and Maudos, J. (2005), ‘Transport Infrastruc-tures, Spillover Effects and Regional Growth: Evidence of the Spanish Case’,Transport Reviews 25(1), 25–50.

Coatsworth, J. H., Hernández, J. A. and Manjarrez, H. (1984), El impactoeconómico de los ferrocarriles en el Porfiriato: Crecimiento contra desarrollo, Era.

Costa, J. d. S., Ellson, R. W. and Martin, R. C. (2006), ‘Public Capital, RegionalOutput, and Development: Some Empirical Evidence’, Journal of RegionalScience 27(3), 419–437.


Dercon, S., Gilligan, D. O., Hoddinott, J. and Woldehanna, T. (2009), ‘TheImpact of Agricultural Extension and Roads on Poverty and ConsumptionGrowth in Fifteen Ethiopian Villages’, American Journal of Agricultural Eco-nomics 91(4), 1007–1021.

Donaldson, D. (2010), ‘Railroads of the Raj: Estimating the Impact of Trans-portation Infrastructure’, NBER Working Paper No.16487.

Easterly, W. and Servén, L. (2003), The Limits of Stabilization: Infrastructure, Pub-lic Deficits, and Growth in Latin America, World Bank Publications.

Eberts, R. W. (1986), ‘Estimating the Contribution of Urban Public Infrastruc-ture to Regional Growth’, Federal Reserve Bank of Cleveland Cleveland,Working Paper No. 8610.

Estache, A. (2006), Infrastructure: A Survey of Recent and Upcoming Issues, in‘World Bank Conference, Tokyo’, pp. 29–30.

Florescano, E. and Eissa-Barroso, F. A. (2008), Atlas histórico de México, Aguilar.

Fogel, R. W. (1962), ‘A Quantitative Approach to the Study of Railroads inAmerican Economic Growth: A Report of Some Preliminary Findings’, Jour-nal of Economic History 22(2), 163–197.

Glaeser, E. and Kohlhase, J. (2003), ‘Cities, Regions and the Decline of Trans-port Costs’, Papers in Regional Science 83(1), 197–228.

Gramlich, E. M. (1994), ‘Infrastructure Investment: A Review Essay’, Journal ofEconomic Literature 32(3), 1176–1196.

Herrendorf, B., Schmitz, J. A. and Teixeira, A. (2009), Transportation and Develop-ment: Insights from the US, 1840-1860, Federal Reserve Bank of Minneapolis,Research Department.

Holtz-Eakin, D. and Schwartz, A. E. (1995), ‘Infrastructure in a StructuralModel of Economic Growth’, Regional Science and Urban Economics 25(2), 131–151.

Hulten, C. R. (1996), ‘Infrastructure Capital and Economic Growth’, NBERWorking Paper No.5847.

Hummels, D. (2007), ‘Transportation Costs and International Trade in the Sec-ond Era of Globalization’, The Journal of Economic Perspectives 21(3), 131–154.


Ihori, T. and Kondo, H. (2000), ‘Efficiency of Disaggregate Public Capital Pro-vision in Japan’, CIRJE, Faculty of Economics, University of Tokyo.

Jacoby, H. G. (2000), ‘Access to Markets and the Benefits of Rural Roads’, TheEconomic Journal 110(465), 713–737.

Kawamura, K. (2001), ‘Empirical examination of relationship between firm lo-cation and transportation facilities’, Transportation Research Record: Journal ofthe Transportation Research Board 1747(1), 97–103.

Kuntz Ficker, S. (1995), Empresa extranjera y mercado interno: el Ferrocarril CentralMexicano, 1880-1907, Colegio de México.

Michaels, G. (2008), ‘The Effect of Trade on the Demand for Skill: Evidencefrom the Interstate Highway System’, The Review of Economics and Statistics90(4), 683–701.

Nadiri, M. I. and Mamuneas, T. P. (1994), ‘The Effects of Public Infrastructureand R&D Capital on the Cost Structure and Performance of US Manufactur-ing Industries’, The Review of Economics and Statistics 76(1), 22–37.

Pérez Cervantes, F. (2013), Railroads and Economic Growth: A Trade PolicyApproach, PhD thesis, The University of Chicago.

Peterson, G. E. (1990), Is Public Infrastructure Undersupplied?, in ‘ConferenceSeries - Federal Reserve Bank of Boston’, pp. 113–130.

Riguzzi, P. (1996), Los caminos del atraso: tecnología, instituciones e inversiónen los ferrocarriles mexicanos, 1850-1900, in S. Kuntz and P. Riguzzi, eds,‘Ferrocarriles y vida economómica (1850-1950). Del surgimiento tardío al de-caimiento precoz. México’, El Colegio Mexiquense/Universidad AutónomaMetropolitana-Xochimilco/Ferocarriles Nacionales de México, pp. 31–99.

Sánchez, R. J. (2010), ‘Textos para discussão CEPAL-IPEA’.

Shepherd, B. and Wilson, J. S. (2006), ‘Road Infrastructure in Europe and Cen-tral Asia: Does Network Quality Affect Trade?’, World Bank Policy ResearchWorking Paper .

Storeygard, A. (2012), ‘Farther on Down the Road: Transport Costs, Trade andUrban Growth in Sub-Saharan Africa’, Manuscript, Department of Economics,Brown University .


Straub, S. (2008), Infrastructure and Development: A Critical Appraisal of the MacroLevel Literature, Vol. 4590, World Bank Publications.

Weisbrod, G. (2008), ‘Models to Predict the Economic Development Impact ofTransportation Projects: Historical Experience and New Applications’, TheAnnals of Regional Science 42(3), 519–543.


a

APPENDIX

A. SUMMARY STATISTICS

TABLE 7: SUMMARY STATISTICS

Variable Units Obs Mean Std. Dev. Min Max

Economic Measures

Total Gross Production Pesos(thousands) 7335 2867769 1.911007 6 70.40107Economic Units units 7335 1282.76 4573.72 0 84651

Geodesic Distance Measures

Distance to Network kilometers 7361 56.76 56.68 0 592.12Distance to Connected City kilometers 7337 80.01 53.85 0 592.12

Distance to Coast kilometers 7361 133.48 93.90 0 589.37Distance to US Border kilometers 7361 742.74 286.57 0 1358.34

Other Measures

Population People 7406 124878.7 2130150 94.73 111007Area kilometers2 7466 2379.75 40426.92 1.47 1963952

Altitude km. over sea level 7466 1253.70 829.04 0 2900Road Length kilometers 4525 344.19 1608.78 0 24327

TABLE 8: DISTRIBUTION OF DISTANCE VARIABLE

Percentile Value1% 1.9775% 6.215

10% 9.86425% 19.59550% 44.89275% 94.20790% 143.52395% 172.82099% 248.843


B. DATA SOURCES

TABLE 9: DATA SOURCES

Data Source Description

Economic Census Economic Census carried out each five periods by INEGIPopulation Census Population Census carried out each five periods by INEGI

SNIM Sistema Nacional de Información Municipal. Contains information at municipality level. Data comes from INEGI datasourcesINEGI Series consulted directly from INEGI webpage

TABLE 10: VARIABLES AND DATA SOURCES

Variable Data Source Note

Economic Measures

Total Gross Production Economic Census Time Span: 1999, 2004, 2009Economic Units Economic Census Time Span: 1999, 2004, 2009

Geodesic Distance Measures

Distance to Network SNIM Constructed by the authorDistance to Connected City SNIM Constructed by the author

Distance to Coast SNIM Constructed by the authorDistance to US Border SNIM Constructed by the author

Other Measures

Area SNIM Time InvariantAltitude SNIM Time Invariant

Population Population Census/Counts Time Span: 1995, 2000, 2005, 2010Road Length INEGI Time Span: 1999, 2004, 2009


C.

ME

XIC

O’S

TR

AN

SPO

RTA

TIO

NN

ET

WO

RK

FIG

UR

E1:

ME

XIC

AN

RA

ILR

OA

DS

CO

NST

RU

CT

ED

BE

FO

RE

1910

Sour

ce:(

Flor

esca

noan

dEi

ssa-

Barr

oso,

2008

,p.1

77)


FIG

UR

E2:

TR

AN

SPO

RTA

TIO

NN

ET

WO

RK

:CO

NST

RU

CT

ED

LIN

ES


FIG

UR

E3:

ME

XIC

OH

IGH

WA

YSY

STE

M

Sour

ce:S

ecre

tari

ade

Com

unic

acio

nes

yTr

ansp

orte

s,(2

009)


D. ALGORITHM FOR CALCULATING DISTANCE MEASURES

In this section we will describe the algorithm used for constructing the geodesicdistance measures.30 In order to calculate these measures we need geograph-ical coordinates (latitude and longitude) for each Mexican municipality in thedecimal form.31 The next step is to locate all the municipalities that are locatedalong the straight line connecting two cities that were connected by the railroad system constructed before 1910. The location of these municipalities wasdone using ArcGIS software and a Mexico map with state and municipalitygeographical division. There are Mi (i = 1, 2, ..., I) municipalities in Mexico,Rj , (j = 1, 2, ..., J) historical transportation routes and Tn

j , (n = 1, 2, ..., Nj)

municipalities in each of these routes. The steps taken for this purpose are thefollowing

1. Calculate the distance from each municipality,M , to each municipality,T ,conforming a route, R. This set of measures corresponds to the distancefrom each municipality to the complete route j. In this step we get I x Jx Nj distance measures (where I is the total of Mexican municipalities, Jthe total number of routes, and Nj , the number of municipalities in eachroute j)

Distanceji,n = geodist[mi, tjn]

for (i = 1, 2, ..., I) , (j = 1, 2, ..., J) and (n = 1, 2, ..., Nj)

2. Compute the minimum distance to each route j. This step results intoone distance measure for each route, j for each Mexican municipality, i.(J distances per municipality)

Distanceji = min[Distanceji,1, Distanceji,2, ..., Distance

ji,N ]

for (i = 1, 2, ..., I) , (j = 1, 2, ..., J)

30Distance measures are computed using geodist package in STATA.31The municipality’s geographical coordinates correspond to the municipality’s population cen-

troid. These measure was obtained in the DD,MM ,SS form (where DD= degrees, MM= min-utes, SS= seconds) form. In order to convert it to the decimal form we needed to compute thefollowing identity:

(Longitude/Latitude) = Grades+Minutes

60+Seconds

60 ∗ 60.


3. Compute the minimum distance from any municipality i, to any route j.After this step, we get the distance to transportation measure, which isdefined as the closest distance to any municipality located in any histori-cal transportation route.

Distancei= min[Distance1i , Distance2i , ..., Distance

Ji ]

for (i = 1, 2, ..., I)

After this steps, the resulting measure Distancei is the access to transportationinfrastructure used in the econometric analysis.

There were also constructed measures for distance to terminal cities, dis-tance to coast, and distance to the US border. The steps taken for constructingthese measures are the same as the described in this section, just instead ofhaving Rj routes, we have a set of terminal municipalities, a set of coastal mu-nicipalities, and a set of Mexican municipalities in the US border.

At the end, each Mexican municipality Mi, (i = 1, 2, ..., I) has associated ameasure for each of these variables: (access to transportation network, distance toconnected city, distance to coast and distance to US border).

ANÁLISIS DEL COMPORTAMIENTO DE PIGNORANTESEN CONTRATOS PRENDARIOS*

PEDRO FELIPE MAGAÑA SÁENZ**

Problemas de autocontrol es una de las razones principales por las cuales losindividuos procrastinan. Procrastinar es uno de los errores que se cometen al mo-mento de tomar decisiones pues resulta fácil y común postergar una actividad quenos resulta desagradable en el corto plazo pero tiene importantes beneficios en ellargo plazo (ejercitarnos, seguir una dieta balanceada o pagar deudas, por citar al-gunos ejemplos). En este contexto y con el objeto de entender de mejor manera elmercado prendario mexicano se realiza el siguiente trabajo por medio de un expe-rimento aleatorio con la finalidad de examinar la forma en que la estructuración decontratos afecta la probabilidad de éxito de un préstamo prendario, así como ponera prueba la hipótesis de que el precommitment de los pignorantes es de vital impor-tancia para que se cumplan las condiciones de dichos contratos.

CÓDIGO JEL: D03, D01, C42, G21.PALABRAS CLAVE: economía conductal, economí experimental, micro-

créditos, prestamos prendarios.

1. INTRODUCCIÓN

LAS CASAS DE EMPEÑO EN MÉXICO

Según la CONDUSEF, las casas de empeño son empresas comerciantes einstituciones de asistencia privada que otorgan un préstamo a alguna personacon una garantía prendaria, es decir, la garantía es un bien mueble. Por lo ge-neral joyas, anillos y relojes. Cuando una persona acude a la Casa de Empeñoa depositar un bien es necesario firmar un contrato de prenda, éste es un con-trato de adhesión mediante el cual el titular de la boleta y la casa de empeñose sujetan a las cláusulas que lo integran. Cabe señalar que la constitución dela prenda es necesaria para garantizar el pago del préstamo, ya que, en caso deque la cantidad otorgada o sus intereses no sean cubiertos oportunamente, lainstitución podrá absorber la propiedad del bien y subastarlo o venderlo pararecuperar el crédito. Por otro lado, para el desempeño de las prendas se de-berá presentar la boleta, pagar la cantidad prestada, los intereses devengadosy los gastos de almacenaje, e identificarse cuando así lo solicite la institución,con fecha límite un día hábil antes de la fecha de comercialización que se anoteen el anverso del billete y en avisos colocados en lugares visibles de la propiasucursal donde fue celebrado el contrato.

El servicio de empeño tiene un costo para el pignorante1 dividido en dosconceptos:

*Este artículo está basado en la tesis presentada por el autor para recibir el título de Licenciadoen Economía por parte del ITAM. El autor agradece todo el apoyo y sugerencias de Nicolás Melis-sas así como los valiosos comentarios de Kensuke Teshima, Andrei Gomberg, Arturo Aguilar y alos árbitros anónimos de la Gaceta de Economía.

**Email: [email protected] del verbo pignorar. Este verbo se refiere a otorgar o ceder algo en prenda (es decir,

a modo de garantía).

PEDRO FELIPE MAGAÑA SÁENZ 97

1. Una tasa de interés mensual nominal sobre el préstamo otorgado.

2. Una tasa por la custodia de la prenda sobre el valor del avalúo.

La tasa de interés por el préstamo otorgado será la que se señala en el bi-llete, y se calcula por mes nominal hasta el vencimiento del contrato. El messe considerará completo independientemente de la fecha en que se realice elempeño o refrendo2. En todos los préstamos se adicionará a la tasa de interés,los puntos que se señalen en el billete por concepto de almacenaje que, por logeneral, cubre gastos de valuación, almacenaje, custodia y prima de segurosy fianzas (en nuestro caso, la suma de la tasa de interés y la tasa de custodiaequivale cerca de 30 % o 40 % del monto prestado).

En términos generales, el proceso de empeño lo podríamos establecer conlos siguientes pasos.

1. Acudir a una sucursal.

2. Se presenta una identificación oficial.

3. Presentar el bien objeto de empeño.

4. La institución hace la valuación del bien.

5. Se le informa al pignorante cuánto se le va a prestar por el bien.

6. Se firma el contrato.

7. La institución, le entrega el efectivo junto con la boleta de empeño.

8. El interesado recibe copia del contrato en el cual se detallan los costos,derechos y obligaciones de ambas partes.

En México, las casas de empeño funcionan como un sustituto de los mi-crocréditos para los deciles más bajos de la población. Esto debido a que teneracceso al mercado de crédito para familias de escasos recursos es muy compli-cado y, aunado a esto, México tiene uno de los sistemas financieros más peque-ños y menos competitivos del mundo3. Por lo anterior, resulta casi imposiblepara estas familias tener acceso a algún tipo de crédito.

2Término usado cuando al pignorante se le da un nuevo plazo pagando solamente los interesesen el mes de vencimiento del contrato.

3Existen únicamente 47 bancos comparado con aproximadamente 7,500 bancos en Estados Uni-dos (según la Federal Deposit Insurance Corporation). Esto quiere decir que México representa apro-ximadamente el 7.14 % de la economía de los Estados Unidos en términos del PIB comparado conel 0.62 % que es lo que representa el sector bancario mexicano del estadounidense. Esto demuestralo poco desarrollado que se encuentra este sector. Por otro lado, sólo siete bancos principales tienenel 80 % de los activos en México.

98 ANÁLISIS DEL COMPORTAMIENTO DE PIGNORANTES EN CONTRATOS

PRUEBA ALEATORIA

El experimento se centra en identificar en qué forma la estructuración decontratos afecta la forma en la que los pignorantes realizan sus pagos y, en con-secuencia, logra desempeñar y entender la relevancia del precommitment paraque las condiciones contratadas se cumplan. Adicional a lo anterior, se anali-za cuáles son las características de los usuarios que afectan el porcentaje totaldel pago realizado, esto pues se cuenta con 70 variables aproximadamente y sebusca comprender el comportamiento de los individuos de forma más precisapara robustecer el modelo.

Por lo anterior, se usó una prueba aleatoria con la intención de identificarla efectividad de la aplicación de ciertos tratamientos a la población objetivo,pues se busca eliminar cualquier tipo de sesgo. Su importancia reside en lalegitimidad que le brinda al experimento. La idea básica es que los tratamien-tos son asignados aleatoriamente a los sujetos de investigación, pues esto nosasegura que los diferentes grupos de tratamiento son “estadísticamente equi-valentes” y nos ayuda a eliminar problemas de endogeneidad pues, sin dichosproblemas, se permite analizar el impacto de un producto identificando clara-mente la significancia directa en la variable dependiente. Además, elimina laposibilidad de que los resultados de la efectividad de los contratos estén ses-gados por problemas de auto selección.

Un ejemplo sería el siguiente: Considere un mundo con dos tipos de usua-rio: los sofisticados y los ingenuos (50 % de la población es sofisticado y 50 %ingenuo). Un cliente sofisticado sabe que tiene problemas de autocontrol parapagar un préstamo y, por esto, recurre a contratos en los que se fuerza a pagarperiódicamente el préstamo. Si se le permitiera elegir voluntariamente el con-trato, con una probabilidad bastante alta, elegiría un contrato que incentive suautocontrol (llamémoslo contrato B) y entonces pagaría su préstamo y lograríadesempeñar. Un cliente ingenuo piensa que no tiene problemas de autocontrolpara pagar un préstamo y no recurre a los contratos en los que se fuerza a pagarperiódicamente el préstamo. El cliente ingenuo, por lo tanto, debería escogerel contrato que no incentive su autocontrol (contrato A) y entonces no pagaríael préstamo y no lograría desempeñar. Analicemos los siguientes 3 escenarios:

1. A ambos usuarios se les impone el contrato A, lo que generaría que nin-guno pague la totalidad de préstamo y, por ende, no logren desempeñar(tasa de desempeño muy cercana al cero por ciento).

2. A ambos usuarios se les impone el contrato B. En este caso, los sofis-ticados pagarían la totalidad del préstamo y lograrían desempeñar; losingenuos no pagarían y perderían su prenda (tasa de desempeño muycercana al 50 %).

3. Se les ofrece elegir voluntariamente entre el contrato A o el contrato B.Aquí lo que sucederá es que los ingenuos, muy probablemente, elegiránel contrato A y los sofisticados, muy probablemente, elegirán el contratoB. Esto generaría que la tasa de pago del contrato A seria baja y, en con-secuencia, la tasa de desempeño seria muy cercana al cero por ciento y la


tasa de pago del contrato B seria bastante alta, lo que llevaría a tener unatasa de desempeño de alrededor del 100 % para dicho contrato.

Por esto, al hacer un ensayo aleatorio se puede observar cómo reaccionanel total de los usuarios ante un mismo contrato y, con esto, nuestro resultadono está sesgado por auto selección. El ensayo aleatorio es útil para métodosprácticos dado que, en la realidad, no se puede distinguir con qué clase decliente se está tratando y, en consecuencia, es muy difícil estimar el efecto deun cambio contractual sobre el pago total del pignorante.

En el sector financiero enfocado a las casas de empeño se ha demostradoque es preferible, para la proliferación de este mercado, que la gente logredesempeñar su prenda. La relevancia de esto reside en que si los usuarios,que en su mayoría son de escasos recursos económicos, no recuperan sus obje-tos eventualmente terminan agotando todos sus instrumentos de crédito (susprendas de empeño) y, con esto, las casas de empeño terminan reduciendo sumercado.

ESTUDIOS RELACIONADOS CON EL PROYECTO

Con la intención de evitar que los individuos pierdan su prendas se crearonlos contratos 2 y 3 (Contrato 2: Préstamo a 3 meses con pago mensual forzo-so (de no realizar el pago se cobra una pena por no pago equivalente al 2 %sobre el monto prestado); Contrato 3: Préstamo a 3 meses con pago mensualopcional (de no realizar el pago no se cobra ninguna pena, es decir, solo secree en la palabra del usuario)) tomando en cuenta el estudio realizado porDan Ariely y Klaus Wertenbroch (2002) realizado con estudiantes de MIT, en elcual se analiza qué efecto produce el establecimiento de deadlines dentro delcomportamiento de los estudiantes. Este experimento se enfocaba en analizarlos patrones de comportamiento de los estudiantes al presentarles una serie deesquemas de entrega de papers.

Lo interesante del estudio fue que se encontró que el establecimiento forzo-so de fechas límite distribuidas simétricamente por parte de un agente externo,en este caso los profesores, incrementaba la probabilidad de obtención de unamejor calificación para los estudiantes pues se comparó contra el grupo dondelos estudiantes elegían voluntariamente las fechas de entrega de los papers yestos tuvieron peores calificaciones (importante notar que dentro de este grupohubo estudiantes que se pusieron deadlines distribuidos simétricamente y estostuvieron mejores calificaciones, e incluso cercanas a las del primer grupo, quelos que no los distribuyeron simétricamente). Es decir, cuando la gente sabeque la posibilidad de retrasar entregas o compromisos es latente (procrastinar)optan por auto regularse eligiendo el marco que los obliga a administrar sustareas a través de un plazo fijo.

Otro estudio que analiza este tipo de efectos es el realizado por Jain y Man-suri (2003) en el Grameen Bank en Bangladesh. Este banco opera otorgando mi-cro créditos a grupos de personas de bajos ingresos en el que no se les exigecolateral o, en otras palabras, no se les exige una prenda. Su éxito está funda-mentado en el incentivo para pagar por parte de los usuarios del crédito, pues


si por alguna razón no abona algún pago, no será tomado en cuenta por el gru-po para préstamos futuros. Sin embargo, durante este experimento se mostróque las complicaciones son variadas: existe riesgo moral, selección adversa, al-tos costos de monitoreo. Por esto, uno de los acercamientos que analizaron paradisminuir la probabilidad de que se dieran estos escenarios fue la implementa-ción de un esquema de amortización o redistribución de pagos como métodopara eliminar los problemas de información que enfrentaban las institucionesde micro crédito, puesto que la implementación de este sistema genera que elusuario solicite un préstamo a un amigo o a prestamistas locales que tenganuna mejor información sobre la solvencia de los prestatarios. Por lo tanto, ladevolución anticipada de un porcentaje del monto prestado es un dispositivopara obtener información en poder de estos usuarios.

Si bien, con esto se nota que las prácticas de auto control llevan al cum-plimiento de metas y así evitar procrastinar. También se reconoce a través denumerosos experimentos psicológicos y económicos que el intercambio de pro-mesas resalta un comportamiento de cooperación. Christoph Vanberg (2008)realizo un estudio con la finalidad de reconocer los efectos de algunos tipos decontratos sociales (por ejemplo las promesas) sobre el cumplimiento de com-promisos. Se intentó explicar la motivación del incremento del comportamien-to cooperativo a través de dos teorías: primero, se asumió que a las personasles gusta que su palabra no pierda validez y segundo, a la gente no le gustadecepcionar a los demás en sus expectativas. El experimento fue conducido enel laboratorio experimental del Instituto de Economía Max Plank en Alemaniay la población objetivo estaba formada por estudiantes de universidad a loscuales se les aplicaba un programa basado en promesas previamente estable-cidas. En el experimento se aislaban dos grupos de personas en el que, previoal inicio de la dinámica, podían estar en contacto y había pagos monetarios depor medio, por lo tanto, las promesas hechas afectaban el resultado final deelección. En este paper se concluye que lo que importa no son las expectativasdel receptor, sino la “internalización” que el emisor hizo de su promesa. Otroejemplo de este tipo de estudios es el realizado por Gary Charness y MartinDufwenberg (2006) en donde se concluye que las promesas cambian las ex-pectativas y el comportamiento del receptor y esto, a su vez, cambia cómo secomporta el emisor con el receptor posteriormente. Ellos interpretan que estoúltimo se debe a que al emisor le importan las expectativas del receptor y no loquiere “defraudar” (sentimiento de culpa).

2. PROYECTO: COMPORTAMIENTO EN CASAS DE EMPEÑO

DESCRIPCIÓN DEL EXPERIMENTO

El experimento fue diseñado por el Centro de Investigación Económica(CIE) del ITAM dando inicio en el mes de Junio de 2012 y tuvo una duraciónaproximada de diez meses (culminó en marzo 2013). Inició con la generaciónde tres tipos de contratos y se les presentaron a los usuarios de la siguientemanera:


Contrato 1: Préstamo a tres meses sin pagos forzosos.

Contrato 2: Préstamo a tres meses con pago mensual forzoso (de no rea-lizar el pago se cobra una pena por no pago equivalente al 2 % del montoprestado).

Contrato 3: Préstamo a tres meses con pago mensual opcional (de no rea-lizar el pago no se cobra ninguna pena, es decir, sólo se cree en la palabradel usuario).

Contrato 4: El usuario elegía de manera voluntaria entre el Contrato 1(llamado 4A) o el Contrato 2 (llamado 4B).


Estos programas eran ofrecidos de manera aleatoria durante un día com-pleto en seis casas de empeño de la en el Distrito Federal con la intención deanalizar el comportamiento de los pignorantes (usuarios del préstamo) segúnlas características del contrato que se les asignaba o elegían.

Con la ayuda de más de diez asistentes se realizó el levantamiento de in-formación, la cual se obtuvo de dos fuentes. Se levantaron las encuestas a laspersonas que se presentaban en las sucursales para realizar un empeño. Pri-mero, se les daba una descripción detallada de las cláusulas del contrato deldía y, por último, se les pedía que nos llenaran una encuesta de 30 preguntascon la intención de conocer las características del pignorante . Por otro lado,la casa de empeño nos entregó un listado de movimientos que realizaron losusuarios durante el periodo del préstamo, esta incluía empeño, refrendo, pagode intereses, abonos a capital y desempeños.

Con la información anterior, se creó una tercera base de datos, la cual eraun híbrido entre las dos anteriores, pues nos permitía tener las característi-cas de los individuos y los movimientos que realizaron durante el periodo depréstamos. Esta base de datos, nos permite hacer un análisis ”uno a uno´´ conla intención de entender qué características (tanto de los individuos como delcontrato) afectan en el monto pagado por los pignorantes.

Se crearon dos variables nuevas a esta base de datos: una de ellas se generósumando todos los pagos y movimientos que realizó el usuario, y esta se di-vidió entre el monto total prestado y una variable dicotómica que nos dice sila persona logró desempeñar o no (uno si desempeña y cero si no desempe-ña). Por otro lado se crearon seis dummies, una por cada tipo de contrato. Secorrieron una serie de regresiones (éstas son descritas con mayor detalle en lassiguientes secciones del trabajo) que ayudan verificar puntos importantes dellevantamiento de datos como lo son:

Respaldar la aleatoriedad de la muestra

Validar autoselección

Entender el efecto de cada contrato y subcontrato


Características que afectan el porcentaje de pago del préstamo

La información recopilada en la base de datos nos da 2721 boletas de em-peño, la cual será la muestra para nuestro experimento.

3. DATOS

ESTADÍSTICA DESCRIPTIVA

TABLA 1: ESTADÍSTICOS DESCRIPTIVOS

Variable Obs Mean Std. Dev Min Max

pp-pago 2721 66.06 % 58.52 % 0 455 %val-pren 2032 5363.22 9779.71 35 100,000.00préstamo 2721 2096.56 2436.57 50 33,560.00

edad 2563 42.1 12.8 15 85

En la tabla 1 vemos que el porcentaje promedio de pago (pp-pago) está enel 66.08 % teniendo como máximo a una persona que pago 4.55 veces lo quese le prestó (455 %). En la segunda línea vemos el monto que los pignorantescreen que vale su prenda (val-pren), éste va de los $35 a los $100,000 pesosteniendo una media de $5,363.22 pesos, esto nos dice que, en promedio, losusuarios sobreestiman el valor real de la prenda en mas del 100 %. Este efectopuede ser explicado por que los individuos al creer que nosotros trabajábamospara la casa de empeño, tenia incentivos a mentir pensando que, con esto, se lesprestaría mas dinero (tomar en cuenta que la Fundación Dondé presta 75 % delvalor prendario de la prenda, es decir, cerca del 60 % del valor comercial de lamisma). En la tercera línea vemos que el monto de préstamo (préstamo) oscilaentre los $50 y los $33,560 pesos con un promedio de $2,096.55 pesos. En cuantoa la edad, podemos notar que los usuarios están entre los 15 y los 85 años deedad, teniendo un promedio de 42 años. Con este último dato, podemos intuirque la mayoría de los usuarios son adultos jóvenes o medios.


TABLA 2: TABLAS DE FRECUENCIA

Variable Subcategoría Obs Porcentaje

Producto 1 485 17.8 %2 433 15.9 %3 353 13.0 %

4A 676 24.8 %4B 200 7.4 %5A 380 14.0 %5B 194 7.1 %

Género Hombre 782 28.7 %Mujer 1,939 71.3 %

Educación Ninguna 43 1.6 %Primaria 273 10.0 %

Secundaria 616 22.6 %Preparatoria 866 31.8 %

Más de preparatoria 923 33.9 %

¿Participa en Tanda? No 1,657 60.9 %Sí 1,064 39.1 %

Gasto Familiar Semanal X <500 149 5.5 %500<X <1,000 1,122 41.2 %

1,000<X <2,000 939 34.5 %2,000<X <4,000 350 12.9 %4,000<X <6,000 89 3.3 %

X >6,000 72 2.7 %

¿Ha empeñado antes? No 289 10.6 %Sí 2,432 89.4 %

¿Logro desempeñar? No 1,428 52.5 %Sí 1,293 47.5 %

¿Qué prefiere tener? 100 pesos mañana 1,941 71.3 %150 pesos en un mes 780 28.7 %

La tabla 2 muestra los siguientes datos relevantes sobre nuestra base:

La cantidad de boletas o préstamos con cada uno de los productos ofreci-dos a los usuarios siendo el contrato 4 (el usuario elegía de manera volun-taria entre el Contrato 1 (llamado 4A) o el Contrato 2 (llamado 4B)) el quetiene mayor frecuencia pues nos interesaba notar si los individuos enten-dían que podían procrastinar y no pagar de manera adecuada y, una vezentendiendo esto, se imponían un castigo costoso en caso de postergarlos pagos como lo mencionan Ariely y Wertenbroch (2002).

También vemos que el porcentaje de mujeres que realizan el empeño es


muy superior al de los hombres (con el 71.3 %): esto se puede entenderpor el hecho de que en la mayoría familias mexicanas las mujeres sonlas encargadas del gasto en el hogar y, siendo el empeño un sustituto deun microcrédito para las personas de los deciles más bajos, se entiendeque las mujeres realicen más empeños. Otro factor puede ser que, regu-larmente, las mujeres tienen más artículos de joyería que los hombres.

También está reflejado el nivel de escolaridad de nuestra base de datos.Podemos ver que el porcentaje va incrementando conforme mayor es elnivel de escolaridad, siendo los que tienen más que preparatoria los queempeñan más con el 33.9 %.

Considerando las tandas como un sustituto de ahorro para la mayoríade las personas que ingresas a una (dada su estructura, sólo las personasque reciben el dinero en el primer número pueden considerarlo como unsustituto de crédito a pagos mensuales sin intereses) podemos entenderque la mayoría de las personas de nuestra muestra ahorran poco por loque recurren al empeño cuando les surge un gasto inesperado.

El gasto familiar semanal del encuestado lo usamos como una proxy delingreso mensual de los usuarios. El dato con mayor frecuencia fue ungasto entre 550 y 1000 pesos semanales, lo que se traduce en 2000 o 4000pesos gasto mensual en la familia (41.2 %).

La mayoría de las personas que empeñan son clientes frecuentes, puescerca del 90 % de los encuestados han realizado mínimo un empeño conanterioridad.

Notamos que la cantidad de personas que no lograron desempeñar fueligeramente mayor a la que logro desempeñar.

La mayoría de las personas en nuestra muestra tienen poco autocon-trol (factor de descuento muy elevado) en cuestiones monetarias, puesel 71.3 % prefieren recibir 100 pesos mañana que recibir 50 % de rendi-miento por esperar un mes (150 pesos).

TABLA 3: PROBABILIDAD DE DESEMPEÑO

Variable Contrato Éxito Fracaso Total Probabilidad de Desempeño

Producto 1 220 265 485 45 %2 211 222 433 49 %3 156 197 353 44 %

4A 315 361 676 47 %4B 122 78 200 61 %5A 170 210 380 45 %5B 99 95 194 51 %

Total 1,293 1,428 2,721 48 %


En la tabla 3 podemos notar la frecuencia de éxito (lograr desempeñar) ofracaso (no desempeño) de cada uno de los contratos, así como la probabi-lidad de éxito de los mismos. En particular, notamos que el contrato 4B fueel único que aumento la probabilidad de desempeño de los individuos, estose puede entender porque los usuarios se auto seleccionan y en este contratotenemos sólo pignorantes sofisticados y en el contrato 4A tenemos sólo pig-norantes ingenuos, en otras palabras, tenemos endogeneidad en este tipo decontratos donde los individuos eligen voluntariamente el tipo de contrato quedesea.

4. RESULTADOS

REGRESIONES E INTERPRETACIÓN

En esta sección se detalla la estructura de cada una de las regresiones utili-zadas en el experimento, así como la interpretación de cada una de ellas. Paraesto, vale la pena recordar que la estructura de los contratos fue la siguiente:

Contrato 1: Préstamo a tres meses sin pagos forzosos.

Contrato 2: Préstamo a tres meses con pago mensual forzoso (de no rea-lizar el pago se cobra una pena por no pago equivalente al 2 % del montoprestado).

Contrato 3: Préstamo a tres meses con pago mensual opcional (de no rea-lizar el pago no se cobra ninguna pena, es decir, solo se cree en la palabradel usuario).



En la tabla 4 se presenta la estructura de cada una de las regresiones utili-zadas en el trabajo y, posteriormente, se describe la intención de cada una deellas y se analizan sus resultados.


TABLA 4: ESTRUCTURA DE REGRESIONES

Regresión Estructura Explicación

Regresiones pararespaldar la

aleatoriedad de laprueba

V ariable =0+1(contrato2)+2(contrato3)+3(contrato4) + 4(contrato5) +

Serie de regresiones con cada una delas características de los pignorantecomo variable dependiente y cada

contrato como variablesindependientes

Regresiones paravalidar problemas de

autoselección

V ariable =0+1(contrato2)+2(contrato3)+3(contrato4A)+4(contrato4B)+5(contrato5A)+6(contrato5B)+

Serie de regresiones con cada una delas características de los pignorante

como variable dependiente y cada unode los subcontratos como variables

independientes.

Regresión paraentender los efectos

de cada contrato

pp_pago =0+1(contrato2)+2(contrato3)+3(contrato4) + 4(contrato5) +

El porcentaje de pago como variabledependiente y cada contrato como

variables independientes.

Regresión paraentender los efectosde cada subcontrato

pp_pago =0+1(contrato2)+2(contrato3)+3(contrato4A)+4(contrato4B)+5(contrato5A)+6(contrato5B)+

El porcentaje de pago como variabledependiente y cada uno de

los,subcontrato como variablesindependientes.

Regresión decaracterísticas

pp_pago = 0 + 1(val pren) +2(contrato2) + 3(contrato3) +

4(contrato4A)+5(contrato4B)+6(contrato5A)+7(contrato5B)+8(gen) + 9(educacion) + 10(tconsis1) + 11(pres antes) +

12(ctatanda)+13(gastofam)+

El porcentaje de pago como variabledependiente y los contratos,

subcontratos y características de lospignorantes como variables

independientes.

donde

Variable = Cada una de las características obtenidas en la encuesta (ge-nero, educación, valor de prenda, participación en tanda, gasto semanal,etc.)

pp-pago = porcentaje de pago (ratio de pago total entre monto prestado)

val-pren = valor de la prenda (estimación del pignorante)


contrato2 = dummy de contrato 2

contrato3 = dummy de contrato 3

contrato4A = dummy de contrato 4A

contrato4B = dummy de contrato 4B

contrato5A = dummy de contrato 5A

contrato5B = dummy de contrato 5B

gen = genero (0=hombre y 1=mujer)

educacion = 0= ninguno, 1= Primaria, 2= Secundaria, 3= Preparatoria, 4=más de Preparatoria

t-consis1 = time consistensy (0 = Mañana y 1= en un mes)4

Regresiones para respaldar la aleatoriedad de la prueba

Para comprobar la aleatoriedad del experimento, se corrieron una serie deregresiones. En éstas se puso como variable dependiente cada una de las ca-racterísticas de los individuos y como variable independiente cada uno de loscontratos.

Hay que tomar en cuenta que cada una de las variables que describen las ca-racterísticas de los pignorantes están expresadas de diferente manera, es decir,algunas son dicotómicas, otras continuas y algunas discretas. Por lo anterior, setuvieron que correr regresiones tipo probit, logit y lineal para tener evidenciay validar la aleatoriedad de nuestra muestra.

Al analizar los resultados de estas regresiones, es posible notar que hayalgunas variables que tienen contratos que resultan significativos pero esto su-cede en muy pocos casos, por lo que entran dentro de nuestro rango de error yvalidamos que nuestra muestra cumple con la aleatoriedad necesaria.5

Regresiones para valorar problemas de autoselección

Por otro lado, con la intención que tengamos diferentes tipos de personasen diferentes contratos (principalmente 4A, 4B, 5A y 5B) o, en otras palabras,validar los problemas de autoselección.

Después de analizar los resultados, se puede afirmar que no hay problemasrelevantes de autoselección como se esperaba, pues no encontramos variablessignificativas en los contratos de elección.6

4El término time consistency consiste en valorar que las decisiones de los individuos se man-tengan durante el tiempo. En otras palabras, que las decisiones de los individuos sean las mismassin importar la temporalidad de estas.

5Ver anexo 2 para resultados6Ver Anexo 3 para resultados


Regresiones para entender los efectos de cada contrato

Para resolver esta pregunta, se metió el monto prestado como parte de lavariable dependiente y se generó la variable pp-pago que es el ratio del montopagado entre el monto prestado:

pp pago = pago

prestamo

= monto pagado

monto prestado

Con la intención de entender la manera en que los contratos afectan el por-centaje pagado, se corrió una regresión tomando el porcentaje de pago comovariable dependiente y cada contrato como variables independientes. Se obtu-vieron los resultados descrito en la tabla 5.

TABLA 5: RESULTADOS POR TIPO CONTRATO

pp-pago Coef. t P

contrato2 0.0050674 0.13 0.895contrato3 -0.0302173 -0.74 0.459

contrato4A 0.0242517 0.7 0.485contrato4B 0.200958 4.1 0contrato5A -0.004711 -0.12 0.906contrato5B 0.0588237 1.19 0.235

_cons 0.6393402 24.14 0

N 2721

Se puede notar que el contrato 4 resulta prácticamente significativo, por loque se puede entender que hay un efecto causal por parte de este contrato.7

Por otro lado y con el objeto de entender los efectos de cada subcontrato enel porcentaje de pago, se corrió la misma regresión anterior con la diferenciaque ahora el contrato 4 y contrato 5 se dividió en cada subcontrato (4A, 4B, 5Ay 5B). Los resultados para esta especificación se muestran en la tabla 6.

TABLA 6: RESULTADOS POR TIPO CONTRATO DIVIDIENDO POR SUBCONTRATOS

pp_pago Coef. t P

contrato2 0.0050674 0.13 0.895contrato3 -0.0302173 -0.74 0.459

contrato4A 0.0242517 0.7 0.485contrato4B 0.200958 4.1 0contrato5A -0.004711 -0.12 0.906contrato5B 0.0588237 1.19 0.235

_cons 0.6393402 24.14 0

N 2721

7Ver Anexo 4


A partir de estos resultados, se puede inferir que hay una correlación sig-nificativa entre el porcentaje pagado y el contrato 4B pues esta incrementa elporcentaje total pagado en un 20 %.8

Regresión de características

Una vez analizando la información con la que contamos en nuestra base dedatos y las regresiones previas, me surge la siguiente pregunta:

¿Qué características de los usuarios afectan el porcentaje de pago delpréstamo?

Entonces se construyó una regresión tomando el porcentaje de pago comovariable dependiente y las características de los usuarios como variables inde-pendientes. Los resultados se muestran en la tabla 7:

TABLA 7: RESULTADOS CONSIDERANDO LAS CARACTERÍSTICAS DE LOS USUARIOS

pp-pago Coef. t P

val-pren 2.50E-06 1.85 0.064contrato2 0.006711 0.15 0.881contrato3 -0.0438274 -0.93 0.35

contrato4A 0.0013737 0.03 0.973contrato4B 0.1720876 3.05 0.002contrato5A -0.0376636 -0.81 0.416contrato5B -0.0134974 -0.23 0.819

Gen -0.0007131 -0.03 0.98Educacion -0.0115983 -0.9 0.366t-consis1 0.055046 1.92 0.055

pres-antes 0.1920991 4.46 0cta-tanda 0.0321738 1.2 0.229gasto-fam 0.0331614 2.63 0.009

_cons 0.4265043 6.29 0

N 2721

En esta regresión se obtuvo una r cuadrada de .0254 y podemos notar quetenemos 4 variables significativas en el modelo las cuales son:

contrato4B

prestamo-antes

gasto-fam

la constante8Ver Anexo 4A


El contrato 4B (producto 2 presentado como producto 4B, es decir, los indi-viduos eligieron voluntariamente el producto 2) nos dice que el hecho de quelos individuos elijan este producto aumenta el porcentaje de pago en 17.20 %en relación a cualquier otro contrato, y del mismo modo, resulta relevante laconstante (el producto 1) pues el hecho de que te haya tocado este producto au-menta el porcentaje de pago en 42.65 %. También notamos que el hecho de quelos usuarios hayan realizado un empeño anteriormente aumenta la porcentajede pago en 19.20 % comparado con las personas que nunca habían empeñadoantes. Por último, la variable gasto-fam resulto significativa siendo esta unaproxy del ingreso de los usuarios. Dicho lo anterior y analizando el coeficien-te de esta variable, podemos entender que entre mayor sea el ingreso de losusuarios mayor es su porcentaje de pago pues, aproximadamente por cada milpesos de gasto semanal en el hogar, aumenta en 3.31 % el porcentaje de pago.

PRUEBA BREUSCH-PAGAN

La prueba Breuch-Pagan nos ayuda ver si nuestro modelo cumple los su-puestos de homoscedasticidad. Si el P Value es menor a 0.05 no se cumple lahipótesis nula (siendo esta que el modelo es homocedástico). Al correr la prue-ba, notamos que el p-Value mayor a .05 por lo que se cumple la hipótesis nulay mostramos que nuestro modelos es homoscedástico.9

5. CONCLUSIONES

Después de analizar los resultados obtenidos en el modelo, podemos enten-der mejor el mercado de empeños en México y el comportamiento del mismoante cambios en la estructura y presentación de contratos, pues si nos interesaentender el porcentaje total pagado de los usuarios es importante considerar losiguiente:

Encontramos correlación en el contrato 4B y no en el contrato 2. Esto sepuede deber a que, al permitir que los pignorantes elijan el contrato, és-tos se ven obligados a entender de mejor manera las condiciones de losmismo para tomar la decisión y, en consecuencia, los usuarios compren-den las consecuencias económicas que tendrá el procrastinar los pagos,lo que les ayude a cumplir de mejor manera las clausulas del contrato.Esto por que en el contrato 2, la información se les presenta de la mismamanera, pero al serle asignado este contrato por default, el usuario noprocesa la información de la misma manera y, por ende, no está eligien-do voluntariamente la pena (no hay precommitment) lo que lo lleva a nopagar de la manera correcta. Con esto podemos inferir que lo que generaque se cumpla de mejor manera el contrato no es la pena y el esquemapor si solos, si no que estos hayan sido autoimpuestos por los propiosusuarios.

9Ver Anexo 5A


Lo antes mencionado tiene relación con los resultados obtenidos en elpaper de Ariely y Wertenbroch (2002) pues una de las conclusiones queobtienen de su experimento es que los mejores resultados (calificaciones)se obtienen cuando las fechas límites para la entrega de papers les son im-puestas a los alumnos de manera exógena y éstas están simétricamenteseparadas entre sí, pues la estructura de su experimento permite anali-zar qué tan óptima es la distribución de sus deadlines (tanto impuestas demanera exógenas como autoimpuestas). En los resultados de este traba-jo no podemos analizar cómo afecta la distribución de las fechas límitespues todas son impuestas de manera exógena, pero si podemos agregaral paper de Ariely y Wertenbroch que la imposición de fechas límites porun agente externo en el mercado de empeño no tiene el mismo impacto siel individuo no busca esta herramienta como una forma de autocontrolla cual, de no cumplirse, tendría consecuencias negativas significativas.

Es importante mencionar que no encontramos resultado significativo enlos contratos 3 y 5B a pesar de la elección. Esto se podría entender por elfactor compromiso pues, al no haber una pena de por medio, no hay ra-zón alguna que incentive a distribuir sus pagos de la manera correcta, loque genera que los pignorantes no tengan autocontrol y, en consecuencia,procrastinen su pago hasta la fecha limite.

Notamos que las características que se requieren para que el individuopague la mayor parte de los préstamos son las siguientes:

1. Que el usuario sea un cliente sofisticado, es decir, que elija (volunta-riamente) la modalidad de pagos mensuales forzosos con pena. Loanterior pues muestra el grado de precommitment que tiene sobre elpréstamo.

2. Que el ingreso del usuario sea superior al promedio de los clientes.

3. Tener experiencia en préstamos prendarios. Otro factor que estuvomuy cerca de ser considerado como una de las características queexplican un mayor porcentaje de pago fue tener un factor de des-cuento bajo en cuanto al dinero (p-value de .055) pues nos ayudaa explicar, hasta cierto punto, el nivel de sofisticación de un clientepues un cliente con un factor de descuento alto resulta menos sofis-ticado que uno con un factor de descuento más bajo.

Después de analizar los resultados de este trabajo, sería importante consi-derar una política pública la cual permita a los individuos emplear herra-mientas de autocontrol para decisiones financieras. Un ejemplo sería en elmercado de fondos de ahorro para el retiro con la intención de incentivarlas aportaciones voluntarias, pues si permitimos a un individuo que elijaentre dos escenarios: el status quo y otro en el que se auto penalice si noincrementa su aportación año con año o en periodos en donde reciba unincremento salarial, quizá se podría incrementar de manera importante


la forma y cantidad de ahorro destinado por los individuos para su futu-ro. Lo anterior traería beneficios muy importantes en la calidad de vidade los adultos mayores y reduciría la cantidad de apoyos destinados porel gobierno a este sector (estos podrían ser enfocados a incentivar otrossectores de la economía con el objeto de optimizar el empleo de los re-cursos gubernamentales). Para futuras investigaciones, sería interesanterealizar un experimento con estas características para analizar qué tanconveniente sería el empleo de una política de este tipo y el efecto de lasherramientas de autocontrol y el precommitment en el mercado de fondosde ahorro para el retiro.

REFERENCIAS

Ariely, D., y Wertenbroch, K. (2002). Procrastination, deadlines, and perfor-mance: Self-control by precommitment. Psychological science, 13(3), 219–224.

Charness, G., y Dufwenberg, M. (2006). Promises and partnership. Econome-trica, 74(6), 1579–1601.

Jain, S., y Mansuri, G. (2003). A little at a time: the use of regularly scheduledrepayments in microfinance programs. Journal of development Economics,72(1), 253–279.

Vanberg, C. (2008). Why do people keep their promises? an experimental testof two explanations. Econometrica, 76(6), 1467–1480.

a


ANEXO 1PARA USO DEL ENCUESTADOR ÚNICAMENTE: Fecha Sucursal

Encuestador Nuevo Actualización

ENCUESTA DE ENTRADA A LA SUCURSAL

La siguiente encuesta es CONFIDENCIAL y no afectará las condiciones de supréstamo. Por favor, responda marcando una X en el espacio indicado o llenando el

espacio indicado. ¡MUCHAS GRACIAS!

Si usted llenó anteriormente la encuesta, debe contestar al menos las preguntas 1, 2 y 3

1. Su prenda fue: heredada un regalo la compró prestada otro2. Del 0 al 100 marque con una X lo seguro que está usted de recuperar su pren-

da:0 (nada seguro) 2—2—2—2—2—2—2—2—2—2 100 (totalmente seguro)

3. ¿Cuánto cree que vale la prenda que piensa usted empeñar: Pe-sos.Si son varias prendas por favor ponga el valor total que usted estima.

4. Hombre Mujer5. Edad: años6. Casado Soltero Divorciado Viudo7. 2 Trabaja como empleado

2 Tiene negocio propio2 Es ama/amo de casa2 No trabaja2 Jubilado2 Estudia

8. Usted completó: primaria secundaria preparatoria más que prepa-ratoria ninguno

9. ¿En el mes pasado, algún familiar o amigo le pidió dinero?: Sí No10. ¿Es común que sus familiares o amigos le pidan dinero?: Sí No11. ¿Cuál preferiría tener?: (a) 100 pesos mañana (b) 150 pesos en un mes12. ¿Qué tan seguido se siente usted estresado por la situación económica de su

hogar? :2 Siempre2 Muy seguido2 A veces2 Nunca

13. Marque la razón principal por la que usted quiere empeñar una prenda:


2 Necesito el dinero porque alguien en mi familia perdió su empleo2 Necesito el dinero para atender una enfermedad mía o de un familiar2 Necesito el dinero para un gasto urgente2 Necesito el dinero para un gasto que no es urgente

14. ¿Qué tan estresado se siente usted por la situación que lo hace requerir estepréstamo? : Muy estresado Algo estresado Poco estresado Nadaestresado

15. En 3 meses, usted espera tener una situación económica: : Mejor La mismaPeor

16. ¿Ha empeñado antes? : SÍ NO17. ¿Ha empeñado en una sucursal de la Fundación? :

2 NO2 1-2 veces2 3-5 veces2 Más de 5 veces

18. Si usted está guardando dinero, ¿puede evitar que los miembros de su hogarle quiten ese dinero?:2 Siempre lo puedo evitar2 A veces lo puedo evitar2 Difícilmente lo puedo evitar2 Nunca lo puedo evitar

19. ¿Planea o realiza un presupuesto de sus gastos del mes? : Sí No Casisiempre

20. ¿Usted tiene otras prendas que podría empeñar? : Sí No21. ¿Usted tiene ahorros? : Sí No22. ¿Usted participa en una tanda o en una caja de ahorro? : Sí No23. Para llegar a esta sucursal, ¿Cuánto gastó? : pesos24. Para llegar a esta sucursal, ¿Cuánto tardó? : minutos25. ¿Cuánto cree que una familia como la suya gaste en una semana normal? :

2 Menos de 500 pesos2 Entre 500 y 1,0002 Entre 1,000 y 2,000


2 Entre 2,000 y 4,0002 Entre 4000 y 60002 Más de 6,000 pesos

26. ¿Cuánto piensa usted que una familia que empeña en esta sucursal ahorraen una semana normal? : Menos de 200 pesos Entre 200 y 500 Entre 500y 1,000 Más de 1,000

27. ¿Le ha pasado que gasta más de lo que quisiera porque cae en la tentación?:Nunca Casi nunca A veces Muy seguido

28. En los últimos seis meses, alguna vez faltó dinero en su hogar para pagar:2 La renta: Sí No2 La comida: Sí No2 Medicinas: Sí No2 La luz: Sí No2 El gas: Sí No2 El teléfono: Sí No2 El agua: Sí No

29. ¿Cuál preferiría tener?: (a) 100 pesos en tres meses (b) 150 pesos en cuatro meses30. ¿Le gustaría recibir recordatorios gratuitos en su celular?:

2 Sí No2 Si quisiera recibir recordatorios gratuitos, por favor escriba su número ce-

lular:


ANEXO 2

Prueba para Valor de Prenda(1)

VARIABLES Valor de Prenda (val_pren)

contrato2 2,111***(751.4)

contrato3 -382.1(787.9)

contrato4 -20.06(646.6)

contrato5 -929.48(705.5)

Constant 5,276***(523.8)

Observations 2,032R-squared 0.010

Standard errors in parentheses*** p<0.01, ** p<0.05, * p<0.1

Prueba para Género(1)

VARIABLES Género

contrato2 0.0118(0.0296)

contrato3 -0.0249(0.0323)

contrato4 0.00350(0.0256)

contrato5 -0.00260(0.0280)

Observations 2,721Standard errors in parentheses*** p<0.01, ** p<0.05, * p<0.1


Prueba para Nivel de Educación(1)

VARIABLES Educación

contrato2 -.087548(0.118)

contrato3 -.079(.1248)

contrato4 -.1348(.102)

contrato5 -.117(.111)


Prueba para Factor de Descuento(1)

VARIABLES Factor de Descuento

contrato2 -0.00504(0.0295)

contrato3 -0.0333(0.0303)

contrato4 -0.0496**(0.0248)

contrato5 0.0220(0.0280)



Prueba para Participación en Tanda

(1)VARIABLES Participación

contrato2 -0.0707**(0.0310)

contrato3 0.0249(0.0343)

contrato4 -0.0257(0.0274)

contrato5 -0.00402(0.0299)


Prueba para Experiencia en Prestamos Prendarios

(1)VARIABLES Experiencia

contrato2 0.0197(0.0181)

contrato3 0.00856(0.0199)

contrato4 0.0292*(0.0159)

contrato5 0.0209(0.0170)



Prueba para Gasto Familiar Semanal(1)

VARIABLES Gasto familiar semanalcontrato2 -2.84

(.121)contrato3 -0.33

(0.127)contrato4 -0.134

(0.127)contrato5 0.072

(0.113)



ANEXO 3Prueba para Valor de Prenda

(1)VARIABLES Valor de Prenda

contrato2 2,111***(750.5)

contrato3 -382.1(786.9)

contrato4A 273.6(678.3)

contrato4B -1,001(946.9)

contrato5A -1,651**(774.7)

contrato5B 620.3(987.4)

Constant 5,276***(523.1)

Observations 2,032R-squared 0.013Standard errors in parentheses*** p<0.01, ** p<0.05, * p<0.1

Prueba para Género(1)

VARIABLES Género

contrato2 0.0118(0.0296)

contrato3 -0.0249(0.0323)

contrato4A -0.0136(0.0271)

contrato4B 0.0622*(0.0354)

contrato5A -0.00813(0.0312)

contrato5B 0.00826(0.0381)



Prueba para Nivel de Educación(1)

VARIABLES Educación

contrato2 -.087(.118)

contrato3 -.079(.124)

contrato4A -.180(.107)

contrato4B .027(.159)

contrato5A -.400(.123)

contrato5B .50(.16)


Prueba para Factor de Descuento(1)

VARIABLES Factor de Descuento

contrato2 -0.00504(0.0295)

contrato3 -0.0332(0.0303)

contrato4A -0.0558**(0.0256)

contrato4B -0.0274(0.0365)

contrato5A 0.00978(0.0309)

contrato5B 0.0461(0.0396)



Prueba para Participación en Tanda(1)

VARIABLES Participación

contrato2 -0.0707**(0.0310)

contrato3 0.0249(0.0343)

contrato4A -0.0161(0.0288)

contrato4B -0.0576(0.0395)

contrato5A 0.00745(0.0334)

contrato5B -0.0265(0.0407)


Prueba para Experiencia en Prestamos Prendarios(1)

VARIABLES Experiencia

contrato2 0.0196(0.0181)

contrato3 0.00855(0.0199)

contrato4A 0.0273*(0.0163)

contrato4B 0.0319(0.0212)

contrato5A 0.0280(0.0180)

contrato5B 0.00639(0.0242)



Prueba para Gasto Familiar Semanal(1)

VARIABLES Gasto familiar semanalcontrato2 -2.85

(.121)contrato3 -0.033

(0.127)contrato4A -0.241

(0.109)contrato4B 0.23

(0.15)contrato5A -.071

(0.1263)contrato5B 0.352

(0.155)



ANEXO 4

Efectos por Contrato(1)

VARIABLES Porcentaje de pago

contrato2 0.00507(0.0387)

contrato3 -0.0302(0.0409)

contrato4 0.0646*(0.0331)

contrato5 0.0168(0.0361)

Constant 0.639***(0.0266)

Observations 2,721R-squared 0.003

Standard errors in parentheses*** p<0.01, ** p<0.05, * p<0.1


ANEXO 4A

Efectos por Contrato(1)

VARIABLES Pago

contrato2 0.00507(0.0386)

contrato3 -0.0302(0.0408)

contrato4A 0.0243(0.0347)

contrato4B 0.201***(0.0490)

contrato5A -0.00471(0.0400)

contrato5B 0.0588(0.0495)

Constant 0.639***(0.0265)



ANEXO 5

(1)VARIABLES Pago

val_pren 2.50e-06*(1.35e-06)

contrato2 0.00671(0.0449)

contrato3 -0.0438(0.0469)

contrato4A 0.00137(0.0405)

contrato4B 0.172***(0.0564)

contrato5A -0.0377(0.0463)

contrato5B -0.0135(0.0588)

gen -0.000713(0.0284)

educacion -0.0116(0.0128)

t_consis1 0.0550*(0.0287)

pres_antes 0.192***(0.0430)

cta_tanda 0.0322(0.0267)

gasto_fam 0.0332***(0.0126)

Constant 0.427***(0.0678)


EL IMPACTO DE LA DISCAPACIDAD EN EL HOGAR SOBRE LAASISTENCIA ESCOLAR: EVIDENCIA DE MÉXICO

ZEKY AHMED MURRA ANTÓN†.

En el presente trabajo se explora el impacto que tiene la presencia de un indivi-duo discapacitado dentro del hogar sobre la asistencia escolar de los individuos deéste. Para ello, se desarrolló un modelo microeconómico novedoso que sugiere unposible mecanismo de transmisión entre las variables, para luego, con informaciónde la Encuesta Nacional de Ingreso y Gasto de los Hogares (ENIGH) 2012, estimarformalmente la relación. Los resultados muestran que existe evidencia significativade que la presencia de un individuo discapacitado en el hogar reduce la probabi-lidad de asistencia escolar de los miembros de éste, en edad universitaria (19 a 24años). En concreto, estos individuos sufren una reducción del 14.7 %, significativa al99 % de confianza, en la probabilidad de asistir a la escuela, asociada exclusivamentea la presencia de un individuo discapacitado en el hogar. Este resultado fue robustoante variaciones significativas en la especificación del modelo econométrico, estima-ciones bajo diferentes definiciones de los grupos etarios y ante pruebas cuantitativasy cualitativas de exogeneidad y validez de la estrategia de identificación.

PALABRAS CLAVE: discapacidad en el hogar, asistencia escolar, economía de laeducación, economía laboral, economiía de la salud, desarrollo económico, micro-econometríaCÓDIGO JEL: I00, I1.

1. INTRODUCCIÓN

Un tema central en la política económica del actual gobierno mexicano, so-bre todo en materia de salud, ha sido el de la discapacidad y el rol que jueganlos individuos dispacacitados dentro de la sociedad. Por vez primera, el PlanNacional de Desarrollo (PND) 2013-2018 establece la elaboración del ProgramaNacional para el Desarrollo y la Inclusión de las Personas con Discapacidad.Este programa además está asociado con 4 de las 5 metas nacionales para elpresente sexenio y con la Ley General para la Inclusión de las Personas conDiscapacidad, publicada en 2011, que contempla medidas legislativas y admi-nistrativas para que los individuos con discapacidad obtengan sus derechos enmateria de salud y laboral, entre otrasGobierno Federal de los Estdos UnidosMexicanos (2014).

No obstante, un obstáculo que ha encontrado la política pública en materiade discapacidad, en el presente, es que los hacedores de política reconocen lafalta de investigación rigurosa, comparable y aplicable a la realidad mexicana,que determine la viabilidad e impacto de dichas políticas. Por ende, se entiende

†Health Economics and Outcomes Research Manager en Bayer de México, profesor de asig-natura del ITAM y miembro investigador de la International Society of Pharmacoeconomics andOutcomes Research (ISPOR). La presente investigación se realizó con total independencia de lasorganizaciones antes mencionadas y es responsabilidad total del autor. Agradezco infinitamentetodas las ideas y el apoyo que me brindaron Laura Juárez, Carlo Alcaráz, Claudia Aburto y Ma-nett Vargas en el desarrollo de este artículo. De igual manera, agradezco los comentarios de los tresrevisores anónimos que participaron en la edición del mismo. Contacto: [email protected]

[email protected]

128 EL IMPACTO DE LA DISCAPACIDAD EN EL HOGAR

y acepta que hasta la fecha, los individuos con discapacidad no tienen, en granparte de los casos, acceso a los derechos y equidad que como mexicanos lescorrespondenGobierno Federal de los Estdos Unidos Mexicanos (2014). Enton-ces, el generar evidencia aplicable para México sobre discapacidad y el cómoesta afecta a los individuos, relacionados directa o indirectamente con dichasituación, parece una necesidad fundamental.

En particular, en la presente investigación se exlorará la relación que exis-te entre la discapacidad en el hogar y la formación de capital humano en elmismo. Numerosa literatura se ha publicado sobre la relación del ingreso y laformación de capital humano que sugiere que este último es de vital importan-cia para el crecimiento económico. Así, la presente investigación es importanteporque, en caso de que la discapacidad y la formación de capital humano esténrelacionadas, podría entonces existir persistencia de los efectos negativos de ladiscapacidad sobre las decisiones de los individuos y su nivel de vida, y, porende, sobre el desempeño económico, a través del tiempo.

Esta investigación, además, representa una aportación novedosa a la teoríay evidencia del impacto de la discapacidad el mercado de la educación, puessi bien existe investigación previa realizada sobre educación y discapacidadpor separado, no se ha investigado mucho sobre las interacciones y la posiblecausalidad que pudiera existir entre ellas. Además, no fue posible identificaralguna otra investigación aplicable para México en temas de discapacidad ycapital humano.

En la literatura se ha investigado, por ejemplo, el impacto del entorno so-cioeconómico en el éxito escolar de los individuos, así como el impacto de ladiscapacidad en el hogar sobre el ingreso agregado de este y la atención que sededica a los hijos. Como consecuencia de combinar intuitivamente las conclu-siones de dichos estudios, parece defendible la idea de que la discapacidad enel hogar es un factor que tendrá un peso significativo al tomar las decisioneseducativas.

Esto se debe, principalmente, a dos factores: por una parte, la discapacidadde un individuo en el hogar es una condición de salud1 de alguno de sus inte-grantes que puede requerir una utilización muy específica de recursos mone-tarios, lo que modifica los recursos disponibles para el resto de los integrantesdel hogar, potencialmente utilizables en otras actividades, entre ellas estudiar.Por otra parte, dicha condición de salud puede requerir una utilización espe-cífica de recursos temporales por parte de los demás individuos en el hogar, loque modifica las restricciones temporales que están involucradas en el procesode asignación del tiempo entre diversas actividades, entre ellas, estudiar.

Por las razones expuestas, el presente trabajo se construye a partir de la

1Según la Organización Mundial para la Salud (OMS, 2014): Discapacidad es un término gene-ral que abarca las deficiencias, las limitaciones de la actividad y las restricciones de la participación.Las deficiencias son problemas que afectan a una estructura o función corporal; las limitaciones dela actividad son dificultades para ejecutar acciones o tareas, y las restricciones de la participaciónson problemas para participar en situaciones vitales. Por consiguiente, la discapacidad es un fe-nómeno complejo que refleja una interacción entre las características del organismo humano y lascaracterísticas de la sociedad en la que vive.

ZEKY AHMED MURRA ANTÓN 129

investigación formal de la siguiente pregunta de investigación: >Cuál es el im-pacto de que exista un individuo discapacitado en el hogar sobre la asistenciaescolar de sus integrantes?, o alternativamente, y derivado de la pregunta deinvestigación, de la hipótesis de investigación siguiente: la presencia de disca-pacidad en el hogar reduce la probabilidad de que los individuos en edad es-colar asistan a la escuela. Para lograrlo, y aportar evidencia novedosa y sólidaa la literatura, se utilizó una muestra de individuos en edad escolar (conside-rada de 6 a 24 años de edad), proveniente de la Encuesta Nacional de Ingresoy Gasto de los Hogares (ENIGH) 2012, levantada por el Instituto Nacional deEstadística y Geografía (INEGI).

La estimación estadística utilizada para probar la hipótesis de investigaciónderivó de un modelo de respuesta discreta tipo Probit que modeló la probabi-lidad de que un individuo perteneciente a la muestra mencionada asistiera ala escuela, en función de diversas variables de control socioeconómicas, de ca-racterísticas del jefe del hogar y de seguridad social. La variable de interés, eneste caso, fue una indicadora de presencia de discapacidad en el hogar. Adicio-nalmente, se realizaron estimaciones por rangos de edad, para buscar efectosdiferenciados por nivel educativo, contemplando educación primaria (6 a 12años), educación secundaria y preparatoria (13 a 18 años) y educación profe-sional (19 a 24 años).

A partir de las estimaciones estadísticas, se encontró evidencia empíricade que existe una relación inversa y causal de que haya un individuo disca-pacitado en el hogar sobre la probabilidad de asistencia escolar de los demásindividuos del hogar, en edad de asistir a educación profesional: en promedio,un individuo de estas características tiene 14.7 % menos probabilidades (signi-ficativo al 99 % de confianza) de asistir a la escuela, que su contrafactual en unhogar sin presencia de discapacidad.

Adicionalmente, se encontró que no existe un efecto significativo de tenerun individuo discapacitado en el hogar sobre la probabilidad de asistencia es-colar de sus miembros, para los individuos en edad de los niveles educativosprimaria, secundaria y preparatoria. El efecto es tan grande y significativo paraindividuos en edad de asistir a escuela profesional que se hace extensivo a lasestimaciones agregadas para toda la muestra.

Estos resultados implican que los individuos en edad de asistir a un ni-vel educativo profesional, dentro de la sociedad mexicana, son sensibles a lasmodificaciones en las restricciones temporales y monetarias que enfrentan altomar decisiones, en este caso instrumentadas a través de la presencia de dis-capacidad en el hogar, por lo que la hipótesis de investigación es aceptada, almenos, para esta muestra de la población analizada.

El trabajo está estructurado de la siguiente manera: en la sección 2 se pre-senta una revisión de la literatura, tanto teórica como empírica; en la sección3 se desarrolla el modelo teórico que da sustento a la estimación empírica; enla sección 4 se describe la información utilizada, el modelo econométrico a es-timar y se discute la validez de la estrategia de identificación; en la sección 5se presentan los resultados de las estimaciones; en la sección 6 se discuten losresultados y la robustez de las estimaciones; en la sección 7 se presentan las


conclusiones de la presente investigación.

2. REVISIÓN DE LA LITERATURA.

Se llevó a cabo una revisión de la literatura teórica sobre la relación entrela discapacidad de alguno de los individuos del hogar y la asistencia escolarde sus integrantes, sin embargo, no se encontró ningún modelo que relaciona-ra estas dos variables. Como consecuencia de la ausencia de material teóricopublicado, se desarrolló un modelo microeconómico en dos etapas que ayudaa entender un posible mecanismo de transmisión del impacto de la discapa-cidad en el hogar sobre las decisiones educativas. Este modelo es detallado aprofundidad en la sección 3.

Asímismo, se llevó a cabo una revisión de la literatura empírica sobre larelación entre la discapacidad de alguno de los individuos del hogar y la asis-tencia escolar de sus integrantes, sin embargo, no se encontró ningún artículoque estudiara esta relación directamente. Al respecto, se encontraron algunosartículos sobre el impacto de la discapacidad de alguno de los miembros delhogar sobre el ingreso y nivel de consumo del mismo, así como del entornosocial sobre el éxito escolar, que, por ser de estrecha relación con la hipótesisde investigación, aunque de manera indirecta, se incluyeron en la presente re-visión.

En relación al entorno socioeconómico, la literatura habla de que es un fac-tor importante sobre el desempeño escolar. Se encontró un estudioThomas yStockton (2003) que hace una revisión extensiva de la literatura publicada alrespecto: entre las principales variables socioeconómicas que afectan al desem-peño escolar están el género, la raza y la posición social de la familia. Este resul-tado es relevante porque la discapacidad de un miembro del hogar, al reducirel ingreso disponible para bienes de consumo, podría modificar negativamentela posición social de la familia.

Como complemento, la literatura también menciona que la estructura fa-miliar y la concentración racial en las escuelas son factores que impactan signi-ficativamente el desempeño escolarBankston III y Caldas (1998). Este artículofue realizado con información de 18,000 estudiantes afroamericanos y blancosde décimo grado en Estados Unidos, donde se analizó el impacto del entornoescolar sobre el desempeño escolar, a través de un análisis de regresión linealmúltiple.

Uno de los principales resultados al respecto es que los hijos que provie-nen de familias donde el jefe del hogar es mujer tienen un menor éxito escolarque aquellos que provienen de familias donde el jefe del hogar es hombre. Es-te resultado es importante para el presente trabajo pues hogares donde el jefedel hogar es mujer podrían estar correlacionados positivamente con ser hoga-res donde sólo existe un aportador de ingresos y, en promedio, el ingreso nolaboral que tienen disponibles los hijos tendería a ser menor.

Finalmente, dentro de la literatura también hay evidencia de que los niñosprovenientes de hogares con un miembro discapacitado son más propensos


a experimentar desórdenes de comportamiento severosLeClere y Kowalews-ki (1994). Este estudio fue realzado con información de la Encuesta Nacionalde Salud de Estados Unidos en 1988, en donde se analizó a los niños entre 5y 17 años para determinar si éstos son mas propensos a sufrir desórdenes decomportamiento o accidentes si viven en un hogar donde un miembro es dis-capacidato.

Este resultado se debe, seguún los autores, a que en los hogares con un in-dividuo que experimenta discapacidad crónica, este requiere mucho tiempo yrecursos monetarios, lo que podría desplazar a los demás individuos del focode atención de los padres. El fenómeno descrito, aunado a que los niños másgrandes del hogar, según los autores, enfrentan responsabilidades mayores alas que pueden cargar, dada su edad, podría tener un impacto sobre el desem-peño escolar de los individuos.

Con respecto al efecto de la discapacidad sobre el consumo y el ingreso delhogar, se encontró evidencia en la literatura de jefes del hogar de género mas-culino, de que 10 años después de haber experimentado discapacidad severay crónica por primera ocasión, el ingreso del hogar se redujo en 79 %, el gastodel hogar en consumo se redujo en 24 % y el gasto en consumo de alimentos enel hogar se redujo en 22 %, en promedio Meyer y Mok (2009).

Este artículo utilizó información longitudinal para el periodo de 1968-2005,con una muestra para jefes del hogar de género masculino para analizar cómola prevalencia de discapacidad en años productivos afectó las trayectorias delconsumo, ingreso y ganancias inclusive a mediano plazo. Asímismo, el artícu-lo analizó diferentes tipos de severidad de la discapacidad y duración de lamisma.

A partir de los artículos revisados en esta sección, podemos concluir que laliteratura empírica sugiere, al menos de manera indirecta, que el entorno socio-económico y el estatus de discapacidad en el hogar son situaciones que podríanafectar de manera significativa el desempeño escolar de los niños (siendo laasistencia un nivel de incidencia derivado del desempeño), ya sea por razonesmonetarias, de demandas de mayor responsabilidad en el hogar o de atenciónpor parte de los padres.

3. MODELO TEÓRICO

Al no haber encontrado artículos dentro de la literatura teórica que relacio-nen la discapacidad de algún miembro del hogar con la asistencia escolar delos demás, este artículo representa una aportación novedosa a la literatura deeconomía de la salud y laboral. A continuación se presenta un modelo micro-económico que ilustra un potencial mecanismo de transmisión para el impactode la presencia de discapacidad en el hogar sobre sus decisiones escolares. Elmodelo se basa en la idea de que la discapacidad en el hogar, de alguna mane-ra, reduce el ingreso no laboral de los individuos que no trabajan.

El modelo incluye dos etapas: en la primera, el jefe del hogar decide có-mo repartir los recursos disponibles en el hogar entre los integrantes de este,


de tal suerte que transfiere la parte correspondiente a estos en forma de unatransferencia de suma fija; en la segunda, los agentes observan el ingreso nolaboral que se les asignó y deben maximizar sus canastas de consumo presentey futuro, tomando en cuenta la asignación de recursos temporales.

En la primera etapa, el jefe del hogar maximiza una función de utilidadagregada para el hogar (que contiene n individuos), representada por UH , quedepende positivamente de la utilidad de cada uno de los individuos (i) al inte-rior del hogar, representada por ui. La utilidad individual en la primera etapadepende del consumo de los individuos en ese periodo inicial, Ci0, del nivel desalud inicial exógeno Si0, del nivel de salud que el individuo puede comprar,Si y de un nivel mínimo de salud que debe mantener el individuo (S) para con-tinuar derivando utilidad en el consumo. Estas relaciones se pueden observara continuación:

UH = U(u1, u2, ..., un) (3.1)

ui = Vi(Ci0) + Θ(Si0 + Si − S);∀i = 1, 2, ..., n (3.2)

Para el análisis, supondremos que existen rendimientos marginales positi-vos y decrecientes en UH y ui, además de que estas son diferenciables y cua-sicóncavas. En el caso de la función de utilidad individual (ecuación 3.2), lafunción Θ solamente genera utilidad positiva en los números reales positivos,por lo que es necesario que se cumpla que Si+S0 > S. Asumiremos que si estarelación no se cumple, el individuo recibe una desutilidad tan grande que nole permite disfrutar de ningún otro bien. Un ejemplo para la forma funcionalde Θ es el logaritmo natural, donde si no se alcanza el valor mínimo de salud,la desutilidad generada tiende a infinita.

En esta primera etapa, los individuos maximizan entonces su función deutilidad, representada en la ecuación 3.2, sujeto a una restricción presupuesta-ria que inicialmente sólo depende de ingreso no laboral (Ni), para determinarel nivel eficiente de consumo de salud. La idea de esta primera etapa es que elindividuo debe alcanzar un nivel de salud mínimo y un consumo inicial antesde poder decidir si prefiere trabajar o estudiar. En este contexto, cada individuoresuelve el siguiente problema de maximización:

maxCi0,Si

Vi(Ci0) + Θ(Si0 + Si − S) (3.3)

s.a. Ni = PcCi0 + PsSi (3.4)

Del cual se deriva la demanda individual por salud del individuo i (S∗i ),que depende de los parámetros: precio del consumo, Pc, precio de la salud, Ps,Ni, Si0 y S. Aquí es importante recordar que debe ser cierto que Si + Si0 > Spara que el individuo pueda derivar utilidad positiva en el consumo, por loque podemos estar en presencia de soluciones de esquina, donde el individuobusca utilizar todos los recursos que sean necesarios para alcanzar el nivel mí-nimo de salud o estar lo más cerca posible de este. Lo anterior provoca que la


demanda por salud forzosamente dependa del nivel inicial de salud y del nivelmínimo de salud. Entonces, la demanda por salud será de la siguiente forma:

S∗i = Si(Ps, N0, Pc, Si0, S) (3.5)

Esta función tiene una particularidad importante: si se reduce el nivel ini-cial de salud Si0, Si aumentará para cumplir con la restricción Si + Si0 > S,con lo cual, para mantener el mismo nivel de salud de antes del cambio, es ne-cesario reducir el consumo (pues ahora el individuo gasta mayor proporciónde su ingreso en salud y el precio del consumo se mantiene constante). Estoforzosamente reduce la utilidad del individuo. Adicionalmente, gracias a losrendimientos marginales positivos y decrecientes, sabemos que al aumentarNi, aumentará la utilidad del individuo i. Estas relaciones se pueden formali-zar en las siguientes ecuaciones:

ui = ui(Ps, Ni, Si0, S) (3.6)

Con

∂ui∂Ni

> 0 (3.7)

∂ui∂Si0

< 0 (3.8)

Entonces, el jefe del hogar que maximiza la utilidad del hogar enfrenta elsiguiente problema de optimización (donde N es el ingreso total a distribuir):

maxN1,N2,...,Nn

U(u1(Ps, N1, S10 , S), u2(Ps, N2, S

20 , S), ..., un(Ps, Nn, S

n0 , S)) (3.9)

s.a.n∑i=1

Ni = N (3.10)

Donde todos los términos de la siguiente expresión, derivada de las condi-ciones de primer órden, son positivos:

∂U

∂ui

∂ui∂N∗i

=∂U

∂uj

∂uj∂N∗j

;∀i, j = 1, 2, ...n (3.11)

Ahora bien, para analizar el efecto de la discapacidad en el individuo j en elhogar, podemos hacerlo a través de un ejercicio de estática comparativa dondesupondremos que la discapacidad en el individuo j se puede representar comoun choque exógeno negativo a Sj0 (donde el individuo comienza con un menornivel de salud). En presencia de este choque, ceteris paribus, entonces será ciertoque

∂U

∂Ui

∂ui∂Ni

<∂U

∂uj

∂uj∂Nj

;∀i, j = 1, 2, ...n (3.12)


Esto lo sabemos porque la relación 3.8 implica que un cambio exógeno enSj0 reducirá la utilidad del individuo j. Al tener rendimientos marginales posi-tivos y decrecientes, entonces sabemos que esto aumentará la utilidad marginaldel bienestar del individuo j dentro de la función de utilidad agregada del ho-gar, desbalanceando la ecuación 3.11 en la ecuación 3.12. Esto claramente esuna situación subóptima para el jefe del hogar por lo que éste deberá reasig-nar los recursos que tiene disponibles de tal suerte que se vuelva a cumplir larelación 3.11.

Para esto, dado que la función de utilidad individual también tiene rendi-mientos marginales positivos decrecientes, el jefe del hogar deberá incrementarel ingreso no laboral que transfiere al individuo j, de tal suerte que la utilidadde dicho individuo se incremente y su utilidad marginal dentro de la funcióndel hogar se reduzca hasta encontrar el equilibrio nuevamente. Sin embargo,dada la restricción de recursos al interior del hogar (ecuación 3.10), esto impli-cará forzosamente una reducción en el ingreso no laboral transferido al resto delos individuos del hogar (sabemos que no reduciá el ingreso de sólo un indivi-duo manteniendo a los demás constantes porque esto también desbalancearíala relación 3.11 para dicho individuo, en comparación a los demás). Por ende,sabemos que

∂N∗i∂Si0

< 0→ ∂N∗i

∂Sj0> 0;∀i 6= j. (3.13)

En la segunda etapa, una vez recibido el ingreso no laboral inicial (calcula-do eficientemente a partir del problema descrito por las ecuaciones 3.3 y 3.4),los individuos escogerán su consumo y las horas que dedican a trabajar (y porende, a estudiar) pensando en dos periodos. Los individuos en el presente pue-den trabajar(LP ) o estudiar(EP ), y en el futuro, sólamente pueden trabajar(LF ).Si un individuo trabaja en el presente, gana un salario wp. En el futuro, el indi-viduo ganará un salario ponderado por la fracción del tiempo que dedicó en elpresente a cada tarea, donde wA es el salario máximo ganado si sólo estudió ywB es el salario máximo ganado si sólo trabajó (con wA > wB). Mas adelante sedetalla la aplicabilidad de un diferencial en salarios basádo en la educación. Apartir de ahora, el análisis se basa en un sólo individuo del hogar, sin embargo,el análisis es similar para los demás.

El modelo supone preferencias aditivas cuasicóncavas y diferenciables, conrendimientos marginales positivos y decrecientes, sobre el consumo en un pe-riodo determinado. El individuo analizado enfrenta una economía de dos pe-riodos, donde consumeCP en el presente yCF en el futuro. En las preferencias,el futuro se descuenta a una tasa β < 1. En el presente, aunque el individuopuede decidir cuántas horas trabajar, las horas trabajadas sólamente impactanel nivel de consumo, es decir, no tienen otra implicación sobre el bienestar delindividuo por otro mecanismo.

Adicionalmente, el individuo puede transferir recursos libremente entre pe-riodos a una tasa de interés competitiva de r. T son las horas disponibles paracada periodo y N representa el ingreso no laboral estimado en la etapa ante-


rior, mismo que sólo recibe en el primer periodo, cuando todavía es dependien-te económico de su hogar. El individuo de esta economía enfrenta el siguienteproblema de maximización, donde se asume que el consumidor tiene preferen-cias de tipo CES por consumo en cada periodo, con elasticidad de sustituciónintertemporal 0 ≤ γ ≤ 1.

maxCP ,CF ,LP

CP1− 1

γ

1− 1γ

+ βCF

1− 1γ

1− 1γ

(3.14)

s.a. CP +CF

1 + r= N + wPLP +

1

1 + r

(wAEP + wBLP

T

)T (3.15)

Las condiciones de primer órden para el consumo en cada periodo requie-ren que para optimizar, se cumpla la condición de equimarginalidad intertem-poral, representada por la ecuación 3.16, además del cumplimiento de la ecua-ción 3.15.

C1γ

F

C1γ

P

= (1 + r)β (3.16)

Entonces, las condiciones de optimalidad descritas, combinadas entre sí,determinan la siguiente relación entre consumo y trabajo en el presente (el de-talle del cálculo se puede consultar en el Apéndice 1):

LP =CP (1 + (1 + r)

γ−1βγ)−N − 1

1+rwAT

wP − 11+r (wA − wB)

(3.17)

Para medir el impacto teórico que tendría una disminución del ingreso nolaboral sobre la asistencia escolar, basta con ver la derivada parcial de el trabajoen el primer periodo con respecto a esta variable: la asistencia escolar tendrájustamente el signo opuesto. En este caso, si derivamos la ecuación 3.17 conrespecto al ingreso no laboral tendremos la siguiente relación:

∂LP∂N

=1

wP − 11+r (wA − wB)

(∂CP∂N

(1 + (1 + r)γ−1

βγ)− 1) (3.18)

Y para determinar el signo de esta derivada, analizaremos el signo de lossiguientes dos términos por separado:

1

wP − 11+r (wA − wB)

(3.19)

∂CP∂N

(1 + (1 + r)γ−1

βγ)− 1 (3.20)

Con respecto al primer término, existe literatura empírica que ha estudia-do ampliamente el retorno a la educación y que ha encontrado que el sala-rio de mercado crece con la educación y lo hace de manera significativaCard


(1999)Psacharopoulos (1981)Spence (2002)Hanushek y cols. (2015). Por esta ra-zón, podemos esperar que la siguiente relación se cumpla, ya que wA >> wB :

1

wP − 11+r (wA − wB)

≤ 0 (3.21)

Con respecto a la expresión 3.20, es clave conocer el signo y magnitud dela derivada parcial del consumo en el presente respecto del ingreso no laboral.En particular, esta expresión podría tener signo ambiguo dependiendo de lamagnitud de los componentes. En general, en población donde los individuossean de ingreso medio o bajo y no descuenten demasiado el futuro, podemosesperar que la expresión tenga signo positivo. Aunque estos dos supuestoshacen referencia al grueso de la población común, se presenta a continuaciónel resultado de una calibración más detallada para México.

Con información proveniente de la ENIGH 2012Instituto Nacional de Esta-dística Geografía e Información (2015), es sencillo comprobar que la proporcióndel ingreso que consumen los individuos es cercana al 80 %, donde el 80 % dela población con menor ingreso consume el 81.8 % de su ingreso y el 10 % conmás ingreso consume el 55 % de su ingreso. Adicionalmente, al estimar el cre-cimiento relativo del consumo con respecto al ingreso, podemos verificar quetiene un valor cercano a 0.61. Por ende, podemos esperar que, al menos en lamayoría de la población mexicana, la derivada parcial del consumo presenterespecto del ingreso sea positiva.

Además, es cierto que la derivada parcial del consumo presente respectodel ingreso no laboral multiplica a un término que también es positivo y mayorque 1. Según la literatura emírica publicadaFrederick y cols. (2002)Cairns y Pol(1997)Moore y Viscusi (1988), 0,80 ≤ β ≤ 0,95 en un horizonte temporal demediano plazo y r ≈ 0,04 para México. Por esta razón y por el argumentoanterior, podemos esperar que para la mayoría de los individuos mexicanossea cierta la relación 3.22, independientemente del valor de γ. En el Apéndice2 se presenta el ejercicio empírico realizado, con diferentes valores para losparámetros, incluido γ, donde la relación se mantiene.

∂CP∂N

(1 + (1 + r)γ−1

βγ)− 1 ≥ 0 (3.22)

Por ende, podemos concluír, bajo supuestos razonables, que la derivadaparcial del trabajo en el presente, con respecto al ingreso no laboral, es negativa,es decir, que ante un cambio negativo en el ingreso no laboral, los individuosdeciden trabajar más y estudiar menos. Formalmente:

∂LP∂N

< 0 (3.23)

De tal suerte que, combinando la relación 3.13 con la relación 3.23, pode-mos esperar que ante la presencia de un individuo discapacitado en el hogar,los demás individuos tengan incentivos a estudiar menos en el presente. For-malmente,


∂LiP∂Sj0

=∂LiP∂Ni

∂Ni

∂Sj0< 0⇒ ∂EiP

∂Sj0=∂EiP∂Ni

∂Ni

∂Sj0> 0;∀i 6= j (3.24)

4. METODOLOGÍA EMPÍRICA

En esta sección se describe la información que se utilizó para probar la vali-dez de la hipótesis de investigación planteada. Adicionalmente, se desarrolla elmodelo econométrico que se utilizó para estimar cuantitativamente la relacióncausal entre la presencia de un individuo discapacitado en el hogar y la asis-tencia escolar de los miembros de éste. Finalmente, se discute la consistenciade los resultados generados por el modelo econométrico y los posibles riesgosque la estimación podría tener en términos de endogeneidad y/o selección.

4.1 DESCRIPCIÓN DE LOS DATOS

La información utilizada para probar la validez de la hipótesis de investi-gación proviene de la Encuesta Nacional de Ingreso y Gasto de los Hogares(ENIGH) 2012, publicada por el Instituto Nacional de Estadística y Geografía(INEGI) en agosto de 2013. Esta encuesta es representativa a nivel nacional,con una periodicidad bianual, de tal forma que el levantamiento utilizado es elmás reciente posible a la realización de éste artículo.

El levantamiento de la ENIGH se hace a través de un diseño probabilísticopor Unidades Primarias de Muestreo (UPM) estratificadas, y en su versión 2012constó de 10,062 viviendas encuestadas (las cuales pueden contener varios in-dividuos, a su vez, encuestados). Esto que garantizó una significancia al 90 %de confianza. Esta información es relevante pues el levantamiento no se realizópor Muestreo Aleatorio Simple (MAS), por lo que todas las estimaciones pre-sentadas en este trabajo, tanto econométricas como de estadística descriptiva,requirieron de un ajuste por sesgos de encuesta.

Para estudiar la validez de la hipótesis de investigación se consideró unamuestra de la ENIGH 2012 para la población en edad escolar, contempladadesde los 6 y hasta los 24 años (que consistió en 11,967 observaciones a ni-vel individuo). La razón de esto es que se seleccionó a la población elegiblepara cierto grado escolar por edad, para evitar sesgos por habilidad u otrosfactores relacionados con individuos de diferentes edades estudiando gradosque no les corresponderían (por ejemplo, adultos mayores en la primaria o re-petidores). Los individuos discapacitados en el rango de edad escolar fueronexcluidos del análisis para evitar sesgar las estimaciones con el efecto de supropia discapacidad, vía la habilidad, sobre la asistencia escolar. No obstante,estos individuos fueron tomados en cuenta al determinar si un hogar tiene unmiembro discapacitado o no2.

2El número total de individuos que viven en un hogar con al menos un discapacitado en lamuestra fue de 1,961, representando el 16.4 % del total de individuos en edad escolar.


La manera en la que se midió la presencia de discapacidad en el hogar fue lasiguiente: dentro de la tabla de población en la encuesta, la cual es medida a ni-vel individuo, se preguntó si el individuo en cuestión tiene algún tipo de disca-pacidad3. Con esta información, fue posible entonces registrar en una variableindicadora de si el individuo en cuestión sufre algún tipo de discapacidad. Ca-be resaltar que la discapacidad en el hogar se contempló independientementedel parentesco del individuo con los individuos en edad escolar.

Posteriormente, se creó una variable indicadora que registra, para todos losindividuos del hogar, si en éste vive un individuo que sufre de algún tipo dediscapacidad. Esta variable indicadora, finalmente, fue la que se utilizó comovariable de discapacidad en el hogar. Es importante recalcar que se hicieronvarios intentos por desagregar por tipo de discapacidad o inclusive por severi-dad, sin embargo, el número de observaciones se reduce significativamente yno permitió estimar de manera confiable algún tipo de diferencias por tipo dediscapacidad. Este hecho se discute más a profundidad en la sección de discu-sión.

4.2 MODELO ECONOMÉTRICO

La estimación empírica de la hipótesis de investigación se hizo a través deun modelo de respuesta discreta, centrado en la probabilidad que una variableindicadora de asistencia escolar tome el valor de 1; en particular, se utilizó unProbit4Wooldridge (2010).

Es importante mencionar que la naturaleza de la pregunta de investigacióndetermina directamente la naturaleza de la variable de interés, a decir, una va-riable discreta que indica si un individuo determinado asiste o no a la escuela.En este contexto, es necesario utilizar un modelo de respuesta discreta, ya queeste tipo de modelos asimilan la información de las variables explicativas in-cluidas y la transforman, a través de cierta forma funcional, en la probabilidadde que la variable indicadora de interés tome el valor unitario.

La forma funcional seleccionada para la distribución de probabilidades enel presente trabajo fue no lineal, pues los modelos lineales como el Modelo deProbabilidad Lineal (LPM, por sus siglas en inglés), además de representar ladensidad de probabilidad de forma lineal, podrían no cumplir con el axioma decerradura, donde toda probabilidad debe estar acotada en el intervalo cerradode cero a uno.

También es importante tener en cuenta que los modelos de respuesta dis-creta tienen sus propias limitaciones, mismas que se plantean a continuación,con sus respectivas soluciones: por una parte, los coeficientes estimados en elmodelo no representan efectos marginales de una variable independiente sobrela probabilidad de que ocurra un evento (variable dependiente). Esta situación

3En particular, las siguientes categorías están incluidas en la encuesta: caminar, moverse, subiro bajar; ver, hasta usando lentes; hablar, comunicarse o conversar; oír, aun usando aparato auditivo;vestirse, bañarse o comer; poner atención o aprender cosas sencillas; tiene alguna limitación mental

4Se hicieron pruebas con un modelo Logit, sin embargo, los resultados fueron muy similaresen magnitud e interpretación. Los resultados de esta estimación se presenta en el Apéndice 6.


fue tomada en cuenta al hacer las estimaciones empíricas, para calcular losefectos marginales de la ecuación 4.2. Por otra parte, al no tener efectos margi-nales lineales, los errores estándar comúnmente utilizados no son válidos. Estasituación fue tomada en cuenta al hacer las estimaciones, donde se calcularonlos errores estándar por el método delta .

La especificación particular del modelo Probit utilizado para estimar la va-lidez de la hipótesis de investigación se presenta a continuación

Pr(Ai = 1 | Xi) = Φ(α+ βDi + θSocEci + γSegSoci + πCJHi + εi) (4.1)

DondeAi es una variable indicadora que toma el valor de 1 si el individuo iasiste a la escuela y 0 de otra forma; Xi es un vector de controles de dimensiónK compuesto por SocEci, un vector de controles socioeconómicos de dimen-sión k1, SegSoci, un vector de controles de seguridad social de dimensión k2, yCJHi, un vector de controles de características del jefe del hogar de dimensiónk3, donde k1 + k2 + k3 = K. Di es la variable de interés y es una variable indi-cadora que toma el valor de 1 si existe un individuo discapacitado en el hogardonde vive el individuo i, y 0 de otra forma5. εi es un error aleatorio con media0 y varianza σ2 > 0, con Φ(·) la función de densidad normal.

El efecto de interés, el impacto de la presencia de un individuo discapaci-tado en el hogar sobre la probabilidad de asistencia escolar, se estimó como elefecto marginal para la variable de interés, la indicadora de discapacidad en elhogar. En este caso, el efecto marginal, al estimarse sobre una variable discre-ta, se calcula a partir de una diferencia en probabilidades ajustadas, como semuestra en la siguiente ecuación:

∆Pr(Ai=1|Xi)∆Di

= ∆Pr(Ai = 1 | Di = 1, Xi)− Pr(Ai = 1 | Di = 0, Xi) =

Φ(α+β+θSocEci+γSegSoci+πCJHi+εi)−Φ(α+θSocEci+γSegSoci+πCJHi+εi)(4.2)

Es importante, antes de continuar con las estimaciones estadísticas, comen-tar acerca de los controles considerados en el modelo econométrico. Los con-troles, como se mencionó anteriormente, están agrupados en tres categoríasdiferentes con las que, posteriormente, se hizo la estimación. La razón de es-to fue, principalmente, para verificar la robustez de las estimaciones en unaadición progresiva de controles. Este es un tema que será comentado más ade-lante. La descripción de los controles mencionados en esta sección se puedeencontrar en el Apéndice 3.

El primer grupo de controles, los de características del jefe del hogar, tienecomo finalidad capturar cualquier factor que pudiera contaminar el efecto deinterés a través de la libertad que tienen los individuos dentro del hogar paratomar una decisión como la de estudiar. Al interior de los hogares, el jefe delhogar puede influir en las decisiones escolares de los hijos, con base en sus pro-pias preferencias. Por ejemplo, si el jefe del hogar tiene estudios universitarios,es probable que sea más propenso a influenciar y propiciar que su hijo también

5Esta es la variable que se construye conforme a la explicación de la sección 4.1


los tenga. Si este hecho y otros similares no son controlados, las estimacionesestarán sesgadas.

Las variables consideradas para controles de características del jefe del ho-gar son las siguientes: género del jefe del hogar, edad del jefe del hogar, unavariable indicadora para jefes del hogar sin educación formal y una variableindicadora para jefes del hogar con educación profesional o de posgrado.

El segundo grupo de controles, los socioeconómicos, tiene como finalidadcapturar cualquier factor de la realidad del individuo que pudiera contaminarel efecto de interés a través del escenario en el cual los individuos toman deci-siones. Es probable que el ingreso y las características de las familias, como elnúmero de integrantes, la composición del hogar, el tipo de comunidad don-de habitan los individuos, entre otros, sean factores altamente correlacionadoscon la asistencia escolar, por lo que es importante tomarlos en cuenta.

Las variables utilizadas para controlar estos factores socioeconómicos sonlas siguientes: género del individuo, tipo de localidad en la que reside, tamañodel hogar, número de integrantes menores de 5 años, número de hombres enedad productiva, número de mujeres en edad productiva, número de ancianosy el logaritmo natural del ingreso.

El último grupo de controles, los de seguridad social, tienen como finalidadcapturar los efectos suavizadores que debería tener la seguridad social sobre elimpacto de la discapacidad de un individuo del hogar en la asistencia escolarde los demás individuos. La lógica es la siguiente: la seguridad social, en tér-minos de discapacidad, brinda atención al discapacitado e, inclusive, podríallegar a pensionarlo, por lo que el efecto de ésta discapacidad sobre las restric-ciones que enfrentan los individuos al tomar decisiones podría verse afectado.Las variables tomadas en cuenta como controles de seguridad social son las si-guientes: una variable indicadora de si el discapacitado tiene afiliación a insti-tuciones de salud y una variable indicadora de si el discapacitado tiene algunapensión por discapacidad.

La estadística descriptiva de estas variables, presentada en la tabla 1, mues-tra que existen diferencias estructurales en los hogares, dependiendo de si hayun individuo discapacitado en ellos o no. De hecho, existen diferencias esta-dísticamente significativas en prácticamente todas las variables, a partir de lacondición de discapacidad, lo cual nos lleva a pensar que son variables rele-vantes para poder aislar correctamente el efecto de interés. En la variable deinterés, proporción de asistencia escolar, existe una diferencia significativa al99 % de confianza entre tipos de hogares, lo cual es congruente con la hipótesisde investigación.

Existen otros dos factores que llaman la atención: el primero, que los hoga-res con un discapacitado tienen mayor edad, en promedio, que los hogares sindiscapacitado. Esto lo podemos observar a partir de las diferencias estadísti-camente significativas en la edad del jefe del hogar y el número de ancianos.El segundo, que los hogares con un discapacitado tienen, en promedio, másintegrantes que los hogares sin discapacitados y que esta diferencia se refle-ja principalmente en el número de ancianos, el número de personas en edadproductiva y la proporción de hogares nucleares (padres e hijos). Esto podría


deberse a la necesidad de contar con adultos, externos al hogar nuclear, queayuden a atender al discapacitado y/o aportar recursos al hogar para suavizarlos impactos negativos sobre el consumo (de hecho, los hogares con un disca-pacitado tienen menor ingreso que los hogares sin éste), lo cual aporta eviden-cia empírica que sugiere que el mecanismo de acción de la discapacidad en elhogar sobre la asistencia escolar planteado, en el modelo teórico, es válido.

142E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R

CUADRO 1: ESTADÍSTICA DESCRIPTIVA

Variable Todos ConDiscapacitado

SinDiscapacitado Valor P

Características del Jefe del HogarAsistencia Escolar 0.70 0.66 0.71 0.0001Género del Jefe del Hogar 0.78 0.73 0.79 0.0001Edad del Jefe del Hogar 44.36 53.95 42.65 <0.0001Jefe del Hogar sin Educación Formal 0.08 0.19 0.06 <0.0001Jefe del Hogar con Educación Profesional o de Posgrado 0.12 0.07 0.13 <0.0001

Características SocioeconómicasGénero 0.50 0.51 0.50 0.1340Gran Ciudad 0.45 0.36 0.47 <0.0001Tamaño del Hogar 5.16 5.71 5.06 <0.0001Individuos Menores de 5 años 0.44 0.43 0.45 0.3945Hombres en Edad Productiva 0.87 0.92 0.87 <0.0001Mujeres en Edad Productiva 1.03 1.19 1.00 <0.0001Ancianos en el Hogar 0.13 0.42 0.07 <0.0001Ingreso (Trimestral) $38,560.77 $34,402.29 $39,303.31 <0.0001Hogar Nuclear 0.65 0.40 0.70 <0.0001

Características de Seguridad SocialPensión Por Discapacidad* 0.03 0.19 0 <0.0001Afiliación a Institución de Salud* 0.08 0.52 0 <0.0001Observaciones 12,592 2,041 10,551

* Estadística informativa para el grupo con discapacidad.Fuente: Elaboración propia con información de la ENIGH 2012.


De cualquier manera, la evidencia aportada por la estadística descriptivamuestra que existen diferencias estructurales considerables entre los hogares,para la gran mayoría de los controles contemplados, según la condición dediscapacidad. Por esta razón, será importante incluirlos en las estimacionescausales, de tal manera que tengamos la mayor confianza posible, de que soninsesgadas.

Una investigación de estadística descriptiva que vale la pena presentar esla diferencia por grupos de edad en la asistencia escolar, dependiendo de lacondición de discapacidad en el hogar. En el cuadro 2 se puede observar laproporción de individuos, por rango de edad, que asisten a la escuela. Losresultados muestran diferenciación en los patrones de asistencia escolar con-forme aumenta el rango de edad.

Los rangos de edad considerados en la estadística descriptiva, y en el restode las estimaciones, fueron los siguientes: educación primaria (de 6 a 12 años),educación secundaria y preparatoria (de 13 a 18 años)6 y educación profesional(de 19 a 24 años). Al respecto, es importante notar que los datos sugieren dife-rentes patrones de comportamiento de la variable de asistencia escolar, dada ladiscapacidad, conforme avanza la edad de los individuos.

La estadística descriptiva, por rango de edad, muestra que el 69.5 % de losindividuos en edad escolar asiste a la escuela cuando no tiene un individuodiscapacitado en el hogar, mientras que el 64.4 % lo hace cuando tiene un in-dividuo discapacitado, lo que representa una disminución en la proporción deasistencia escolar de 5.1 %, con un nivel de confianza del 99 %.

CUADRO 2: ASISTENCIA ESCOLAR POR EDAD Y CONDICIÓN DE DISCAPACIDAD.

Grado Escolar Sin Discapacidad Con Discapacidad DiferenciaEdad Escolar 69.5 % 64.4 % 5.1 %***

Primaria 98.8 % 98.6 % 0.2 %Secundaria y Preparatoria 72.7 % 67.8 % 4.9 %*

Profesional 28.8 % 19.7 % 9.1 %***

* p < 0,01, ** p < 0,05, *** p < 0,01Fuente: elaboración propia con información de la ENIGH 2012.

Sin embargo, la información desagregada por perfil de edad aporta infor-mación importante: la diferencia de proporciones para individuos en edad deeducación primaria es de solo 0.2 %, no significativa. La diferencia de propor-ciones para individuos en edad de educación secundaria y preparatoria es de4.9 %, significativa al 90 % y la diferencia en proporciones para individuos en

6Se decidió utilizar en conjunto el rango de secundaria y preparatoria porque son rangos don-de se observó mayor varianza, por lo que, el tener más observaciones solidificó los resultados. Sehizo el ejercicio de todas las estimaciones presentadas en el trabajo para los grupos separados yesta diferenciación no aportó evidencia diferente a la de la estimación en conjunto, por lo que elagrupamiento de este rango no es relevante.


edad de educación profesional es de 9.1 %, significativa al 99 %.Este es un hecho que podría responder a que, en edades tempranas, los tra-

bajadores no calificados pueden tener dificultades para realizar tareas físicasque, con el paso de la edad, se vuelven más sencillas. Esto aumenta su produc-tividad en dichas tareas, y con ello, el salario que pueden ganar. A su vez, estopuede incrementar el incentivo a abandonar la escuela e ir a trabajar, confor-me avanza la edad, para suavizar el consumo. Esta premisa será estudiada conmás detalle dentro de las estimaciones econométricas.

4.3 VALIDÉZ DE LA ESTRATÉGIA DE IDENTIFICACIÓN

Hasta este momento no se ha discutido la validez de la estrategia de iden-tificación planteada en las secciones anteriores, sin embargo, este es un temade vital importancia pues las decisiones escolares de los individuos podríanverse afectadas por factores observables o no observables que ponen en ries-go la consistencia de las estimaciones. Por esa razón, se dedica esta sección acomentar el riesgo de endogeneidad, simultaneidad y selección muestral, y elcomo se controlan estos problemas en las estimaciones.

MARCO TEÓRICO

En temas de educación existe toda una literatura que sugiere que las relacio-nes causales pueden ser simultáneas, estar sujetas a selección o ser endógenasCard (1999). Por ello, en el presente trabajo es importante dedicar el siguienteespacio a esta discusión. Para comenzar, es importante no perder de vista que,si bien pudieran existir problemas que dificultan la estimación consistente lacausalidad de algunas de las variables de control utilizadas, la discusión secentrará en los problemas que pudieran afectar la estimación de la causalidadde la presencia de un individuo discapacitado en el hogar sobre la asistenciaescolar de sus integrantes, pues esta es la variable de interés que sustenta lashipótesis del presente trabajo.

Con respecto a la simultaneidad, no parece factible que la asistencia escolarde los individuos del hogar tenga impacto sobre la presencia de la discapaci-dad en el mismo. Un posible camino que podría provocar la simultaneidad, encaso de que existiera, es que los individuos que asisten a la escuela se econtra-rán con mejor información, obtenida en la escuela, sobre la discapacidad y queentonces el hogar, con esta información, reportara de manera distinta la disca-pacidad en la encuesta. Sin embargo, este problema no permea a los resultadosgracias a que la variable de discapacidad es construida a través de un hechoobservable sobre alguna limitación de un individuo del hogar, no a través deun reporte binario sobre esta condición.

Con respecto a la selección muestral, no existe un problema claro, pues lainformación de la encuesta se encuentra disponible para todos los individuos,con discapacitado en el hogar, o sin él, y en todo el rango de posibilidades deinformación (excepto el ingreso, que no es la variable de interés, por lo queno es un problema). Por ende, no existe problema de censura o truncamiento


de los datos, que pudieran inducir un potencial sesgo de estimación Heckman(1976).

Finalmente, el problema que pudiera ser más difícil de descartar, es el deendogeneidad. De primera instancia, este problema deriva del hecho de queel error de estimación no tenga valor esperado cero, condicional en la varia-ble de interés y/o los controles, sesgando así los estimadores. Formalmente,que el efecto causal que se pretende estimar sea insesgado depende de que secumplan la condiciones siguientes:

yi = βX + u

E(u) = 0;Cov(u, xi) = 0; ∀i ∈ 1, ...,K (4.3)

Donde y es la variable de interés,X es un vectorK de controles, β un vectorde K de parámetros poblacionales, y u, un error de estimación. Al respecto, esimportante mencionar que la condición 4.3 se cumplirá para alguna de las va-riables si esta es asignada aleatoriamente. Sin embargo, aún en caso de que noexistiera aleatoriedad en la variable de interés, a través de variables de controles posible lograr estimaciones insesgadas.

Como refuerzo a lo anterior, en la literatura se habla de que el problemade endogeneidad se deriva de que exista una variable no observable que afectetanto a la probabilidad de asistencia escolar como a la discapacidad en el hogar,provocando así que la media del error, condicional en alguno de los controles,no sea cero. Sin embargo, la literatura también menciona que cuando existenvariables observables que controlan o aproximan por efectos simultáneos sobrela variable dependiente e independiente de interés, y estas son incluidas en lasestimaciones, el problema desapareceAngrist y Pischke (2008). A continuaciónse analizan algunos casos que podrían afectar la estimación, bajo esta óptica.

Pensemos, en primer lugar, en que el gasto en salud o alimentación del ho-gar pudieran haber ocasionado la discapacidad de alguno de los individuos,y que estos gastos, además, indujeran un bajo rendimiento escolar de los in-dividuos que estudian, de tal suerte que su costo de oportunidad de seguirestudiando fuera relativamente más alto y decidieran dejar la escuela. En esecaso, controlando por el ingreso eliminaríamos la potencial endogeneidad dela discapacidad, consiguiendo un estimador insesgado.

Otra potencial fuente de endogeneidad, bajo la línea de pensamiento plan-teada, podría ser que el individuo tenía un trabajo de alto riesgo y quedó disca-pacitado desempeñándo, y además, este trabajo implicaba que los individuosen edad escolar faltaran a la escuela. Este podría ser el caso de un trabajadorpetrolero, cuyo trabajo implicaba moverse de ciudad constantemente, y ade-más, un alto riesgo de quedar discapacitado. En este caso, la endogeneidad seeliminaría controlando por el ingreso, pues está relacionado con el riesgo detrabajo, además de que parece un caso remoto.

Un tercer posible camino para la endogeneidad deriva de que existe la posi-bilidad de que los individuos no asistan a la escuela por provenir de un hogarde bajos recursos y que además, el tener bajos recursos también haya propi-ciado la discapacidad, por ejemplo, por la falta de ácido fólico en el embarazo.


En este caso, con la inclusión de un control por ingreso en el hogar, podemosasegurar que este problema no afecta las estimaciones.

Finalmente, podría argumentarse que el ingreso del hogar está relacionadocon la educación del jefe del hogar y que estos factores a su vez se relacionancon la asistencia escolar y la discapacidad en el hogar. Por ejemplo, este seríael caso en el que el jefe del hogar tiene mayor eduación e impulsa a los hijosa estudiar más y, además, como consecuencia de su educación puede evitarla discapacidad en un individuo del hogar por un cambio en hábitos. En estecaso, controlar por educación del jefe del hogar y por ingreso es suficiente paraasegurar una estimación insesgada.

Para reforzar el argumento anterior a favor de la exogeneidad de la esti-mación, se realizó una prueba formal de endogeneidad por factores no obser-vables, adaptada de una versión para estimadores lineales, conocida como laprueba de HaussmanWooldridge (2010). Es importante comentar que no exis-te prueba formal que pueda demostrar la no endogeneidad de los controles,sin embargo, esta prueba es útil pues, cualitativamente, es capaz de mostrarun potencial problema con los controles. La prueba parte de un modelo base aestimar, en este caso

Pr(A = 1 | X1, D) = Φ(α0 + α1X1 + α2D + v) (4.4)

Donde A es una variable indicadora de asistencia escolar, X1 es un vectorde controles de dimensión L1,D es una variable indicadora de discapacidad enel hogar, y v es un error de estimación. Un posible problema de endogeneidadderiva de que en el presente modelo se sospecha que existe un factor no obser-vable que afecta a D y A al mismo tiempo. Para probar dicha sospecha, es útilmodelar, a través del Linear Probability Model (LPM), la probabilidad de queun hogar tenga un individuo discapacitado, como se presenta a continuación:

Pr(D = 1 | X) = β0 + β1X + u (4.5)

Donde D es una variable indicadora de discapacidad en el hogar, X es unvector de dimensión L, con L > L1 para evitar problemas de multicolinealidaden la siguiente etapa de estimación, y u es un error de estimación. Es impor-tante notar que al estimar el modelo 4.5 y ajustar el error de estimación, estecontendrá todos los factores no observables que afectan al hecho de que existaun discapacitado en el hogar. Por ende, el error ajustado es una variable proxypara los factores no observables que afectan a la discapacidad en el hogar. Estose puede apreciar en la siguiente relación:

u = P r(D = 1 | X)− β0 − β1X (4.6)

Para esta prueba, se seleccionó un modelo de probabilidad lineal para ladiscapacidad en el hogar, pese a sus limitaciones, pues es con el único que esposible ajustar los errores de estimación. Si bien este modelo tiene sus limitan-tes, alrededor de la media es un predictor bastante acertado de la probabilidadde ocurrencia de un fenómenoWooldridge (2010). En este caso, el modelo lineal


es útil porque en realidad no se pretende estimar la probabilidad de discapaci-dad, sino capturar la variación de ésta por factores no observables, de lo que elmodelo es capaz perfectamente.

Finalmente, se debe estimar el modelo original (ecuación 4.4), incluyendoel ajuste de los errores de estimación (ecuación 4.6). Este modelo se puede ob-servar en la ecuación 4.7. Si hubiera algún factor no observable que estuvieraafectando a la existencia de discapacidad en el hogar y simultáneamente a laasistencia escolar, se debería estimar un un valor para δ significativo.

Pr(A = 1 | X1, D) = Φ(α0 + α1X1 + α2D + δu+ v) (4.7)

Por lo tanto, para la hipótesis de que la presencia de discapacidad en elhogar es una variable endógena dentro de la estimación de la probabilidad deasistencia escolar gracias a factores no observables, se utilizará la hipótesis nulapost estimación siguiente:

H0 : δ = 0 (4.8)

En la siguiente sección se presentan los resultados a la estimación empíricade la prueba de Haussman detallada en esta sección. Para cerrar la presentediscusión sobre la posible endogeneidad de la discapacidad en el hogar dentrodel modelo planteado en secciones anteriores, vale la pena adelantar que laprueba de Haussman no encontró signos de endogeneidad para la variable deinterés.

ESTIMACIÓN EMPÍRICA DE LA PRUEBA DE HAUSMANN

La definición de los vectores utilizados en la estimación de Hausmann, asícomo los resultados de la misma, pueden ser consultadas en el apéndice 4. Conlas variables utilizadas, el resultado de la estimación empírica del modelo deHaussman es la siguiente:

δ = −0,106; σ(δ) = 0,620; tstat = −0,17; p = 0,864 (4.9)

Por ende, la hipótesis planteada en la ecuación 4.8 no puede ser rechaza-da a un nivel de confianza del 90 %. Podemos concluir entonces que no existeevidencia estadística de que hay un problema de endogeneidad entre la disca-pacidad en el hogar y la probabilidad de asistencia escolar de los individuosen el hogar por factores no observables, dentro de la ENIGH 2012.

5. RESULTADOS

En esta sección se presentan los resultados de las estimaciones del modeloeconométrico desarrollado en las sección 4. Las estimaciones se hicieron paratoda la población en edad escolar, así como para los subgrupos de edad deprimaria, secundaria y preparatoria, y profesional. Esto se debió a que hubohallazgos en la estadística descriptiva que sugirieron que dicha división podría


aportar información adicional al análisis. Los resultados se pueden consultaren el cuadro 3.

Las estimaciones econométricas confirman que la hipótesis de investigaciónes válida para la población mexicana en edad de educación profesional, puesse encontró evidencia significativa al 99 % de confianza de que la presencia deun individuo discapacitado reduce la probabilidad promedio de asistencia a laescuela en 14.7 %, para los individuos que habitan ese hogar. Adicionalmente,existe evidencia de que el efecto no es igual para todos los rangos de edadesy de que en cada rango de edad existen características particulares que mo-difican la probabilidad de asistencia escolar. Esto es de particular importanciaporque la estimación para toda la población en edad escolar probablemente es-tá sesgada, dadas las diferencias estructurales de cada grupo. Este hecho serádiscutido con mayor profundidad en la siguiente sección.

En individuos en edad de asistir a la escuela primaria no existe evidenciade que la discapacidad de un individuo del hogar modifique la probabilidadde asistencia escolar de sus integrantes. De hecho, llama la atención que lasúnicas variables relevantes para alterar la probabilidad de asistencia escolar esla educación del jefe del hogar así como el ingreso, lo cual seguramente res-ponde a la cobertura universal que existe a nivel primaria en el paísSecretaríade Educación Pública (2013).

Al respecto, observamos que los individuos en edad de asistir a la escue-la primaria aumentan la probabilidad de asistir a la escuela en una escala depredicción perfecta si el jefe del hogar estudió nivel profesional o posgrado,mientras que reducen la probabilidad de asistir a la escuela si el jefe del hogarno tiene estudios formales. Adicionalmente, un incremento de 1 % en el ingresodel hogar aumenta, en promedio, 0.49 % la probabilidad de asistencia escolarpara este grupo de edad. Todos estos efectos fueron significativos al 99 % deconfianza.

En individuos en edad de asistir a la escuela secundaria y preparatoria tam-poco existe evidencia de que la discapacidad de un individuo del hogar modi-fique la probabilidad de asistencia escolar de sus integrantes. En este caso, laeducación del jefe del hogar y el ingreso del hogar son variables importantes,además de que a diferencia del caso de individuos en edad de escuela pri-maria, el tamaño y la composición del hogar parecen ser factores que cobranrelevancia también.

De los resultados para este grupo de edad, llaman la atención dos de ellos:por una parte, el que el jefe del hogar tenga estudios profesionales o de posgra-do incrementa la probabilidad de asistencia escolar en 34.9 % con una confian-za del 99 %. Por otra parte, el tener un individuo adicional en el hogar reduce,en promedio, la probabilidad de asistencia escolar en 1.9 %, con una confianzade 95 %. Este efecto es diferente según el tipo de individuo adicional: el efec-to es particularmente marcado si el individuo adicional es menor de 5 años,mientras que si el individuo es un adulto en edad productiva, la probabilidadaumenta.

Finalmente en individuos en edad de asistir a la escuela profesional exis-te evidencia sólida de que la presencia de un individuo discapacitado en el


hogar reduce la probabilidad de asistencia escolar de sus integrantes: el queexista un individuo discapacitado en el hogar, ceteris paribus, reduce la proba-bilidad promedio de asistencia escolar de los demás individuos, con un 99 %de confianza, en 14.7 %, en comparación al hogar contrafactual donde no haypresencia de discapacidad.

Además de esto, la probabilidad de asistencia escolar para individuos enedad de asistir a educación profesional es más sensible a otros factores socio-económicos y de características del jefe del hogar que para los demás niveleseducativos. Los más relevantes son la edad del jefe del hogar, la educación deljefe del hogar, el género del individuo, el tipo de localidad en la que vive el in-dividuo, el número de individuos menores de 5 años en el hogar y el logaritmodel ingreso.

150E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R

CUADRO 3: EFECTOS MARGINALES DE LA ESTIMACIÓN PROBIT

Variable dependiente: probabilidad de asistencia escolar.

Primaria Secundaria y Preparatoria Profesional Edad Escolar

Discapacidad en el Hogar -0.00282 -0.0256 -0.147∗∗ -0.0654∗∗

(-0.67) (-0.90) (-3.25) (-2.65)Características del Jefe del Hogar

Género del Jefe del Hogar 0.000845 -0.0325 -0.0349 -0.0289(0.19) (-1.02) (-1.16) (-1.21)

Edad del Jefe del Hogar 0.000200 -0.00162 0.00384∗∗∗ -0.00451∗∗∗

(1.33) (-1.12) (3.53) (-4.93)

Jefe del Hogar sin Educación Formal -0.00980∗∗ -0.0483 -0.136∗ -0.0821∗∗

(-2.71) (-1.28) (-2.26) (-3.12)

Jefe del Hogar con Educación Profesional o Posgrado 0.349∗∗∗ 0.299∗∗∗ 0.266∗∗∗

(7.04) (7.94) (8.53)Características Socioeconómicas

Género -0.00493 -0.00639 -0.0469∗ -0.0200(-1.48) (-0.36) (-2.18) (-1.84)

Gran Ciudad -0.00297 0.0210 0.0840∗∗ 0.0316(-0.75) (0.68) (3.10) (1.75)

Tamaño del Hogar -0.000964 -0.0186∗ 0.00830 0.00451(-1.08) (-2.29) (0.80) (0.85)

Individuos Menores de 5 Años -0.000984 -0.0896∗∗∗ -0.172∗∗∗ -0.124∗∗∗

ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N151

CUADRO 3: CONTINUACIÓN


(-0.36) (-4.05) (-7.19) (-9.73)

Hombres en Edad Productiva 0.00112 0.0606∗ -0.00987 0.0816∗∗∗

(0.39) (2.50) (-0.45) (4.42)

Mujeres en Edad Productiva 0.000536 0.120∗∗∗ 0.0372 0.204∗∗∗

(0.16) (4.58) (1.26) (10.16)

Ancianos 0.00133 0.0516 -0.0306 0.120∗∗∗

(0.39) (1.38) (-0.91) (4.65)

Log(Ingreso) 0.00490∗∗ 0.0344 0.0332∗ -0.0732∗∗∗

(2.92) (1.80) (2.06) (-5.78)Características de Seguridad Social

Afiliación a Institución de Salud 0.000228 0.0113 0.0446 -0.0207(0.03) (0.21) (0.71) (-0.54)

Pensión Por Discapacidad 0.000802 -0.0667 0.125 0.0483(0.09) (-1.05) (1.78) (1.03)

N 4,618 3,912 3,473 11,967

Estadístico t entre paréntesis∗ p < 0,05, ∗∗ p < 0,01, ∗∗∗ p < 0,001

Fuente: elaboración propia con la información de la ENIGH 2012


Con respecto a las características del jefe hogar, se encontró que la edad deljefe del hogar incrementa la probabilidad promedio de asistencia escolar de losindividuos en edad de educación profesional en 0.38 %. El que el jefe del hogarno tenga educación formal la reduce en 13.6 % y el que el jefe del hogar tengaeducación profesional o de posgrado la aumenta en 29.9 %. Todos estos efectosfueron significativos.

Con respecto a las características socioeconómicas de los hogares, se encon-tró que el género de los individuos en edad de educación profesional reducesu probabilidad promedio de asistencia escolar en 4.69 %, el vivir en una granciudad la aumenta en 8.4 % y el tener un individuo adicional menor de 5 añosde edad la reduce en 17.2 %, significativos todos ellos. No se encontró efectosignificativo al 95 % de confianza o superior para la seguridad social.

6. DISCUSIÓN

En la primera parte de esta sección se discuten los principales hallazgosempíricos de las estimaciones y se puntualiza acerca de su interpretación, apli-cación y vinculación con la política pública actual. Posteriormente, se analiza larobustez de las estimaciones para clarificar acerca de su validez y sensibilidada diferentes posibles escenarios.

Si bien se encontró evidencia que parece sustentar, a un 99 % de confianza,que existe un impacto negativo de la presencia de un discapacitado en el hogarsobre la probabilidad de asistir a la escuela, para toda la población analizada,es importante tomar en cuenta la información adicional que aporta el estudiodesagregado por rangos de edad. La evidencia es sólida y contundente única-mente para el grupo de individuos en edad de cursar el nivel profesional. Estoes congruente con la diferenciación por rango de edad sugerida por la estadís-tica descriptiva, aunque no fue posible probar el efecto escalonado que ahí seobserva.

De hecho, la evidencia para el grupo de los que están en edad de cursarnivel profesional es tan sólida y significativa que tiene un efecto extensivo, auncuando se incluyen los otros dos grupos de edad (primaria y, secundaria y pre-paratoria) para los cuales no se encontró evidencia alguna de que la hipótesisde interés se cumpla. Por ello, al observar el resultado extensivo para toda lapoblación en edad escolar, e interpretarlo, debemos tener cuidado, pues este noes más que el resultado de sesgar hacia abajo la estimación para individuos enedad de estudios profesionales con las observaciones de los otros dos gruposde edad, para los que no hay efecto alguno. Un ejercicio adicional que se hizoal respecto fue estimar sobre toda la muestra pero utilizando como variablesde interés la interacción de la discapacidad en el hogar con una indicadora porrango de edad. El resultado confirmó un impacto diferente por nivel educativo.El resultado se puede consultar en la tabla 10 del Apéndice 4.

Por esta razón, una estimación para la muestra completa no es confiable.Esto se confirma además gracias a la evidencia de que existe una diferencia es-tructural importante en el impacto de los controles y la variable de interés para


cada rango de edad. Por ende, una la especificación general para toda la mues-tra de edades no es sostenible como una estrategia válida de identificación. Unejemplo claro de esto es el caso del ingreso, donde a lo largo de cada uno de lossubgrupos mantiene el signo esperado, sin embargo, con magnitudes diferen-tes, aunado a que para el grupo en edad de secundaria y preparatoria presentauna gran varianza (y no significancia), lo que hace que en el agregado, al igualque la variable de interés, no presente una estimación creible ni congruente.

Derivado de lo anterior, el presente trabajo aporta evidencia empírica y teó-rica novedosa a la literatura de la relación causal entre la discapacidad y laasistencia escolar, la cual existe únicamente en individuos en edad de asistir alnivel profesional. Un posible canal de transmisión es a través de la disminucióndel ingreso no laboral que enfrentan los individuos, y por ende, el aumento enel costo de oportunidad de estudiar. El presente trabajo también aporta eviden-cia empírica de que la relación causal no se cumple para los demás rangos deedad y de que los factores que impactan a cada grupo de edad, lo hacen de ma-nera diversa, por lo que no es correcto hablar de efectos al nivel de la poblaciónen edad escolar, en general.

Este hallazgo, sin embargo, no es del todo sorprendente pues en Méxicoexiste cobertura prácticamente universal en educación para individuos de pri-maria y secundariaSecretaría de Educación Pública (2013), y para individuosde preparatoria todavía existe un gran nivel de participación, independien-temente de su nivel socioeconómico o características del hogar, mientras quepara nivel profesional, la cobertura desciende dramáticamente. Este hecho ha-bla de una mayor vulnerabilidad del grupo en edad de educación profesionala diferentes factores.

Dentro de los factores que podrían afectar a los individuos en edad de niveleducativo profesional, y en parte a los de nivel preparatoria, llama la atenciónel del número de niños menores de 5 años en el hogar. Esta variable podría te-ner impacto negativo sobre la probabilidad de asistencia escolar por al menosdos canales de transmisión: el primero, que al haber más individuos depen-dientes económicamente en el hogar, los integrantes más preparados para en-frentar el mercado laboral, calificados o no, tienen mayores incentivos a buscartrabajo. El segundo, que esos niños menores de 5 años pudieran ser los hijosde los individuos en edad de estudiar nivel profesional, lo que requiere uncompromiso familiar serio que aumenta el costo de oportunidad de estudiarsignificativamente.

Este es un hecho importante, pues, dados los mecanismos de transmisiónplanteados, existe un potencial riesgo de generar círculos viciosos. Dentro dela política pública se ha hablado de que la educación es fundamental para con-tener el crecimiento demográfico insostenible que asedió a los países subdesa-rrollados en las décadas pasadas. Entonces, es de especial cuidado el que losindividuos puedan estar abandonando la escuela por cuidar a sus hijos, puesesto eliminaría una de las herramientas más importantes de dicha política pú-blica.

Los resultados de la presente investigación, además, sugieren que los in-dividuos en edad de cursar la educacion profesional son más sensibles en sus


decisiones escolares que los demás individuos. Esto es un hecho fundamentalque debe ser tomado en cuenta para futuro diseño y evaluación de política pú-blica educativa, pues sugiere que esta debe estar diferenciada dependiendo dela focalización deseada.

Con respecto a los individuos en edad de nivel educativo secundaria y pre-paratoria, llama la atención la importancia de la composición del hogar. Estopodría ser tan relevante porque es una edad en la que se toman decisionesimportantes en materia de educación. Los individuos en edad de asistir a laprimaria parecen no tener mucho margen de decisión sobre si estudiar o no,sin embargo, justo en el cambio entre la secundaria y la preparatoria pudie-ra existir una decisión fuerte que hacer, pues es donde la escuela deja de serobligatoria, y por ende, disminuyen los costos de acceder al mercado laboral.

Este hecho va muy de la mano con el diseño del programa Oportunidades,antes PROGRESA, que está enfocado a incentivar la salud y la educación, sobretodo al final del nivel básico y medio superior, pues estudios formales encon-traron que este es un punto de decision muy importante en las familias, sobretodo de escalas sociales pobres Schultz (2004).

Un hecho que debe ser tomado en cuenta, también importante en términosde política pública, es el que se hubiera esperado un mejor desempeño de loscontroles de seguridad social sobre la probabilidad de asistencia escolar en lasestimaciones. Este es un tema relevante pues la evidencia empírica presentadaen la sección anterior sugiere que la seguridad social enfocada a discapacidadpudiera no tener el efecto estabilizador deseado en la toma de decisiones delos hogares, siendo así, gasto ineficiente.

Este es un tema, sin embargo, que por sí mismo podría motivar futura in-vestigación para determinar la efectividad de los programas de seguridad so-cial, pues la no significancia de estas variables puede deberse también a queexiste sólo una muestra muy reducida de individuos para generar la suficientevariación adecuada en estas variables.

Otro tema que es motivado por las estimaciones es el de un posible círculovirtuoso alrededor de la educación de los individuos. Se encontró evidencia deque tener jefes de familia con nivel educativo profesional incrementa la proba-bilidad de que los individuos en edad escolar, en conjunto y por nivel educa-tivo, asistan a la escuela. Esto es un hecho importante que nos puede ayudara reorientar la política educativa del país hacia un fortalecimiento del nivelmedio-superior sin perder de vista el nivel profesional.

Alternativamente, y a manera de prevención, las estimaciones también en-contraron que se puede generar un círculo vicioso al respecto. Se encontró evi-dencia de que tener jefes de familia sin estudios formales disminuye la probabi-lidad de que los individuos en edad escolar, en conjunto y por nivel educativo,asistan a la escuela. Esto refuerza la idea de reorientación de la política públicaplanteada en materia educativa.

Discutidos los resultados que aportan las estimaciones en su conjunto, tan-to en el agregado como para el corte por nivel educativo, vale la pena discutirsobre la robustez de éstas. Al respecto, se hicieron varios ejercicios de especifi-cación y abordaje de las estimaciones que se presentan a continuación.


El primero de los ejercicios realizados fue la progresiva adición de controlesa las estimaciones para probar si había modificaciones cualitativas o cuantita-tivas importantes en el efecto de interés, que señalaran posibles sesgos poromisión de variables. La adición de controles fue hecha de la siguiente forma:primero, se estimó la relación de interés sin controles. Posteriormente, se es-timó la relación de interés con controles por características del jefe del hogar.En una siguiente instancia, se estimó la relación de interés con controles porcaracterísticas del jefe del hogar y características socioeconómicas. Finalmente,se estimó la especificación completa, presentada en las secciones anteriores.

Cualitativamente, los resultados para el efecto de interés se mantuvieronestables en casi todas las estimaciones, en signo y significancia. De hecho, elsigno del efecto de interés nunca cambió ante las variaciones y su significanciacambió en una sola ocasión. El efecto de interés apareció como significativo enla estimación sin controles para los individuos en edad de secundaria y prepa-ratoria, pero cuando se agregaron controles, perdió la significancia definitiva-mente.

Además, los resultados para el efecto de interés se mantuvieron en el mis-mo orden de magnitud para primaria, secundaria y preparatoria, y poblaciónen edad escolar. Para el grupo de individuos en edad de educación profesionalhubo algunos saltos a la baja y a la alza, sin embargo, pueden estar asociadosa que el estimador, en versiones menos controladas de la especificación, estásesgado hacia arriba o hacia abajo por la omisión de variables relevantes. Estoes una hipótesis fácilmente verificable pues la mayoría de los controles agrega-dos progresivamente son significativos en las estimaciones, lo cual termina deconfirmar la importancia de su inclusión.

Por lo anterior y el análisis de los resultados, se puede defender que lasestimaciones son sólidas ante la especificación del modelo y que el modeloseleccionado es adecuado para describir la probabilidad de asistencia escolarexplicada por la presencia de un individuo discapacitado en el hogar, siempreque se analicen separadas por rango de edad. Los resultados de la prueba deadición progresiva de controles se pueden consultar en los cuadros 6, 7 y 8 delApéndice 6.

Una prueba adicional de robustez realizada fue la estimación del modelomás especificado, por rango de edad, a través de una especificación funcionalLogit. El resultado de esta estimación se puede consultar la tabla 11 del Apén-dice 6. La finalidad de esta prueba fue verificar la solidez de las estimacionesante la variación en la función de densidad de probabilidad utilizada. Los re-sultados no cambiaron de signo ni de órden relativo entre estimaciones, concambios de magnitud muy pequeños, por lo que podemos concluir que las es-timaciones son confiables independientemente de la forma funcional elegida.

Una tercera prueba de robustez fue realizada al estimar por separado losmodelos para los individuos en edad de nivel educativo secundaria y prepara-toria, para verificar el caso en el cual se hubieran diluido los resultados signifi-cativos de un grupo con los resultados no significativos del otro grupo. Comoresultado se encontró que separar los grupos no aporta ningún tipo de eviden-cia o información adicional a la investigación de la hipótesis de interés, por lo


que el mantener ambos grupos unidos no representa un problema. Adicional-mente, el tener a este grupo unido en las estimaciones aumenta el número deobservaciones del mismo y, por ende, mejora la precisión de los estimadores.Los resultados de la prueba pueden ser consultados en la tabla 9 del Apéndice6.

Finalmente, y de la mano con la literatura analizadaMeyer y Mok (2009), seintentó hacer una prueba por grado y tipo de discapacidad. Los autores sugie-ren que podría haber diferencias si se utiliza un criterio (arbitrario) para sepa-rar la discapacidad leve de la discapacidad severa. En este caso, se consideródiscapacidad severa las siguientes características: discapacidad para caminar,moverse, subir o bajar; ver, aun usando lentes; hablar, comunicarse o conver-sar; oír, aun usando aparato auditivo; y, vestirse, bañarse o comer, según la cla-sificación del INEGI para la ENIGH 2012. Se consideró discapacidad leve lassiguientes características: discapacidad para poner atención o aprender cosassencillas; y, tener alguna limitación mental.

Al respecto, no hubo diferencias con la estimación planteada de inicio puesen la muestra sólo se encontraron 31 observaciones de discapacidad leve, delas cuales, sólo 9 se encontraron en hogares de individuos en edad escolar. Estorepresenta un 0.08 % de la muestra total de individuos en edad escolar, lo cualestá por debajo del margen de error de la ENIGH 2012. Por ende, cualquierestimación sobre esta muestra no es fiable. Como resultado, la estimación sindiferenciar por nivel de discapacidad es una estrategia correcta en el contextode la ENIGH 2012, pues no existe información suficiente para hacer diferencia-ción.

En este contexto, hubiéramos esperado, de tener la información necesariapara estimar por separado, que el efecto de la presencia de un individuo dis-capacitado en el hogar sobre la probabilidad de asistencia escolar fuera mayor,en general, cuando los individuos experimentan discapacidad severa. Esto sedebe a que la discapacidad leve, en algunos casos, permitiría al discapacitadorealizar tareas sencillas, productivas y de sustento, que redujeran el impactonegativo sobre el ingreso no laboral de los demás individuos del hogar.

Como resultado de las pruebas de robustez realizadas, podemos concluirde que la estimación realizada es robusta y confiable para describir la relacióncausal entre la presencia de un individuo discapacitado en el hogar y la asis-tencia escolar de sus integrantes, ya que no se encuentran sujetas a un evidenteproblema de inconsistencia por omisión de variables relevantes, especificaciónde forma funcional o definición por rangos de edad.

En este sentido, es importante tener en mente que, dada la naturaleza delproblema que se estudió, múltiples factores que podrían influenciar las decisio-nes de los hogares fueron analizados y se justificó el por qué no son un peligropara las consistencia estimaciones. De cualquier manera, es importante tenersiempre presente que, de existir algún factor externo no controlado en el mo-delo que impactara la presencia de discapacidad en el hogar y al mismo tiempoa la asistencia escolar, las estimaciones presentadas en este trabajo dejarían deser causales, para convertirse en meras correlaciones.


7. CONCLUSIONES.

La presente investigación aporta evidencia, tanto teórica como empírica, ala literatura sobre economía de la salud, de la educación y laboral. En particu-lar, se estudió el impacto que tiene la presencia de un individuo discapacitadoen el hogar sobre la probabilidad de asistencia escolar de sus integrantes. Laevidencia encontrada, la cual es significativa, apunta a que existe un impactoinverso y causal entre discapacidad en el hogar y la probabilidad de asistenciaescolar de sus miembros, en individuos en edad de asistir a educación profe-sional.

Este es un resultado confiable tanto en términos de consistencia como entérminos de robustez de las estimaciones. Por una parte, se desarrolló una ar-gumentación teórica, acompañada de una prueba empírica, que sustentaronque la estrategia de identificación planteada genera estimaciones insesgadas.Por otra parte, se desarrollaron análisis de sensibilidad para la especificacióndel modelo y la manera de estimar, los cuales mostraron que los resultadosobtenidos se mantienen ante cambios estructurales importantes.

Además de probar la hipótesis de interés, el presente trabajo aporta eviden-cia sobre otras variables importantes del entorno social, de las característicasdel jefe del hogar y de seguridad social que juegan un rol fundamental en latoma de decisiones escolares por parte de los individuos. De aquí, y de la prue-ba de la hipótesis de interés, se desprenden algunas ideas que pueden servirpara posterior análisis de política pública y futuros trabajos de investigación.Estas ideas son las siguientes:

1) La presente investigacin es una herramienta que ayuda a predecir cua-litativamente el impacto de política pública enfocada al ingreso no laboral delos individuos para generar incentivos de educación. Además, este artículo res-ponde a una necesidad expresamente aceptada del vacío de información empí-rica para México que oriente a los hacedores de política pública en materia dediscapacidad. Como resultado de la presente investigación, podemos pensarque la política pública orientada al ingreso no laboral de los individuos gene-rará incentivos para asistir a la escuela, por lo menos para individuos en edadde asistir a la escuela profesional. Al respecto, existe gran oportunidad paravalidar empíricamente qué sucede con política pública de discapacidad enfo-cada al ingreso laboral, donde podrían existir incentivos y distorsiones de otrotipo.

2) La presente investigación señala un potencial problema con la seguridadsocial en términos de discapacidad, que puede responder a dos razones. Laprimera, es que no existe suficiente cobertura a los discapacitados, como semuestra en la baja incidencia de hogares con un discapacitado que ademáscuentan con seguridad social para el mismo. La segunda, es que además de lapoca cobertura, la que existe podría ser ineficiente. Por ende, existe una granoportunidad de orientar los esfuerzos académicos a estudiar este fenómenomás a fondo, para definir áreas de oportunidad y reorientar la política públicahacia la eficiencia económica.

3) Es importante considerar que los cambios sociales y estructurales que se


viven al interior de los hogares, producto del cambio demográfico y de cos-tumbres que viven los países en vías de desarrollo, no pasan desapercibidosen la toma de decisiones escolares al interior de éstos. Padres más educadosgenerarán círculos virtuosos hacia la construcción de mayor capital humano.Hogares con menos individuos dependientes económicos incentivarán que losindividuos estudien. Mayor estabilidad económica, ya sea vista como mayoringreso o mayor número de aportadores, traerá mayores incentivos para quelos individuos estudien. Por ello, este trabajo sugiere áreas de oportunidad alas que puede enfocarse la política pública que busque ampliar la asistenciaescolar, sin perder de vista que la calidad también es importante y es un temaque vale la pena abordar por sí mismo, donde también hay mucho por hacer.

REFERENCIAS

Angrist, J. D., y Pischke, J.-S. (2008). Mostly harmless econometrics: An empiricist’scompanion. Princeton university press.

Bankston III, C. L., y Caldas, S. J. (1998). Family structure, schoolmates, andracial inequalities in school achievement. Journal of Marriage and the Family,715–723.

Cairns, J., y Pol, M. Van der. (1997). Constant and decreasing timing aversionfor saving lives. Social Science & Medicine, 45(11), 1653–1659.

Card, D. (1999). The causal effect of education on earnings. Handbook of laboreconomics, 3, 1801–1863.

Frederick, S., Loewenstein, G., y O’donoghue, T. (2002). Time discounting andtime preference: A critical review. Journal of economic literature, 351–401.

Gobierno Federal de los Estdos Unidos Mexicanos. (2014). Programa nacionalpara el desarrollo y la inclusión de las personas con discapacidad 2014-2018.Diario Oficial de la Federación.

Hanushek, E. A., Schwerdt, G., Wiederhold, S., y Woessmann, L. (2015). Re-turns to skills around the world: Evidence from piaac. European EconomicReview, 73, 103–130.

Heckman, J. J. (1976). The common structure of statistical models of truncation,sample selection and limited dependent variables and a simple estimator forsuch models. En Annals of economic and social measurement, volume 5, number4 (pp. 475–492). NBER.

Instituto Nacional de Estadística Geografía e Información. (2015). Encuestanacional de ingreso y gasto de los hogares 2012: Tabulados básicos.

LeClere, F. B., y Kowalewski, B. M. (1994). Disability in the family: the effectson children’s well-being. Journal of Marriage and the Family, 457–468.


Meyer, B., y Mok, W. K. (2009). Disability, earnings, income and consumption (Inf.Téc.). National Bureau of Economic Research.

Moore, M. J., y Viscusi, W. K. (1988). The quantity-adjusted value of life. Eco-nomic Inquiry, 26(3), 369–388.

Psacharopoulos, G. (1981). Returns to education: an updated internationalcomparison. Comparative education, 17(3), 321–341.

Schultz, T. P. (2004). School subsidies for the poor: evaluating the mexicanprogresa poverty program. Journal of development Economics, 74(1), 199–250.

Secretaría de Educación Pública. (2013). Principales cifras del sistema nacionaleducativo 2012-2013.

Spence, M. (2002). Signaling in retrospect and the informational structure ofmarkets. American Economic Review, 434–459.

Thomas, J., y Stockton, C. (2003). Socioeconomic status, race, gender, & reten-tion: Impact on student achievement. Essays in Education, 7.

Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data. MITpress.


a

APÉNDICE 1: DESARROLLO TÉCNICO DEL MODELO TEÓRICO

El agente descrito en la sección del modelo teórico enfrenta una funciónintertemporal como la siguiente

U(CP , CF ) =CP

1− 1γ

1− 1γ

+ βCF

1− 1γ

1− 1γ

(7.1)

Y enfrenta las siguientes restricciones, donde sP es el ahorro presente:

EP + LP = T (7.2)

LF = T (7.3)

CP + sP = N +WPLP (7.4)

CF = (1 + r)sp +

(WAEP +WBLP

T

)LF (7.5)

Donde 7.2 es la restricción temporal para el presente, 7.3 es la restriccióntemporal para el futuro, 7.4 es la restricción presupuestaria para el presente y7.5 es la restricción presupuestaria para el futuro. En estas ecuaciones, ademásde las variables anteriormente definidas, LF es la cantidad de trabajo en elfuturo. Combinando estas restricciones llegamos a la restricción presupuestariaconsolidada

CP +CF

1 + r= N + wPLP +

1

1 + r

(wAEP + wBLP

T

)T (7.6)

Un individuo que maximiza la función 7.1 sujeta a la restricción 7.6, enfren-tará el siguiente lagrangeano, donde escoge CP , CF y LP :

L =CP

1− 1γ

1− 1γ

+βCF

1− 1γ

1− 1γ

+λ

(N + wPLP +

1

1 + r

(wAEP + wBLP

T

)T − CP −

CF1 + r

)(7.7)

De encontrar las condiciones de primer orden con respecto a CP y CF , sepueden encontrar las siguientes dos relaciones, respectivamente:

CP : C− 1γ

P − λ = 0 (7.8)

CF : βC− 1γ

F − λ

1 + r= 0 (7.9)


Combinando 7.8 y 7.9 se obtiene la siguiente relación:

C1γ

F

C1γ

P

= (1 + r)β ⇒ CF = ((1 + r)β)γCP (7.10)

Sustituyendo 7.10 en la restricción original 7.6 (que es la condición de pri-mer orden respecto a λ), obtenemos la siguiente relación

CP +((1 + r)β)

γCP

1 + r= CP (1 + (1 + r)γ−1βγ)

= N + wPLP +1

1 + r

(wAEP + wBLP

T

)T (7.11)

Finalmente, reordenando la expresión 7.11, es posible obtener la siguienterelación entre trabajo y consumo, que es la que se utiliza en el modelo teóricopara derivar el resto de los resultados:

LP =CP (1 + (1 + r)

γ−1βγ)−N − 1

1+rwAT

wP − 11+r (wA − wB)

(7.12)

APÉNDICE 2: CALIBRACIÓN DE LA DERIVADA PARCIAL

Se hicieron varias pruebas para determinar el signo de la siguiente expre-sión, aplicable para México:

F (·) =∂CP∂N

(1 + (1 + r)γ−1

βγ)− 1 ≥ 0 (7.13)

Los escenarios calculados son los siguientes:β = 0,7; γ = 0; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,05

β = 0,8; γ = 0; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,05

β = 0,95; γ = 0; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,05

β = 0,7; γ = 0,5; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,04

β = 0,8; γ = 0,5; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,07

β = 0,95; γ = 0,5; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,05

β = 0,7; γ = 1; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,04

β = 0,8; γ = 1; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,10

β = 0,95; γ = 1; r = 0,4; ∂CP∂N = 0,61;⇒ F (·) ≈ 0,19


APÉNDICE 3: DESCRIPCIÓN DE LOS CONTROLES.

En la presente sección se describen las controles utilizados en la investiga-ción.

Controles por Características del Jefe del HogarGénero del Jefe del Hogar: variable indicadora que toma el valor de 1 si el

género del jefe del hogar es masculino y 0 en otro caso;Edad del jefe del Hogar: variable numérica que registra la edad del jefe del

hogar en años cumplidos a la fecha de la encuesta;Jefe del Hogar sin Educación Formal: variable indicadora que toma el valor de

1 si el jefe del hogar no completó ni un año de educación formal y 0 en otrocaso;

Jefe del Hogar con Estudios Profesionales o de Posgrado: variable indicadora quetoma el valor de 1 si el jefe del hogar tiene estudios profesionales o de posgradoy 0 en otro caso;

Controles socioeconómicosGénero: variable indicadora que toma el valor de 1 si el individuo es de

hombre y 0 en otro caso;Gran Ciudad: variable indicadora que toma el valor de 1 si el individuo vi-

ve en una gran ciudad y 0 en otro caso (más de 250,000 habitantes, según laclasificación del INEGI);

Tamaño del hogar: variable numérica que registra el número de individuosque habitan en el hogar;

Menores de 5 Años: variable numérica que registra el número de individuosmenores de 5 años de edad que habitan en el hogar;

Hombres y Mujeres en Edad Productiva: variable numérica que registra el nú-mero de hombres o mujeres, según el caso, entre 24 y 64 años de edad quehabitan en el hogar;

Ancianos: variable numérica que registra el número de individuos mayoresde 64 años de edad que habitan el hogar;

Logaritmo Natural del Ingreso: variable numérica que registra el logaritmonatural del ingreso total acumulado para el hogar, con periodicidad trimestral;

Controles por Seguridad SocialPensión por Discapacidad: variable indicadora que toma el valor de 1 si el

individuo discapacitado (en caso de que existiera en un determinado hogar)tiene alguna pensión por discapacidad y 0 de otra forma;

Afiliación a Instituciones de Salud: variable indicadora que toma el valor de1 si el individuo discapacitado (en caso de que existiera en un determinadohogar) tiene alguna afiliación a instituciones de salud (IMSS, ISSSTE, ISSSTEestatal o PEMEX) y 0 de otra forma;


APÉNDICE 4: PROCEDIMIENTO DE HAUSSMAN.

Los controles utilizados en la estimación de Hausmann son los siguientes:el vector X1 está compuesto por las variables género del jefe del hogar, edaddel jefe del hogar, variable indicadora de jefe del hogar con estudios formales,variable indicadora del jefe del hogar con estudios profesionales o de posgra-do, variable indicadora de género, variable indicadora de que el individuo viveen una gran ciudad (más de 250,000 habitantes, según el INEGI), tamaño delhogar, número de individuos en el hogar menores a 5 años, número de hom-bres y mujeres en edad productiva dentro del hogar, el logaritmo del ingreso,variable indicadora de que el discapacitado cuenta con pensión de discapa-cidad y variable indicadora de que el individuo discapacitado está afiliado ainstituciones de salud (IMSS, ISSSTE, ISSSTE estatal y PEMEX).

Por otro lado, el vector X está compuesto por todas las variables del vectorX1 más las variables adicionales de porcentaje del ingreso del hogar gastadoen alimentos y porcentaje del ingreso del hogar gastado en salud, las cualespodrían inducir variación extra sobre la discapacidad en el hogar. Esto aseguraque se cumpla la restricción de que L1 < L.

CUADRO 4: PRIMERA ETAPA DEL PROCEDIMIENTO DE HAUSMANN

Pr. de Discapacidad en el HogarGénero del Jefe del Hogar -0.000645

(-0.06)

Edad del Jefe del Hogar 0.00253∗∗∗

(4.48)

Jefe del Hogar sin Educación Formal 0.107∗∗∗

(4.04)

Jefe del Hogar con Estudios Profesionales o de Posgrado 0.00655(0.76)

Género 0.00433(0.93)

Gran Ciudad -0.0517∗∗∗

(-5.90)

Tamaño del Hogar 0.00414(1.18)

Menores de 5 Años -0.00646(-0.81)

Hombres en Edad Productiva -0.00627



Pr. de Discapacidad en el Hogar(-0.70)

Mujeres en Edad Productiva -0.00365(-0.35)

Ancianos 0.0602∗∗∗

(3.44)

Log(Ingreso) -0.0439∗∗∗

(-5.19)

Pensión por Discapacidad 0.107∗

(2.42)

Afiliación a Institución de Salud 0.841∗∗∗

(39.54)

Gasto Proporcional en Alimentos -0.0146(-0.69)

Gasto Proporcional en Salud 0.429∗∗∗

(5.82)

Constante 0.412∗∗∗

(4.90)N 11,967Estadístico t entre paréntesis∗ p < 0,05, ∗∗ p < 0,01, ∗∗∗ p < 0,001


CUADRO 5: SEGUNDA ETAPA DEL PROCEDIMIENTO DE HAUSMANN

Pr. de Asistencia EscolarError (Etapa 1) Ajustado -0.106

(-0.17)

Discapacidad en el Hogar -0.0853(-0.14)

Género del Jefe del Hogar -0.0841(-1.21)

Edad del Jefe del Hogar -0.0134∗∗∗



Pr. de Discapacidad en el Hogar(-4.33)

Jefe del Hogar sin Educación Formales -0.251∗

(-2.42)

Jefe del Hogar con Educación Profesional o de Posgrado 0.775∗∗∗

(8.56)

Género -0.0585(-1.84)

Gran Ciudad 0.0977(1.60)


Menores de 5 Años -0.360∗∗∗

(-9.80)

Hombres en Edad Productiva 0.239∗∗∗

(4.33)

Mujeres en Edad Productiva 0.595∗∗∗

(9.92)


(3.91)


(-4.69)

Pensión por Discapacidad 0.130(0.86)

Afiliación a Institución de Salud -0.149(-0.28)

Constante 2.500∗∗∗

(6.20)N 11,967Estadístico t entre paréntesis∗ p < 0,05, ∗∗ p < 0,01, ∗∗∗ p < 0,001



APÉNDICE 5: SIGNO ESPERADO DE LOS CONTROLES.

En la presente sección se comenta brevemente del signo esperado de lasvariables incluidas en el análisis.

Género del Jefe del Hogar (negativo): en la literatura se habla de que lasmujeres procuran más el desarrollo educativo de los hijosSchultz (2004).

Edad del Jefe del Hogar (indefinido): un jefe del hogar con más experien-cia puede valorar más la educación, sin embargo, después de cierto punto,también pierde facultades para laborar, lo que podría incrementar el costo deoportunidad de estudiar.

Jefe del Hogar sin Educación Formal (negativo): un jefe del hogar sin es-tudios probablemente valorará menos la educación de los hijos.

Jefe del Hogar con Estudios Profesionales o de Posgrado (positivo): un je-fe del hogar con estudios profesionales o de posgrado probablemente valorarámás los beneficios de la educación.

Género (indefinido): en la literatura se habla de discriminación hacia lasmujeres por parte de los jefes del hogarSchultz (2004), sin embargo, también, entrabajos de baja productividad, existen más incentivos a contratar hombres porel trabajo físico, aumentando el costo de oportunidad de estudiar para ellos.

Gran Ciudad(positivo): vivir en grandes ciudades genera mayor accesibi-lidad a las escuelas, reduciendo el costo de oportunidad de estudiar.

Tamaño del Hogar (indefinido): un hogar con más integrantes, en edadestempranas, podría favorecer que los niños tuvieran más supervisión y ayudacon las tareas, sin embargo, también podría implicar mayores gastos agrega-dos, por lo que existirían incentivos a abandonar la escuela.

Mayores de 5 Años (negativo): Mayor número de menores de 5 años im-plica una mayor cantidad de personas económicamente dependientes, por loque aumenta la necesidad de fuentes adicionales de recursos.

Hombres y Mujeres en Edad Productiva (positivo): mayor cantidad de in-dividuos en edad productiva probablemente implica mayor cantidad de apor-tadores de recursos al hogar, lo que disminuye la necesidad adicional de apor-tar recursos.

Ancianos (negativo): mayor número de ancianos implica una mayor canti-dad de personas económicamente dependientes, por lo que aumenta la necesi-dad de fuentes adicionales de recursos.

Logaritmo del Ingreso (positivo): mayor ingreso probablemente esté corre-lacionado positivamente con menor costo de oportunidad de estudiar.

Pensión por Discapacidad y Afiliación a Instituciones de Salud (positi-vo): son mecanismos suavizadores del impacto de la discapacidad en las deci-siones del hogar, por lo que deberÃan reducir el impacto negativo de ésta.

ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N167

APÉNDICE 6: TABLAS DE RESULTADOS PARA LAS PRUEBAS DE ROBUSTEZ.

CUADRO 6: EFECTOS MARGINALES DE LA ESTIMACIÓN PROGRESIVA DEL PROBIT PA-RA PRIMARIA


Sin Controles Jefe del Hogar Socioeconómicos Seguridad SocialDiscapacidad en el Hogar -0.00157 -0.00337 -0.00264 -0.00282


Género del Jefe del Hogar 0.00157 0.000849 0.000845(0.36) (0.19) (0.19)

Edad del Jefe del Hogar 0.000351∗ 0.000200 0.000200(2.21) (1.33) (1.33)

Jefe del Hogar sin Educación Formal -0.0167∗∗∗ -0.00981∗∗ -0.00980∗∗

(-3.86) (-2.70) (-2.71)Características Socioeconómicas

Género -0.00493 -0.00493(-1.48) (-1.48)

Gran Ciudad -0.00295 -0.00297(-0.75) (-0.75)

Tamaño del Hogar -0.000961 -0.000964(-1.07) (-1.08)

Individuos Menores de 5 Años -0.000979 -0.000984

168E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R


Sin Controles Jefe del Hogar Socioeconómicos Seguridad Social

(-0.36) (-0.36)

Hombres en Edad Productiva 0.00112 0.00112(0.39) (0.39)

Mujeres en Edad Productiva 0.000528 0.000536(0.16) (0.16)

Ancianos 0.00135 0.00133(0.37) (0.39)

Log(Ingreso) 0.00493∗∗ 0.00490∗∗

(2.85) (2.92)Características de Seguridad Social

Afiliación a Institución de Salud 0.000228(0.03)

Pensión Por Discapacidad 0.000802(0.09)

N 4,618 4,618 4,618 4,618



ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N169

CUADRO 7: EFECTOS MARGINALES DE LA ESTIMACIÓN PROGRESIVA DEL PROBIT PA-RA SECUNDARIA Y PREPARATORIA


Sin Controles Jefe del Hogar Socioeconómicos Seguridad SocialDiscapacidad en el Hogar -0.0478 -0.0322 -0.0325 -0.0256


Género del Jefe del Hogar -0.0230 -0.0321 -0.0325(-0.97) (-1.01) (-1.02)

Edad del Jefe del Hogar 0.00163 -0.00158 -0.00162(1.46) (-1.09) (-1.12)

Jefe del Hogar sin Educación Formal -0.0971∗∗ -0.0483 -0.0483(-2.63) (-1.27) (-1.28)

Jefe del Hogar con Educación Profesional o de Posgrado 0.401∗∗∗ 0.350∗∗∗ 0.349∗∗∗


Género -0.00662 -0.00639(-0.37) (-0.36)

Gran Ciudad 0.0191 0.0210(0.62) (0.68)

Tamaño del Hogar -0.0185∗ -0.0186∗

(-2.28) (-2.29)

Individuos Menores de 5 Años -0.0900∗∗∗ -0.0896∗∗∗

170E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R



(-4.06) (-4.05)

Hombres en Edad Productiva 0.0596∗ 0.0606∗

(2.46) (2.50)

Mujeres en Edad Productiva 0.119∗∗∗ 0.120∗∗∗

(4.59) (4.58)

Ancianos 0.0516 0.0516(1.36) (1.38)

Log(Ingreso) 0.0333 0.0344(1.75) (1.80)

Características de Seguridad Social


Pensión Por Discapacidad -0.0667(-1.05)

N 3,912 3,912 3,912 3,912



ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N171

CUADRO 8: EFECTOS MARGINALES DE LA ESTIMACIÓN PROGRESIVA DEL PROBIT PA-RA EDUCACIÓN PROFESIONAL


Sin Controles Jefe del Hogar Socioeconómicos Seguridad SocialDiscapacidad en el Hogar -0.0967∗∗ -0.112∗∗ -0.0823∗ -0.147∗∗


Género del Jefe del Hogar -0.0732∗∗ -0.0404 -0.0349(-2.81) (-1.33) (-1.16)

Edad del Jefe del Hogar 0.00663∗∗∗ 0.00381∗∗∗ 0.00384∗∗∗

(8.20) (3.48) (3.53)

Jefe del Hogar sin Educación Formal -0.198∗∗ -0.147∗ -0.136∗

(-2.99) (-2.47) (-2.26)



Género -0.0461∗ -0.0469∗

(-2.12) (-2.18)

Gran Ciudad 0.0893∗∗ 0.0840∗∗

(3.26) (3.10)

Tamaño del Hogar 0.00823 0.00830(0.79) (0.80)

Individuos Menores de 5 Años -0.170∗∗∗ -0.172∗∗∗

172E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R



(-7.09) (-7.19)

Hombres en Edad Productiva -0.00641 -0.00987(-0.28) (-0.45)

Mujeres en Edad Productiva 0.0375 0.0372(1.26) (1.26)

Ancianos -0.0304 -0.0306(-0.91) (-0.91)

Log(Ingreso) 0.0363∗ 0.0332∗

(2.23) (2.06)Características de Seguridad Social

Pensión Por Discapacidad 0.125(1.78)


N 3,437 3,437 3,437 3,437



ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N173

CUADRO 9: EFECTOS MARGINALES DE LA ESTIMACIÓN PROBIT PARA LA POBLACIÓNDE SECUNDARIA Y PREPARATORIA SEPARADAS


Secundaria Preparatoria Secundaria y PreparatoriaDiscapacidad en el Hogar 0.0183 -0.0899 -0.0256

(0.82) (-1.63) (-0.90)Características del Jefe del Hogar

Género del Jefe del Hogar -0.0502∗ -0.0106 -0.0325(-2.22) (-0.21) (-1.02)

Edad del Jefe del Hogar 0.0000188 -0.0000957 -0.00162(0.02) (-0.05) (-1.12)

Jefe del Hogar sin Educación Formal -0.0399 -0.0272 -0.0483(-1.62) (-0.47) (-1.28)



Género -0.0258 0.0296 -0.00639(-1.96) (1.01) (-0.36)

Gran Ciudad -0.00456 0.0609 0.0210(-0.21) (1.51) (0.68)

Tamaño del Hogar -0.0178∗∗ -0.0196 -0.0186∗

(-3.27) (-1.53) (-2.29)

Individuos Menores de 5 Años -0.00125 -0.159∗∗∗ -0.0896∗∗∗

174E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R


Secundaria Preparatoria Secundaria y Preparatoria(-0.08) (-4.53) (-4.05)

Hombres en Edad Productiva 0.0579∗∗ 0.0328 0.0606∗

(2.81) (0.94) (2.50)

Mujeres en Edad Productiva 0.0378 0.125∗∗ 0.120∗∗∗

(1.92) (2.98) (4.58)

Ancianos 0.0473 -0.0207 0.0516(1.58) (-0.40) (1.38)

Log(Ingreso) 0.0240∗ 0.0809∗∗ 0.0344(2.05) (2.69) (1.80)


Pensión Por Discapacidad -0.0661 -0.0877 -0.0667(-1.01) (-0.86) (-1.05)

Afiliación a Institución de Salud -0.0188 0.107 0.0113(-0.38) (1.35) (0.21)

N 1,926 1,986 3,912




CUADRO 10: EFECTOS MARGINALES DE LA ESTIMACIÓN DEL PROBIT CON INTERAC-CIÓN DE LA DISCAPACIDAD Y EL NIVEL EDUCATIVO


Edad Escolar

Discapacidad*Primaria 0.529∗∗∗

(7.70)

Discapacidad*Secundaria y Preparatoria -0.0660∗

(-2.35)

Discapacidad*Profesional -0.527∗∗∗

(-14.31)Características del Jefe del Hogar

Género del Jefe del Hogar -0.0204(-0.83)

Edad del Jefe del Hogar -0.00420∗∗∗

(-4.56)

Jefe del Hogar sin Educación Formal -0.0788∗∗

(-2.89)

Jefe del Hogar con Educación Profesional o de Posgrado 0.263∗∗∗

(8.76)Características Socioeconómicas

Género -0.0192(-1.75)

Gran Ciudad 0.0315(1.75)


Menores de 5 años -0.120∗∗∗

(-9.41)

Hombres en Edad Productiva 0.0750∗∗∗

(3.93)

Mujeres en Edad Productiva 0.201∗∗∗

(9.73)


(3.92)


(-5.11)



Edad Escolar


Pensión por Discapacidad 0.0691(1.23)


N 33435



ZE

KY

AH

ME

DM

UR

RA

AN

TÓ

N177

CUADRO 11: EFECTOS MARGINALES DE LA ESTIMACIÓN LOGIT



Discapacidad en el Hogar -0.00259 -0.0238 -0.151∗∗ -0.0642∗∗


Género del Jefe del Hogar 0.000512 -0.0318 -0.0323 -0.0337(0.11) (-1.02) (-1.07) (-1.45)

Edad del Jefe del Hogar 0.000189 -0.00155 0.00361∗∗∗ -0.00483∗∗∗

(1.29) (-1.10) (3.36) (-5.17)

Jefe del Hogar sin Educación Formal -0.00889∗∗ -0.0463 -0.131∗ -0.0778∗∗

(-2.64) (-1.27) (-2.01) (-2.98)

Jefe del Hogar con Educación Profesional o Posgrado 0.381∗∗∗ 0.284∗∗∗ 0.280∗∗∗

(6.54) (7.47) (8.69)

Género -0.00438 -0.00608 -0.0496∗ -0.0198(-1.33) (-0.36) (-2.29) (-1.85)

Características Socioeconómicas

Gran Ciudad -0.00275 0.0200 0.0825∗∗ 0.0317(-0.72) (0.65) (3.00) (1.77)

Tamaño del Hogar -0.000744 -0.0174∗ 0.00679 0.00442(-0.87) (-2.22) (0.63) (0.84)

Individuos Menores de 5 Años -0.000930 -0.0845∗∗∗ -0.175∗∗∗ -0.123∗∗∗

178E

LIM

PAC

TO

DE

LA

DISC

APA

CID

AD

EN

EL

HO

GA

R



(-0.35) (-3.89) (-7.31) (-9.65)

Hombres en Edad Productiva 0.000741 0.0598∗ -0.00955 0.0876∗∗∗

(0.26) (2.47) (-0.43) (4.76)

Mujeres en Edad Productiva 0.000107 0.119∗∗∗ 0.0400 0.225∗∗∗

(0.03) (4.61) (1.34) (11.04)

Ancianos 0.00102 0.0507 -0.0234 0.132∗∗∗

(0.30) (1.37) (-0.68) (4.87)

Log(Ingreso) 0.00457∗∗ 0.0337 0.0335∗ -0.0738∗∗∗

(3.08) (1.80) (2.09) (-5.75)Características de Seguridad Social

Afiliación a Institución de Salud 0.00107 0.00981 0.0400 -0.0235(0.16) (0.19) (0.57) (-0.60)

Pensión Por Discapacidad 0.000444 -0.0687 0.128 0.0427(0.05) (-1.13) (1.73) (0.89)

N 4,618 3,912 3,473 11,967



primavera 2015

nancieras en m xico - itam

Documents