Midterm Review. Haojun Li. May 9, Haojun Midterm Review May 9, / 13

dokumen-dokumen yang mirip
Metode Pemulusan Eksponensial Sederhana

Linear Discrimant Model

Metode Kernel. Machine Learning

Data Time Series. Time series merupakan data yang diperoleh dan disusun berdasarkan urutan waktu atau

Keseimbangan Torsi Coulomb

K-Means Clustering. Tim Asprak Metkuan. What is Clustering?

SVM untuk Regresi Ordinal

ADLN Perpustakaan Universitas Airlangga

ANALISIS CAPAIAN OPTIMASI NILAI SUKU BUNGA BANK SENTRAL INDONESIA: SUATU PENGENALAN METODE BARU DALAM MENGANALISIS 47 VARIABEL EKONOMI UNTU

Sistem Informasi. Soal Dengan 2 Bahasa: Bahasa Indonesia Dan Bahasa Inggris

Outline. Struktur Data & Algoritme (Data Structures & Algorithms) Pengantar. Definisi. 2-3 Trees

PENGARUH MIXED DISTRIBUTION PADA PENDEKATAN QUASI-LIKELIHOOD DALAM MODEL LINEAR 1)

Menampilkan Penaksir Parameter pada Model Linear * Mulyana **

Non Linear Estimation and Maximum Likelihood Estimation

Penerapan SAS/IML Algoritma EM, Pembangkitan sebaran normal ganda, dan Bootstrap

ALJABAR LINEAR SUMANANG MUHTAR GOZALI KBK ANALISIS

PEMANFAATAN TRANSFORMASI WAVELET SEBAGAI EKSTRAKSI CIRI PADA KLASIFIKASI BERTINGKAT SINYAL EKG

Time series Linier Models

SVM untuk Regresi. Machine Learning

MA3081 STATISTIKA MATEMATIK(A) Bab 2: Distribusi Samp

Informatics Class A UISI CALCULUS I WEEK 2 DAY 2

Analisis Dasar dalam Runtun Waktu

Kasus. Survey terhadap remaja usia tahun apakah pernah melakukan kerja paruh waktu (part-time)??

TIF APPLIED MATH 1 (MATEMATIKA TERAPAN 1) Week 3 SET THEORY (Continued)

Support Vector Machine

MA4081 PENGANTAR PROSES STOKASTIK Topik Khusus: M

LINEAR PROGRAMMING-1

Dependent VS independent variable

SIMULASI MONTE CARLO RISK MANAGEMENT DEPARTMENT OF INDUSTRIAL ENGINEERING

BAB III LANDASAN TEORI. Definisi Peramalan adalah memperkiraan atau memproyeksikan sesuatu

TIP 163. Game Engine. Topik 5 (Pert 6) Graf, Representasi Dunia, dan Algoritma Pencari Jalur (Pathfinding) Dosen: Aditya Wikan Mahastama

Data Structures. Class 5 Pointer. Copyright 2006 by The McGraw-Hill Companies, Inc. All rights reserved.

English for Tourism Lesson 14 The Tour Guide (continued)

RANGKAIAN DIGITAL TO ANALOG CONVERTER (DAC) DAN ANALOG TO DIGITAL CONVERTER

PERANCANGAN PARAMETER TERBAIK UNTUK PREDIKSI PRODUKSI BAN GT3 MENGGUNAKAN JARINGAN SYARAF TIRUAN RESILIENT PROPAGATION

MISTERI PEMBUNUHAN DI KAKEK BODO (INDONESIAN EDITION) BY S. MARA GD.

Optimisasi dengan batasan persamaan (Optimization with equality constraints) Mengapa batasan relevan dalam kajian ekonomi?

Diskripsi: Types of Statistics dan Penyajian Data

Dalam kasus-kasus terjadinya heteroskedastisitas, var(e i. ) = σ i2

Virtual Memory Ch. 9. Virtual Memory. SISTIM OPERASI (Operating System) IKI Johny Moningka

Nama Soal Pembagian Ring Road Batas Waktu 1 detik Nama Berkas Ringroad[1..10].out Batas Memori 32 MB Tipe [output only] Sumber Brian Marshal

g(x, y) = F 1 { f (u, v) F (u, v) k} dimana F 1 (F (u, v)) diselesaikan dengan: f (x, y) = 1 MN M + vy )} M 1 N 1

Analisis dan Dampak Leverage

VERSI BAHASA INDONESIA

6 KERANJANG 7 LANGKAH API (INDONESIAN EDITION) BY LIM TUNG NING

Praktikum Metode Komputasi (Vector Spaces)

ID3 : Induksi Decision Tree

No Kegiatan Kalimat yang di latih Arti. 2. How are you? 3.- Do you remember about population? - Can you explain about population?

Analisis Algoritma Bubble Sort

Membangun Menara karakter (Indonesian Edition)

Radial Basis Function Networks

Learning Outcomes Sebaran Kontinu Nilai Harapan dan Ragam Beberapa Sebaran Kontinu. Peubah Acak Kontinu. Julio Adisantoso.

ABSTRAK. Kata Kunci : Artificial Neural Network(ANN), Backpropagation(BP), Levenberg Marquardt (LM), harga emas, Mean Squared Error(MSE), prediksi.

TIME SERIES DENGAN K-STAT &EVIEWS

Adam Mukharil Bachtiar English Class Informatics Engineering Algorithms and Programming Searching

Metode Bayes. Tim Machine Learning

BAB 2 LANDASAN TEORI

KONSEP DAYA DUKUNG LINGKUNGAN DALAM PENENTUAN LOKASI BUDIDAYA

OFVSBM!OFUXPSL. Latar Belakang

JARINGAN KOMPUTER. 2. What is the IP address and port number used by gaia.cs.umass.edu to receive the file. gaia.cs.umass.edu :

Introduction to Time Series Analysis

Struktur Data & Algoritme (Data Structures & Algorithms)

KLASIFIKASI CALON PENDONOR DARAH MENGGUNAKAN METODE NAÏVE BAYES CLASSIFIER (Studi Kasus : Calon Pendonor Darah di Kota Semarang)

Statistik Bisnis. Week 9 Confidence Interval Estimation

Objectives. Struktur Data & Algoritme (Data Structures & Algorithms) Outline. Apa itu Recursion? Recursion

JARINGAN KOMPUTER : ANALISA TCP MENGGUNAKAN WIRESHARK

ADLN - Perpustakaan Universitas Airlangga ABSTRAK. viii

3.7 Further Results and Technical Notes. Yenni Angraini-G

Statistik Bisnis 1. Week 9 Discrete Probability

Callista Sulaiman

Addition of beneficiary for other currency than INR

Subnetting & CIDR. Fakultas Rekayasa Industri Institut Teknologi Telkom

Statistik Bisnis 1. Week 10 Continuous Probability Normal Distribution

1/5. while and do Loops The remaining types of loops are while and do. As with for loops, while and do loops Praktikum Alpro Modul 3.

Single Exponential Smoothing

Peramalan Memprediksi peristiwa masa depan Biasanya memerlukan kebiasaan selama jangka waktu tertentu metode kualitatif

Research = experiment

FUZZY LINEAR PROGRAMMING (FLP) DENGAN KONSTANTA SEBELAH KANAN BERBENTUK BILANGAN FUZZY DAN BERBENTUK TRAPEZOIDAL SKRIPSI DEWI YANNI FRANSISKA SAMOSIR

FIXED, RANDOM & MIXED MODELS. Senin, 12 November 2012

PENANGANAN MASALAH HETEROSKEDASITAS DENGAN MODEL ARCH-GARCH DAN MODEL BLACK-SCHOLES MOSES ALFIAN SIMANJUNTAK

The Essence of Programming & Problem Solving

5. The removed-treatment design with pretest & posttest Design: O 1 X O 2 O 3 X O 4 Problem: O 2 - O 3 not thesame with O 3 - O 4 construct validity o

Convolutional Neural Networks (CNNs) dengan Keras

INTERAKSI ANTARA PENGURANGAN WAKTU TUNGGU DAN BIAYA PEMESANAN PADA MODEL PERSEDIAAN DENGAN BACKORDER PRICE DISCOUNT DAN PENGENDALIAN FAKTOR PENGAMAN

Model Linear untuk Klasifikasi

INTISARI. Rancangan D-optimal Untuk Model Eksponensial Tergeneralisasi, Eksponensial Terboboti dan Morgan Mercer Flodin (MMF) Oleh

BAB III METODOLOGI PENELITIAN

BAB 3 ANALISIS DAN PERANCANGAN

Analisis Chi-Square (x 2 )

Statistik Bisnis 2. Week 4 Fundamental of Hypothesis Testing Methodology

Pertemuan 10 STATISTIKA INDUSTRI 2. Multiple Linear Regression. Multiple Linear Regression. Multiple Linear Regression 19/04/2016

ANALISIS POTENSI DAN PENGEMBANGAN SUMBER DAYA AIR STUDI KASUS: DAS. CITARUM HULU - SAGULING

1st Lecture. Rus an NasrudinEconomic Modelling/ECON February 12, 2013

RISKESDAS Modul IPKM. Modul MDG s. Modul Biomedis. penyajian sampai domain kabupaten/kota. domain provinsi, subsampel dari Modul IPKM.

Statistika Farmasi

Algoritma Pencarian Blind. Breadth First Search Depth First Search

Entrepreneurship & Innovation

Pemrograman Lanjut. Interface

Pengantar Matematika Asuransi. Winita Sulandari

Transkripsi:

Midterm Review Haojun Li May 9, 2019 Haojun Midterm Review May 9, 2019 1 / 13

Table of Contents 1 Kernel 2 Generative Models (NB, GDA) 3 Bias-Variance Haojun Midterm Review May 9, 2019 2 / 13

Table of Contents 1 Kernel 2 Generative Models (NB, GDA) 3 Bias-Variance Haojun Midterm Review May 9, 2019 3 / 13

Kernel Recap Core idea: reparametrize parameter θ as a linear combination of featurized vectors. θ = n i=1 β iφ( x (i) ) A kernel matrix K is a valid (mercer) kernel function if it is a PSD. Suppose K(x (i), x (j) ) = K ij [ ] 1 0 If K = then is K a valid kernel function? 0 0 [ ] 3 5 If K = then is K a valid kernel function? 5 3 Haojun Midterm Review May 9, 2019 4 / 13

Kernel Question (Midterm FA18) Let us attempt to kernelize k-means algorithm. Consider the following update formula: step 1: step 2: c (i)[t+1] := arg min µ [t+1] j := j x (i) µ [t] j 2 m i=1 1{c(i)[t+1] = j}x (i) m i=1 1{c(i)[t+1] = j} We seek to combine this into a single formula while also applying the kernel trick to allow infinite dimensional features. Complete the derivation below: c (i)[t+1] := arg min j φ(x (i) ) µ j 2 := Haojun Midterm Review May 9, 2019 5 / 13

Table of Contents 1 Kernel 2 Generative Models (NB, GDA) 3 Bias-Variance Haojun Midterm Review May 9, 2019 6 / 13

GLM Recap Exponential family is P(y; η) = b(y) exp(η T T (y) a(η)) Universal GLM update formula: θ := θ + α(y h(x))x At test time, we want to make a prediction: h(x) = E[y x] = a (η) We parametrize η as η = θ T x. (Or, if η is vector-valued, then η i = θ T i x) Assume that we have an exponential distribution What is b(y), η, T (y) and a(η)? p(y; λ) = λe λy Haojun Midterm Review May 9, 2019 7 / 13

Generative Algorithm Recap We make a strong assumption about how our data and labels are generated. Discriminative Algorithms - models p(y x; θ) directly Generative Algorithms - models p(x, y; φ) = p(x y; φ)p(y; φ) Basic formulation: Model the label as p(y); model the data as p(x y) Haojun Midterm Review May 9, 2019 8 / 13

GLM + Generative Algorithm Question (Fall 2017 Midterm 3a) We examine a Generalized Discriminant Analysis (Generalized GDA) We model p(y = 1; φ) = φ where y Bernoulli(φ). We model p(x y = k; η k ) = b(x) exp(η T k T (x) a(η k)) where T (x) = x and k {0, 1} denotes the class label. Show that the decision boundary is linear by showing the posterior distribution is p(y x; η 0, η 1, φ) = 1 1 + exp( (θ 0 + θ 1 x)) Haojun Midterm Review May 9, 2019 9 / 13

Table of Contents 1 Kernel 2 Generative Models (NB, GDA) 3 Bias-Variance Haojun Midterm Review May 9, 2019 10 / 13

Learning Theory Bias-Variance trade off: Test MSE = σ 2 + (Bias(ˆf (x))) 2 + Var(ˆf (x)) High σ means noisy data, nothing much we can do here so we are screwed if the training data is too noisy. High Bias is correlated with underfitting, your model is unable to correctly model the distribution. High Variance is correlated with overfitting, your model is too able to model the distribution. Haojun Midterm Review May 9, 2019 11 / 13

Learning Theory (Supervised) (Fall 2018 Midterm 1b) For the next set of questions, please indicate the effect on Bias and Variance (increase? Decrease? Uncertain?) Assume that we are doing ordinary least squared regression on some data and labels Replace the value of your last feature by a random number for every example of your dataset. Map x R n to φ(x) R d where d > n, φ(x) i = x i for 1 i n, and φ(x) i = f i (x) for n < i d where each f i is some scalar valued, deterministic function over the initial set of features. None of the f i s are just constant values that ignore the input. Stop your optimization algorithm early. Haojun Midterm Review May 9, 2019 12 / 13