- Q1 Bisection method
- Q2 Newton method
- Q3 token = ['1', '[', '2', '[', '3', '4', '5', ']', '6', '[', '7', '8', '[', '9', ']', '10', '[', '11', '12', ']', ']', '13', ']']
View token as a serialized tree: each number immediately before a '[' is the parent of the numbers inside that bracket pair. So the root of the tree is '1', its children are '2', '6', and '13'; '2' and '6' are the parent nodes in the second layer, and '8' and '10' are parents one layer deeper.
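Since the parsing code itself isn't shown here, below is a minimal sketch of one way to rebuild that tree in Python; the stack-based parse and the dict node shape (`value`/`children`) are my own assumptions, not the lab's implementation.

```python
token = ['1', '[', '2', '[', '3', '4', '5', ']', '6', '[', '7', '8',
         '[', '9', ']', '10', '[', '11', '12', ']', ']', '13', ']']

def build_tree(tokens):
    # A number followed by '[' opens a subtree; ']' closes the current one.
    root = {"value": None, "children": []}
    stack = [root]
    for i, t in enumerate(tokens):
        if t == '[':
            continue  # already handled when its parent number was seen
        if t == ']':
            stack.pop()  # close the current subtree
            continue
        node = {"value": t, "children": []}
        stack[-1]["children"].append(node)
        if i + 1 < len(tokens) and tokens[i + 1] == '[':
            stack.append(node)  # this number is a parent
    return root["children"][0]

tree = build_tree(token)
print(tree["value"], [c["value"] for c in tree["children"]])  # 1 ['2', '6', '13']
```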
- Optimal binning algorithm using dynamic programming
https://blog.csdn.net/US_579/article/details/88428294
Error Sum of Squares (SSE)
SSE is the sum of the squared differences between each observation and its group's mean. It can be used as a measure of variation within a cluster: if all cases within a cluster are identical, the SSE equals 0.
The formula for SSE is:

$$SSE = \sum_{i=1}^{n} (x_i - \bar{x})^2$$

where $n$ is the number of observations, $x_i$ is the value of the $i$-th observation, and $\bar{x}$ is the mean of all the observations. This can also be rearranged into the form given in J.H. Ward's paper.
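As a quick sanity check, a few lines of Python (my own helper, not the lab code) reproduce the one-bin costs that appear in the tables below:

```python
import numpy as np

def sse(values):
    # Sum of squared differences between each value and the bin mean.
    arr = np.asarray(values, dtype=float)
    return float(((arr - arr.mean()) ** 2).sum())

print(sse([11, 13, 17]))  # 18.666..., as in the first row below
print(sse([13, 17]))      # 8.0
print(sse([17]))          # 0.0
```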
A plain recursive method recomputes the same sub-problems many times, although its space complexity is lower than dynamic programming's. In this case we apply dynamic programming: as with the Fibonacci sequence, we use previously computed results, stored in memory, to compute the next one. Lab 2 implements the same strategy.
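To make the Fibonacci analogy concrete, here is a small illustration (not the lab code) of plain recursion versus storing previous results:

```python
def fib_recursive(n):
    # Recomputes the same sub-problems exponentially many times.
    return n if n < 2 else fib_recursive(n - 1) + fib_recursive(n - 2)

def fib_dp(n):
    # Stores each result once and reuses it: linear time.
    memo = [0, 1]
    for i in range(2, n + 1):
        memo.append(memo[i - 1] + memo[i - 2])
    return memo[n]

print(fib_dp(30) == fib_recursive(30))  # True, but the recursive call is far slower
```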
The first row represents, for each block, the cost of dividing that block into one bin (-1 marks an entry that is not computed yet or not feasible):
[ ] [ ] [ ] [3, 1, 18, 11, 13, 17] -1
[ ] [ ] [ ] [1, 18, 11, 13, 17] -1
[ ] [ ] [ ] [18, 11, 13, 17] -1
[ ] [ ] [ ] [11, 13, 17] 18.666
[ ] [ ] [ ] [13, 17] 8.0
[ ] [ ] [ ] [17] 0
The second row represents, for each block, the cost of dividing that block into two bins:
[ ] [ ] [[3, 1, 18, 11, 13, 17]] -1
[ ] [ ] [[1, 18, 11, 13, 17]] -1
[ ] [ ] [[18, 11, 13, 17]] 18.666
[ ] [ ] [[11, 13, 17]] 2
[ ] [ ] [[13, 17]] 0
[ ] [ ] [[17]] -1
and so on for more bins.
Dividing the problem into sub-problems, we store the first row because it holds the optimal one-bin costs ([11, 13, 17]: 18.666, [13, 17]: 8.0, [17]: 0). When we move to the second row, a block is divided into two bins starting from its first number: [18, 11, 13, 17] can be split into a prefix and a suffix, here [18] and [11, 13, 17]. As we already know from the first row that the cost of [11, 13, 17] is 18.666, and the cost of [18] is 0, we can calculate that the optimal cost of dividing [18, 11, 13, 17] into two bins is 0 + 18.666 = 18.666. Reusing stored sub-problem results like this is the core idea of dynamic programming.
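A minimal sketch of this bottom-up computation (my own layout; the lab's memo table may be organized differently): row b of `cost` holds the optimal cost of putting the suffix `data[i:]` into b + 1 bins, exactly like the rows shown above.

```python
import numpy as np

def sse(values):
    arr = np.asarray(values, dtype=float)
    return float(((arr - arr.mean()) ** 2).sum())

def optimal_binning(data, k):
    # cost[b][i]: minimal total SSE of splitting data[i:] into b + 1 bins.
    n = len(data)
    cost = [[float('inf')] * (n + 1) for _ in range(k)]
    split = [[0] * (n + 1) for _ in range(k)]
    for i in range(n):
        cost[0][i] = sse(data[i:])  # first row: the whole suffix in one bin
    for b in range(1, k):
        for i in range(n - b):
            # Try every prefix data[i:j] as the first bin and reuse the
            # stored cost of data[j:] split into b bins.
            for j in range(i + 1, n - b + 1):
                c = sse(data[i:j]) + cost[b - 1][j]
                if c < cost[b][i]:
                    cost[b][i], split[b][i] = c, j
    # Reconstruct the bins from the stored split points.
    bins, i = [], 0
    for b in range(k - 1, 0, -1):
        j = split[b][i]
        bins.append(data[i:j])
        i = j
    bins.append(data[i:])
    return cost[k - 1][0], bins

print(optimal_binning([3, 1, 18, 11, 13, 17], 3))
# (20.666..., [[3, 1], [18], [11, 13, 17]])
```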
Logistic Regression using Gradient Descent
The main formula to calculate the weights is:

$$\theta := \theta - \alpha\, X^{T} E$$

In this formula, θ is the weight vector (the coefficients), α is the learning rate (how big a step is taken), $X^T$ is the transpose of the data matrix (`X.T` in numpy), and E is the error (cost) term, i.e. the difference between the predictions and the labels.
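A minimal sketch of this update loop (dividing the gradient by the number of samples is my own normalization choice; some versions fold it into α). It uses the sigmoid function described in the next subsection:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gradient_descent(X, y, alpha=0.1, n_iters=1000):
    # theta := theta - alpha * X.T @ E, with E = sigmoid(X @ theta) - y
    theta = np.zeros(X.shape[1])
    for _ in range(n_iters):
        E = sigmoid(X @ theta) - y            # error vector
        theta -= alpha * (X.T @ E) / len(y)   # averaged gradient step
    return theta

# Tiny illustrative run: a bias column plus one feature.
X = np.array([[1.0, 0.5], [1.0, 1.5], [1.0, 3.0], [1.0, 4.0]])
y = np.array([0.0, 0.0, 1.0, 1.0])
print(gradient_descent(X, y))
```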
Function Sigmoid
$$\sigma(z) = \frac{1}{1 + e^{-z}}$$

This function accepts any input and maps it into the interval (0, 1), which makes it suitable for classification. Because $e^{-z}$ is always greater than 0, the denominator $1 + e^{-z}$ is always greater than 1; as a result, the output is always smaller than 1 (and always greater than 0). We can treat this output as a probability value.
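A quick illustration of that bound (the 0.5 decision threshold is my own addition, though it is the usual choice):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

for z in (-10, -1, 0, 1, 10):
    p = sigmoid(z)
    print(f"z={z:>3}  sigmoid={p:.4f}  class={int(p >= 0.5)}")
# Every output stays strictly between 0 and 1.
```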
Multinomial naive Bayes classifier to predict spam SMS. We implement a unigram model here.
- We need to calculate the following prior:

$$P(c_i) = \frac{N_{c_i}}{N_{doc}}$$

which is the probability that class $c_i$ occurs among all documents. In this case we have two classes: one is spam and the other is ham.
- After that, we apply the following equation, which is the likelihood of a word given a class ($w$ represents a word, $c$ represents a class):

$$P(w \mid c) = \frac{count(w, c) + 1}{count(c) + |V|}$$

count(w, c) is the number of times that word occurs in that class, and count(c) is the total number of words in that class. We also use add-1 smoothing here: to avoid any single word getting probability 0, we add 1 to the numerator and add |V|, the size of the vocabulary (the set of all words), to the denominator. Even if a word does not occur in the vocabulary of a class, its probability will not be assigned 0.
- Calculate the conditional probabilities

By applying the equation in step 2, we calculate $P(word \mid class)$ for every word in the SMS, both for spam and for ham. After that we can calculate $P(ham \mid sms)$ and $P(spam \mid sms)$, which are

$$P(c \mid sms) \propto P(c) \prod_{i} P(w_i \mid c)$$

where $i$ runs over every word in the SMS. Using the same equation for spam, we then compare the two scores (their ratio) and we are good to go (:
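A minimal end-to-end sketch of the classifier described above, with add-1 smoothing. The four training messages are made up for illustration, not taken from the lab's SMS data, and I compare log-posteriors rather than the raw ratio; the decision is the same:

```python
import math
from collections import Counter

train = [
    ("spam", "win cash now"),
    ("spam", "win a free prize now"),
    ("ham",  "are we still on for lunch"),
    ("ham",  "see you at lunch"),
]

priors = Counter(label for label, _ in train)            # N_ci per class
word_counts = {"spam": Counter(), "ham": Counter()}      # count(w, c)
for label, sms in train:
    word_counts[label].update(sms.split())
vocab = {w for c in word_counts.values() for w in c}     # V

def log_posterior(label, sms):
    # log P(c) + sum over words of log P(w_i | c), with add-1 smoothing
    logp = math.log(priors[label] / len(train))
    total = sum(word_counts[label].values())             # count(c)
    for w in sms.split():
        logp += math.log((word_counts[label][w] + 1) / (total + len(vocab)))
    return logp

sms = "win a prize"
scores = {c: log_posterior(c, sms) for c in ("spam", "ham")}
print(max(scores, key=scores.get), scores)               # spam wins here
```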
1. OLAP
2. MLE
3. MLE using Binomial distribution
WANZE LIU