1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
|
#+LaTeX_CLASS: beamer
#+LaTeX_CLASS_OPTIONS: [presentation]
#+BEAMER_FRAME_LEVEL: 1
#+BEAMER_HEADER_EXTRA: \usetheme{Warsaw}\usecolortheme{default}\useoutertheme{infolines}\setbeamercovered{transparent}
#+COLUMNS: %45ITEM %10BEAMER_env(Env) %10BEAMER_envargs(Env Args) %4BEAMER_col(Col) %8BEAMER_extra(Extra)
#+PROPERTY: BEAMER_col_ALL 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 :ETC
#+LaTeX_CLASS: beamer
#+LaTeX_CLASS_OPTIONS: [presentation]
#+LaTeX_HEADER: \usepackage[english]{babel} \usepackage{ae,aecompl}
#+LaTeX_HEADER: \usepackage{mathpazo,courier,euler} \usepackage[scaled=.95]{helvet}
#+LaTeX_HEADER: \usepackage{listings}
#+LaTeX_HEADER:\lstset{language=Python, basicstyle=\ttfamily\bfseries,
#+LaTeX_HEADER: commentstyle=\color{red}\itshape, stringstyle=\color{darkgreen},
#+LaTeX_HEADER: showstringspaces=false, keywordstyle=\color{blue}\bfseries}
#+TITLE:
#+AUTHOR: FOSSEE
#+DATE
#+EMAIL: info@fossee.in
#+DESCRIPTION:
#+KEYWORDS:
#+LANGUAGE: en
#+OPTIONS: H:3 num:nil toc:nil \n:nil @:t ::t |:t ^:t -:t f:t *:t <:t
#+OPTIONS: TeX:t LaTeX:nil skip:nil d:nil todo:nil pri:nil tags:not-in-toc
*
#+begin_latex
\begin{center}
\vspace{12pt}
\textcolor{blue}{\huge Statistics}
\end{center}
\vspace{18pt}
\begin{center}
\vspace{10pt}
\includegraphics[scale=0.95]{../images/fossee-logo.png}\\
\vspace{5pt}
\scriptsize Developed by FOSSEE Team, IIT-Bombay. \\
\scriptsize Funded by National Mission on Education through ICT\\
\scriptsize MHRD,Govt. of India\\
\includegraphics[scale=0.30]{../images/iitb-logo.png}\\
\end{center}
#+end_latex
* Objectives
At the end of this tutorial,you will be able to,
- Do statistical operations in Python.
- Sum a set of numbers.
- Find their mean,median and standard deviation.
* Pre-requisite
Spoken tutorial on -
- Loading Data from files.
- Getting started with Lists.
- Accessing Pieces of Arrays.
* Data set
- A;015163;JOSEPH RAJ S;083;042;47;00;72;244;;;
The following are the fields in any given line.
- Region Code which is 'A'
- Roll Number 015163
- Name JOSEPH RAJ S
- Marks of 5 subjects: -- English 083 --
Hindi 042 -- Maths 47 --
Science 35 -- Social 72
- Total marks 244
* Exercise 1
- In the given file football.txt at path /home/fossee/football.txt ,
one column is player name,second is goals at home
and third goals away.
- Find the total goals for each player
- Mean home and away goals
- Standard deviation of home and away goals
* Solution 1
#+begin_src python
L=loadtxt('/home/fossee/football.txt',usecols=(1,2),
delimiter=',')
sum(L,1)
mean(L,0)
std(L,0)
#+end_src python
* Summary
In this tutorial,we have learnt to -
- Do the standard statistical operations sum , mean
median and standard deviation in Python.
- Combine text loading and the statistical operation to solve
real world problems.
* Evaluation
1. Given a two dimensional list,
two\_dimensional\_list=[[3,5,8,2,1],[4,3,6,2,1]]
how do we calculate the mean of each row?
2. Calcutate the median of the given list?
student\_marks=[74,78,56,87,91,82]
3. Suppose there is a file with 6 columns but we wish to load text
only in columns 2,3,4,5. How do we specify that?
* Solutions
1. mean(two\_dimensional\_list,1)
2. median(student\_marks)
3. We use the parameter usecols=(2,3,4,5)
*
#+begin_latex
\begin{block}{}
\begin{center}
\textcolor{blue}{\Large THANK YOU!}
\end{center}
\end{block}
\begin{block}{}
\begin{center}
For more Information, visit our website\\
\url{http://fossee.in/}
\end{center}
\end{block}
#+end_latex
|