Overview

Dataset statistics

Number of variables4
Number of observations52
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory36.5 B

Variable types

Numeric2
Text2

Dataset

Description연번,단지명,소재지,어린이놀이터 개소
Author강북구
URLhttps://data.seoul.go.kr/dataList/OA-11587/S/1/datasetView.do

Alerts

연번 has unique valuesUnique
단지명 has unique valuesUnique

Reproduction

Analysis started2023-12-11 04:06:52.999258
Analysis finished2023-12-11 04:06:53.855854
Duration0.86 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean26.5
Minimum1
Maximum52
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-11T13:06:53.941575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.55
Q113.75
median26.5
Q339.25
95-th percentile49.45
Maximum52
Range51
Interquartile range (IQR)25.5

Descriptive statistics

Standard deviation15.154757
Coefficient of variation (CV)0.57187763
Kurtosis-1.2
Mean26.5
Median Absolute Deviation (MAD)13
Skewness0
Sum1378
Variance229.66667
MonotonicityStrictly increasing
2023-12-11T13:06:54.144508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.9%
28 1
 
1.9%
30 1
 
1.9%
31 1
 
1.9%
32 1
 
1.9%
33 1
 
1.9%
34 1
 
1.9%
35 1
 
1.9%
36 1
 
1.9%
37 1
 
1.9%
Other values (42) 42
80.8%
ValueCountFrequency (%)
1 1
1.9%
2 1
1.9%
3 1
1.9%
4 1
1.9%
5 1
1.9%
6 1
1.9%
7 1
1.9%
8 1
1.9%
9 1
1.9%
10 1
1.9%
ValueCountFrequency (%)
52 1
1.9%
51 1
1.9%
50 1
1.9%
49 1
1.9%
48 1
1.9%
47 1
1.9%
46 1
1.9%
45 1
1.9%
44 1
1.9%
43 1
1.9%

단지명
Text

UNIQUE 

Distinct52
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-11T13:06:54.437380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length11
Mean length6.8846154
Min length3

Characters and Unicode

Total characters358
Distinct characters113
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique52 ?
Unique (%)100.0%

Sample

1st row벽산라이브파크
2nd row벽산라이브파크(임대)
3rd row경남아너스빌2
4th row삼성래미안
5th rowSH-Ville(임대)
ValueCountFrequency (%)
번동주공 5
 
8.1%
수유벽산 2
 
3.2%
래미안 2
 
3.2%
201동 1
 
1.6%
번동솔그린 1
 
1.6%
번동한양 1
 
1.6%
번동한솔아파트 1
 
1.6%
기산그린 1
 
1.6%
번동한진 1
 
1.6%
번동동문 1
 
1.6%
Other values (46) 46
74.2%
2023-12-11T13:06:54.900468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
17
 
4.7%
14
 
3.9%
12
 
3.4%
11
 
3.1%
10
 
2.8%
9
 
2.5%
( 9
 
2.5%
) 9
 
2.5%
8
 
2.2%
8
 
2.2%
Other values (103) 251
70.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 303
84.6%
Decimal Number 12
 
3.4%
Space Separator 10
 
2.8%
Uppercase Letter 10
 
2.8%
Open Punctuation 9
 
2.5%
Close Punctuation 9
 
2.5%
Lowercase Letter 4
 
1.1%
Dash Punctuation 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
17
 
5.6%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
Other values (83) 201
66.3%
Uppercase Letter
ValueCountFrequency (%)
S 3
30.0%
K 2
20.0%
I 1
 
10.0%
H 1
 
10.0%
V 1
 
10.0%
U 1
 
10.0%
N 1
 
10.0%
Decimal Number
ValueCountFrequency (%)
1 4
33.3%
2 4
33.3%
0 1
 
8.3%
5 1
 
8.3%
4 1
 
8.3%
3 1
 
8.3%
Lowercase Letter
ValueCountFrequency (%)
l 2
50.0%
i 1
25.0%
e 1
25.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 303
84.6%
Common 41
 
11.5%
Latin 14
 
3.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
17
 
5.6%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
Other values (83) 201
66.3%
Common
ValueCountFrequency (%)
10
24.4%
( 9
22.0%
) 9
22.0%
1 4
 
9.8%
2 4
 
9.8%
0 1
 
2.4%
- 1
 
2.4%
5 1
 
2.4%
4 1
 
2.4%
3 1
 
2.4%
Latin
ValueCountFrequency (%)
S 3
21.4%
K 2
14.3%
l 2
14.3%
I 1
 
7.1%
H 1
 
7.1%
V 1
 
7.1%
i 1
 
7.1%
e 1
 
7.1%
U 1
 
7.1%
N 1
 
7.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 303
84.6%
ASCII 55
 
15.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
17
 
5.6%
14
 
4.6%
12
 
4.0%
11
 
3.6%
9
 
3.0%
8
 
2.6%
8
 
2.6%
8
 
2.6%
8
 
2.6%
7
 
2.3%
Other values (83) 201
66.3%
ASCII
ValueCountFrequency (%)
10
18.2%
( 9
16.4%
) 9
16.4%
1 4
 
7.3%
2 4
 
7.3%
S 3
 
5.5%
K 2
 
3.6%
l 2
 
3.6%
0 1
 
1.8%
I 1
 
1.8%
Other values (10) 10
18.2%
Distinct51
Distinct (%)98.1%
Missing0
Missing (%)0.0%
Memory size548.0 B
2023-12-11T13:06:55.187177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length15
Median length14
Mean length12.076923
Min length9

Characters and Unicode

Total characters628
Distinct characters28
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique50 ?
Unique (%)96.2%

Sample

1st row강북구 미아1동 1354
2nd row강북구 미아1동 1354-1
3rd row강북구 미아2동 796
4th row강북구 미아2동 797
5th row강북구 미아2동 797-8
ValueCountFrequency (%)
강북구 51
32.9%
번3동 11
 
7.1%
수유2동 5
 
3.2%
번2동 5
 
3.2%
미아동 5
 
3.2%
미아3동 4
 
2.6%
우이동 3
 
1.9%
미아2동 3
 
1.9%
730 2
 
1.3%
미아5동 2
 
1.3%
Other values (60) 64
41.3%
2023-12-11T13:06:55.610294image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
103
16.4%
52
 
8.3%
52
 
8.3%
52
 
8.3%
52
 
8.3%
3 43
 
6.8%
1 42
 
6.7%
2 30
 
4.8%
5 22
 
3.5%
21
 
3.3%
Other values (18) 159
25.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 296
47.1%
Decimal Number 217
34.6%
Space Separator 103
 
16.4%
Dash Punctuation 12
 
1.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
52
17.6%
52
17.6%
52
17.6%
52
17.6%
21
7.1%
21
7.1%
18
 
6.1%
7
 
2.4%
7
 
2.4%
3
 
1.0%
Other values (6) 11
 
3.7%
Decimal Number
ValueCountFrequency (%)
3 43
19.8%
1 42
19.4%
2 30
13.8%
5 22
10.1%
4 18
8.3%
7 17
 
7.8%
6 15
 
6.9%
0 11
 
5.1%
8 10
 
4.6%
9 9
 
4.1%
Space Separator
ValueCountFrequency (%)
103
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 12
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 332
52.9%
Hangul 296
47.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
52
17.6%
52
17.6%
52
17.6%
52
17.6%
21
7.1%
21
7.1%
18
 
6.1%
7
 
2.4%
7
 
2.4%
3
 
1.0%
Other values (6) 11
 
3.7%
Common
ValueCountFrequency (%)
103
31.0%
3 43
13.0%
1 42
12.7%
2 30
 
9.0%
5 22
 
6.6%
4 18
 
5.4%
7 17
 
5.1%
6 15
 
4.5%
- 12
 
3.6%
0 11
 
3.3%
Other values (2) 19
 
5.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 332
52.9%
Hangul 296
47.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
103
31.0%
3 43
13.0%
1 42
12.7%
2 30
 
9.0%
5 22
 
6.6%
4 18
 
5.4%
7 17
 
5.1%
6 15
 
4.5%
- 12
 
3.6%
0 11
 
3.3%
Other values (2) 19
 
5.7%
Hangul
ValueCountFrequency (%)
52
17.6%
52
17.6%
52
17.6%
52
17.6%
21
7.1%
21
7.1%
18
 
6.1%
7
 
2.4%
7
 
2.4%
3
 
1.0%
Other values (6) 11
 
3.7%

어린이놀이터 개소
Real number (ℝ)

Distinct7
Distinct (%)13.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.1923077
Minimum1
Maximum13
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size600.0 B
2023-12-11T13:06:55.768756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1.5
Q32.25
95-th percentile5
Maximum13
Range12
Interquartile range (IQR)1.25

Descriptive statistics

Standard deviation2.0101552
Coefficient of variation (CV)0.9169129
Kurtosis15.857576
Mean2.1923077
Median Absolute Deviation (MAD)0.5
Skewness3.4011848
Sum114
Variance4.040724
MonotonicityNot monotonic
2023-12-11T13:06:55.904055image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%)
1 26
50.0%
2 13
25.0%
4 6
 
11.5%
3 3
 
5.8%
5 2
 
3.8%
13 1
 
1.9%
6 1
 
1.9%
ValueCountFrequency (%)
1 26
50.0%
2 13
25.0%
3 3
 
5.8%
4 6
 
11.5%
5 2
 
3.8%
6 1
 
1.9%
13 1
 
1.9%
ValueCountFrequency (%)
13 1
 
1.9%
6 1
 
1.9%
5 2
 
3.8%
4 6
 
11.5%
3 3
 
5.8%
2 13
25.0%
1 26
50.0%

Interactions

2023-12-11T13:06:53.436113image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T13:06:53.236824image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T13:06:53.539066image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T13:06:53.330924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T13:06:56.001062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번단지명소재지어린이놀이터 개소
연번1.0001.0001.0000.360
단지명1.0001.0001.0001.000
소재지1.0001.0001.0001.000
어린이놀이터 개소0.3601.0001.0001.000
2023-12-11T13:06:56.112756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번어린이놀이터 개소
연번1.000-0.026
어린이놀이터 개소-0.0261.000

Missing values

2023-12-11T13:06:53.696358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T13:06:53.813154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번단지명소재지어린이놀이터 개소
01벽산라이브파크강북구 미아1동 13545
12벽산라이브파크(임대)강북구 미아1동 1354-12
23경남아너스빌2강북구 미아2동 7961
34삼성래미안강북구 미아2동 7972
45SH-Ville(임대)강북구 미아2동 797-81
56미아현대강북구 미아3동 190-21
67미아신구강북구 미아3동 13521
78신일해피트리강북구 미아3동 13581
89미아요진강북구 미아3동 1521
910경남아너스빌강북구 미아4동 13562
연번단지명소재지어린이놀이터 개소
4243우이대우강북구 우이동 3422
4344래미안 트리베라1차강북구 삼각산동 8134
4445래미안 트리베라2차강북구 삼각산동 8124
4546송천 센트레빌강북구 송천동 4611
4647두산위브트래지움강북구 미아동 8114
4748이너스내안에강북구 미아동 13611
4849꿈의숲롯데캐슬강북구 미아동 42
4950꿈의숲해링턴플레이스강북구 미아동 13694
5051수유현대빌라강북구 수유동 530-11
5152우이빌라강북구 우이동 181-21