Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory5.9 KiB
Average record size in memory60.3 B

Variable types

Numeric3
Categorical1
Text3

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=f9660e60-fe2d-11ea-9867-1b12bf08ea04

Alerts

학제 has constant value ""Constant
미세먼지 수치 is highly overall correlated with 대기질 지수High correlation
대기질 지수 is highly overall correlated with 미세먼지 수치High correlation
고유번호 has unique valuesUnique
학교명 has unique valuesUnique
주소 has unique valuesUnique
연락처 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:31:58.743922
Analysis finished2023-12-10 13:32:01.253577
Duration2.51 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

고유번호
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.72
Minimum1
Maximum101
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:32:01.365596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile96.05
Maximum101
Range100
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.311659
Coefficient of variation (CV)0.57791125
Kurtosis-1.1923899
Mean50.72
Median Absolute Deviation (MAD)25
Skewness0.020258832
Sum5072
Variance859.17333
MonotonicityStrictly increasing
2023-12-10T22:32:01.599456image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
101 1
1.0%
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%

학제
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
어린이집
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row어린이집
2nd row어린이집
3rd row어린이집
4th row어린이집
5th row어린이집

Common Values

ValueCountFrequency (%)
어린이집 100
100.0%

Length

2023-12-10T22:32:01.808075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:32:02.052325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
어린이집 100
100.0%

학교명
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T22:32:02.379403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length6
Mean length7.01
Min length6

Characters and Unicode

Total characters701
Distinct characters155
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row이루숲어린이집
2nd row리틀빌리지어린이집
3rd row하은어린이집
4th row꿈초롱어린이집
5th row패트와매트어린이집
ValueCountFrequency (%)
어린이집 4
 
3.8%
이루숲어린이집 1
 
1.0%
소망어린이집 1
 
1.0%
피노키오어린이집 1
 
1.0%
하나어린이집 1
 
1.0%
퇴계연꽃어린이집 1
 
1.0%
푸른솔어린이집 1
 
1.0%
한솔어린이집 1
 
1.0%
진실어린이집 1
 
1.0%
아이사랑어린이집 1
 
1.0%
Other values (92) 92
87.6%
2023-12-10T22:32:02.937787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
109
15.5%
100
14.3%
100
14.3%
100
14.3%
11
 
1.6%
8
 
1.1%
8
 
1.1%
8
 
1.1%
7
 
1.0%
5
 
0.7%
Other values (145) 245
35.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 693
98.9%
Space Separator 5
 
0.7%
Lowercase Letter 3
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
109
15.7%
100
14.4%
100
14.4%
100
14.4%
11
 
1.6%
8
 
1.2%
8
 
1.2%
8
 
1.2%
7
 
1.0%
5
 
0.7%
Other values (143) 237
34.2%
Space Separator
ValueCountFrequency (%)
5
100.0%
Lowercase Letter
ValueCountFrequency (%)
c 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 693
98.9%
Common 5
 
0.7%
Latin 3
 
0.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
109
15.7%
100
14.4%
100
14.4%
100
14.4%
11
 
1.6%
8
 
1.2%
8
 
1.2%
8
 
1.2%
7
 
1.0%
5
 
0.7%
Other values (143) 237
34.2%
Common
ValueCountFrequency (%)
5
100.0%
Latin
ValueCountFrequency (%)
c 3
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 693
98.9%
ASCII 8
 
1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
109
15.7%
100
14.4%
100
14.4%
100
14.4%
11
 
1.6%
8
 
1.2%
8
 
1.2%
8
 
1.2%
7
 
1.0%
5
 
0.7%
Other values (143) 237
34.2%
ASCII
ValueCountFrequency (%)
5
62.5%
c 3
37.5%

주소
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T22:32:03.331966image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length38
Mean length28.24
Min length17

Characters and Unicode

Total characters2824
Distinct characters157
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row강원도 춘천시 백석골길22번길 21-21 이루숲어린이집(퇴계동)
2nd row강원도 춘천시 영서로 2920 현대아파트 관리동 2층
3rd row강원도 춘천시 안마산로 310 1층 (석사동)
4th row강원도 춘천시 부평길 7 한신아파트 4동 105호(후평동 864)
5th row강원도 춘천시 영서로 2169 103동101호 (퇴계동, 퇴계이안아파트)
ValueCountFrequency (%)
춘천시 100
 
17.9%
강원도 95
 
17.0%
퇴계동 12
 
2.2%
후평동 9
 
1.6%
동면 9
 
1.6%
석사동 7
 
1.3%
11 5
 
0.9%
강원 5
 
0.9%
관리동 5
 
0.9%
동내면 5
 
0.9%
Other values (245) 306
54.8%
2023-12-10T22:32:03.972561image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
486
 
17.2%
1 145
 
5.1%
119
 
4.2%
114
 
4.0%
111
 
3.9%
103
 
3.6%
103
 
3.6%
102
 
3.6%
96
 
3.4%
2 86
 
3.0%
Other values (147) 1359
48.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1608
56.9%
Decimal Number 529
 
18.7%
Space Separator 486
 
17.2%
Close Punctuation 71
 
2.5%
Open Punctuation 71
 
2.5%
Dash Punctuation 32
 
1.1%
Other Punctuation 27
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
119
 
7.4%
114
 
7.1%
111
 
6.9%
103
 
6.4%
103
 
6.4%
102
 
6.3%
96
 
6.0%
67
 
4.2%
62
 
3.9%
38
 
2.4%
Other values (132) 693
43.1%
Decimal Number
ValueCountFrequency (%)
1 145
27.4%
2 86
16.3%
0 65
12.3%
4 48
 
9.1%
3 45
 
8.5%
6 39
 
7.4%
8 28
 
5.3%
9 26
 
4.9%
5 25
 
4.7%
7 22
 
4.2%
Space Separator
ValueCountFrequency (%)
486
100.0%
Close Punctuation
ValueCountFrequency (%)
) 71
100.0%
Open Punctuation
ValueCountFrequency (%)
( 71
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 32
100.0%
Other Punctuation
ValueCountFrequency (%)
, 27
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1608
56.9%
Common 1216
43.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
119
 
7.4%
114
 
7.1%
111
 
6.9%
103
 
6.4%
103
 
6.4%
102
 
6.3%
96
 
6.0%
67
 
4.2%
62
 
3.9%
38
 
2.4%
Other values (132) 693
43.1%
Common
ValueCountFrequency (%)
486
40.0%
1 145
 
11.9%
2 86
 
7.1%
) 71
 
5.8%
( 71
 
5.8%
0 65
 
5.3%
4 48
 
3.9%
3 45
 
3.7%
6 39
 
3.2%
- 32
 
2.6%
Other values (5) 128
 
10.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1608
56.9%
ASCII 1216
43.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
486
40.0%
1 145
 
11.9%
2 86
 
7.1%
) 71
 
5.8%
( 71
 
5.8%
0 65
 
5.3%
4 48
 
3.9%
3 45
 
3.7%
6 39
 
3.2%
- 32
 
2.6%
Other values (5) 128
 
10.5%
Hangul
ValueCountFrequency (%)
119
 
7.4%
114
 
7.1%
111
 
6.9%
103
 
6.4%
103
 
6.4%
102
 
6.3%
96
 
6.0%
67
 
4.2%
62
 
3.9%
38
 
2.4%
Other values (132) 693
43.1%

연락처
Text

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2023-12-10T22:32:04.362463image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.02
Min length12

Characters and Unicode

Total characters1202
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)100.0%

Sample

1st row033-243-8833
2nd row033-252-8513
3rd row033-252-8476
4th row033-256-0879
5th row070-8688-8616
ValueCountFrequency (%)
033-243-8833 1
 
1.0%
033-243-9567 1
 
1.0%
033-255-2496 1
 
1.0%
033-262-2297 1
 
1.0%
033-243-5055 1
 
1.0%
033-244-5024 1
 
1.0%
033-242-0660 1
 
1.0%
033-241-7735 1
 
1.0%
033-255-4885 1
 
1.0%
033-257-5913 1
 
1.0%
Other values (90) 90
90.0%
2023-12-10T22:32:04.912668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3 256
21.3%
- 200
16.6%
2 167
13.9%
0 139
11.6%
5 82
 
6.8%
4 75
 
6.2%
6 69
 
5.7%
1 66
 
5.5%
7 61
 
5.1%
8 45
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1002
83.4%
Dash Punctuation 200
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 256
25.5%
2 167
16.7%
0 139
13.9%
5 82
 
8.2%
4 75
 
7.5%
6 69
 
6.9%
1 66
 
6.6%
7 61
 
6.1%
8 45
 
4.5%
9 42
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 200
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1202
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 256
21.3%
- 200
16.6%
2 167
13.9%
0 139
11.6%
5 82
 
6.8%
4 75
 
6.2%
6 69
 
5.7%
1 66
 
5.5%
7 61
 
5.1%
8 45
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1202
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 256
21.3%
- 200
16.6%
2 167
13.9%
0 139
11.6%
5 82
 
6.8%
4 75
 
6.2%
6 69
 
5.7%
1 66
 
5.5%
7 61
 
5.1%
8 45
 
3.7%

미세먼지 수치
Real number (ℝ)

HIGH CORRELATION 

Distinct9
Distinct (%)9.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.24322
Minimum17.924
Maximum19.017
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:32:05.097641image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum17.924
5-th percentile17.924
Q118.061
median18.061
Q318.436
95-th percentile18.8274
Maximum19.017
Range1.093
Interquartile range (IQR)0.375

Descriptive statistics

Standard deviation0.35099361
Coefficient of variation (CV)0.019239674
Kurtosis-0.49215819
Mean18.24322
Median Absolute Deviation (MAD)0
Skewness1.0724145
Sum1824.322
Variance0.12319652
MonotonicityNot monotonic
2023-12-10T22:32:05.255061image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=9)
ValueCountFrequency (%)
18.061 53
53.0%
17.924 17
 
17.0%
18.819 16
 
16.0%
18.436 4
 
4.0%
18.43 3
 
3.0%
18.624 2
 
2.0%
18.987 2
 
2.0%
19.002 2
 
2.0%
19.017 1
 
1.0%
ValueCountFrequency (%)
17.924 17
 
17.0%
18.061 53
53.0%
18.43 3
 
3.0%
18.436 4
 
4.0%
18.624 2
 
2.0%
18.819 16
 
16.0%
18.987 2
 
2.0%
19.002 2
 
2.0%
19.017 1
 
1.0%
ValueCountFrequency (%)
19.017 1
 
1.0%
19.002 2
 
2.0%
18.987 2
 
2.0%
18.819 16
 
16.0%
18.624 2
 
2.0%
18.436 4
 
4.0%
18.43 3
 
3.0%
18.061 53
53.0%
17.924 17
 
17.0%

대기질 지수
Real number (ℝ)

HIGH CORRELATION 

Distinct8
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.90308
Minimum0.887
Maximum0.941
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:32:05.417368image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.887
5-th percentile0.887
Q10.894
median0.894
Q30.913
95-th percentile0.9324
Maximum0.941
Range0.054
Interquartile range (IQR)0.019

Descriptive statistics

Standard deviation0.017562402
Coefficient of variation (CV)0.019447228
Kurtosis-0.50725181
Mean0.90308
Median Absolute Deviation (MAD)0
Skewness1.0664157
Sum90.308
Variance0.00030843798
MonotonicityNot monotonic
2023-12-10T22:32:05.604064image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=8)
ValueCountFrequency (%)
0.894 53
53.0%
0.887 17
 
17.0%
0.932 16
 
16.0%
0.913 4
 
4.0%
0.912 3
 
3.0%
0.941 3
 
3.0%
0.922 2
 
2.0%
0.94 2
 
2.0%
ValueCountFrequency (%)
0.887 17
 
17.0%
0.894 53
53.0%
0.912 3
 
3.0%
0.913 4
 
4.0%
0.922 2
 
2.0%
0.932 16
 
16.0%
0.94 2
 
2.0%
0.941 3
 
3.0%
ValueCountFrequency (%)
0.941 3
 
3.0%
0.94 2
 
2.0%
0.932 16
 
16.0%
0.922 2
 
2.0%
0.913 4
 
4.0%
0.912 3
 
3.0%
0.894 53
53.0%
0.887 17
 
17.0%

Interactions

2023-12-10T22:32:00.574999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:31:59.761554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.182913image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.712076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:31:59.900871image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.303406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.846340image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.049356image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:32:00.438575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:32:05.753448image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호학교명주소연락처미세먼지 수치대기질 지수
고유번호1.0001.0001.0001.0000.0000.000
학교명1.0001.0001.0001.0001.0001.000
주소1.0001.0001.0001.0001.0001.000
연락처1.0001.0001.0001.0001.0001.000
미세먼지 수치0.0001.0001.0001.0001.0001.000
대기질 지수0.0001.0001.0001.0001.0001.000
2023-12-10T22:32:05.922713image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
고유번호미세먼지 수치대기질 지수
고유번호1.000-0.091-0.091
미세먼지 수치-0.0911.0001.000
대기질 지수-0.0911.0001.000

Missing values

2023-12-10T22:32:00.998800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:32:01.183015image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

고유번호학제학교명주소연락처미세먼지 수치대기질 지수
01어린이집이루숲어린이집강원도 춘천시 백석골길22번길 21-21 이루숲어린이집(퇴계동)033-243-883317.9240.887
12어린이집리틀빌리지어린이집강원도 춘천시 영서로 2920 현대아파트 관리동 2층033-252-851318.430.912
23어린이집하은어린이집강원도 춘천시 안마산로 310 1층 (석사동)033-252-847618.8190.932
34어린이집꿈초롱어린이집강원도 춘천시 부평길 7 한신아파트 4동 105호(후평동 864)033-256-087918.0610.894
45어린이집패트와매트어린이집강원도 춘천시 영서로 2169 103동101호 (퇴계동, 퇴계이안아파트)070-8688-861618.0610.894
56어린이집배꼽어린이집강원도 춘천시 영서로 2920 109동 101호(사농동, 현대아파트)033-256-719118.430.912
67어린이집친구어린이집강원도 춘천시 벌말길 19 (석사동)033-264-870818.0610.894
78어린이집아띠어린이집강원도 춘천시 우석로101번길 86 105동 101호(석사동, 대우아파트)033-263-123318.0610.894
89어린이집연두어린이집강원도 춘천시 대룡산길 132-12 (사암리 572-2)033-262-144318.8190.932
910어린이집창의나라어린이집강원 춘천시 석사동 893 현진에버빌 104-101033-257-832818.8190.932
고유번호학제학교명주소연락처미세먼지 수치대기질 지수
9092어린이집퇴계좋은어린이집강원도 춘천시 승지골길16번길 47 1008동 102호(퇴계동, 퇴계뜨란채아파트)033-256-221818.8190.932
9193어린이집아이맘어린이집강원도 춘천시 후석로228번길 24 211동 101호(후평동, 석사2차아파트)033-254-454418.0610.894
9294어린이집호반 어린이집강원도 춘천시 소양로163번길 55 (근화동)033-242-772217.9240.887
9395어린이집나무별어린이집강원도 춘천시 효석로135번길 15-6 (석사동)033-253-990218.0610.894
9496어린이집햇살어린이집강원도 춘천시 우석로85번길 9-13033-261-345218.0610.894
9597어린이집해든어린이집강원도 춘천시 퇴계로 220-19 302동 104호(석사동, 퇴계주공3차아파트)033-261-556918.0610.894
9698어린이집푸른금산어린이집강원도 춘천시 서면 금산길 39-12033-243-915417.9240.887
9799어린이집사임당 어린이집강원도 춘천시 안마산로 214 202동 106호(퇴계동, 퇴계금호타운2차아파트)033-261-402018.8190.932
98100어린이집사랑어린이집강원도 춘천시 향교앞길 5 (교동)033-255-552118.0610.894
99101어린이집하바키즈어린이집강원도 춘천시 남춘천새길 11 ,106동103호(퇴계동, 휴먼시아남춘천1단지)033-261-106918.0610.894