Overview

Dataset statistics

Number of variables: 4
Number of observations: 67
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 35.0 B
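
As a rough illustration of how the counts above are defined, the sketch below derives the same kinds of figures from three records copied from the Sample section of this report. It is not ydata-profiling's implementation; the `rows` list and the `dataset_stats` helper are hypothetical names introduced here.

```python
from collections import Counter

# Three records copied from the report's Sample section
# (columns: 연번, 상호, 소재지(지번), 데이터기준일자).
rows = [
    (1, "육군제6335부대", "대구광역시 수성구 가천동 사서함 96-10", "2019-09-11"),
    (2, "제2150부대", "대구광역시 수성구 가천동 사서함96-12-1", "2019-09-11"),
    (3, "어린이회관", "대구광역시 수성구 황금동 626번지 , 637-2", "2019-09-11"),
]


def dataset_stats(records):
    """Hypothetical helper: table-level counts for a list of equal-length tuples."""
    n_obs = len(records)
    n_vars = len(records[0]) if records else 0
    n_cells = n_obs * n_vars
    # A cell counts as missing when it is None or an empty string.
    missing = sum(1 for rec in records for cell in rec if cell is None or cell == "")
    # Every repeat of an already-seen row counts as one duplicate.
    duplicates = sum(count - 1 for count in Counter(records).values())
    return {
        "variables": n_vars,
        "observations": n_obs,
        "missing_cells": missing,
        "missing_pct": 100.0 * missing / n_cells if n_cells else 0.0,
        "duplicate_rows": duplicates,
    }


stats = dataset_stats(rows)
print(stats)
```

Run on the full 67-row file instead of these three records, the same definitions would be expected to give 4 variables, 67 observations, and no missing cells or duplicate rows, matching the table above.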

Variable types

Numeric: 1
Text: 2
DateTime: 1

Dataset

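Description: 대구광역시_수성구_특정토양오염관리대상시설현황_20190911 (Daegu Metropolitan City, Suseong-gu: status of facilities subject to specified soil contamination control, 2019-09-11)
Author: 대구광역시 수성구 (Suseong-gu, Daegu Metropolitan City)
URL: http://data.daegu.go.kr/open/data/dataView.do?dataSetId=3075365&dataSetDetailId=307536529fb791acf2e4&provdMethod=FILE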

Alerts

데이터기준일자 has constant value "2019-09-11" (Constant)
연번 has unique values (Unique)
상호 has unique values (Unique)
소재지(지번) has unique values (Unique)
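
The two alert types listed above come down to counting distinct values per column. The following is a minimal sketch under that assumption, using three records from the Sample section; `classify_column` is a hypothetical helper, not a ydata-profiling function.

```python
def classify_column(values):
    """Hypothetical helper: return "Constant", "Unique", or None for one column."""
    if not values:
        return None
    distinct = len(set(values))
    if distinct == 1:
        # Every row holds the same value.
        return "Constant"
    if distinct == len(values):
        # No value occurs more than once.
        return "Unique"
    return None


# Column-wise view of three records copied from the report's Sample section.
columns = {
    "연번": [1, 2, 3],
    "상호": ["육군제6335부대", "제2150부대", "어린이회관"],
    "소재지(지번)": [
        "대구광역시 수성구 가천동 사서함 96-10",
        "대구광역시 수성구 가천동 사서함96-12-1",
        "대구광역시 수성구 황금동 626번지 , 637-2",
    ],
    "데이터기준일자": ["2019-09-11", "2019-09-11", "2019-09-11"],
}

alerts = {name: classify_column(values) for name, values in columns.items()}
for name, alert in alerts.items():
    print(name, alert)
```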

Reproduction

Analysis started: 2024-04-19 05:40:26.431094
Analysis finished: 2024-04-19 05:40:26.871348
Duration: 0.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 34
Minimum: 1
Maximum: 67
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 735.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.3
Q1: 17.5
Median: 34
Q3: 50.5
95-th percentile: 63.7
Maximum: 67
Range: 66
Interquartile range (IQR): 33
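
The quantiles above are consistent with linear interpolation between neighbouring sorted values. The sketch below reproduces them, assuming the column is the integer sequence 1 to 67 (which the reported minimum, maximum, distinct count, and sum all agree with); `percentile` is a hypothetical helper written for this illustration.

```python
import math


def percentile(sorted_values, q):
    """Linear-interpolation percentile for 0 <= q <= 1 on a sorted list."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Assumption: 연번 is the sequence 1, 2, ..., 67.
serials = list(range(1, 68))

q05 = percentile(serials, 0.05)
q1 = percentile(serials, 0.25)
median = percentile(serials, 0.50)
q3 = percentile(serials, 0.75)
q95 = percentile(serials, 0.95)
iqr = q3 - q1
value_range = serials[-1] - serials[0]

print(round(q05, 1), q1, median, q3, round(q95, 1), iqr, value_range)
```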

Descriptive statistics

Standard deviation: 19.485037
Coefficient of variation (CV): 0.57308932
Kurtosis: -1.2
Mean: 34
Median Absolute Deviation (MAD): 17
Skewness: 0
Sum: 2278
Variance: 379.66667
Monotonicity: Strictly increasing
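
As a cross-check of the figures above, the following sketch recomputes them with Python's standard `statistics` module, again assuming the column is the integer sequence 1 to 67. The variance here is the sample variance (denominator n - 1), which is what matches the reported 379.66667.

```python
import statistics

# Assumption: 연번 is the sequence 1, 2, ..., 67.
serials = list(range(1, 68))

total = sum(serials)
mean = statistics.mean(serials)
variance = statistics.variance(serials)  # sample variance, n - 1 denominator
std = statistics.stdev(serials)          # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(serials)
# Median absolute deviation: median of the distances from the median.
mad = statistics.median([abs(v - median) for v in serials])
# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(serials, serials[1:]))

print(total, mean, round(variance, 5), round(std, 6), round(cv, 8), mad)
```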
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.5%
44 | 1 | 1.5%
50 | 1 | 1.5%
49 | 1 | 1.5%
48 | 1 | 1.5%
47 | 1 | 1.5%
46 | 1 | 1.5%
45 | 1 | 1.5%
43 | 1 | 1.5%
2 | 1 | 1.5%
Other values (57) | 57 | 85.1%

Value | Count | Frequency (%)
1 | 1 | 1.5%
2 | 1 | 1.5%
3 | 1 | 1.5%
4 | 1 | 1.5%
5 | 1 | 1.5%
6 | 1 | 1.5%
7 | 1 | 1.5%
8 | 1 | 1.5%
9 | 1 | 1.5%
10 | 1 | 1.5%

Value | Count | Frequency (%)
67 | 1 | 1.5%
66 | 1 | 1.5%
65 | 1 | 1.5%
64 | 1 | 1.5%
63 | 1 | 1.5%
62 | 1 | 1.5%
61 | 1 | 1.5%
60 | 1 | 1.5%
59 | 1 | 1.5%
58 | 1 | 1.5%

상호 (business name)
Text

UNIQUE

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B

Length

Max length: 16
Median length: 13
Mean length: 7.5074627
Min length: 4

Characters and Unicode

Total characters: 503
Distinct characters: 142
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
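
To make the idea of character properties concrete, the sketch below counts Unicode general categories for the five 상호 values shown in this variable's Sample section, using Python's standard `unicodedata` module. The `CATEGORY_LABELS` mapping from two-letter category codes to the labels used in this report is written out by hand and covers only the seven categories that appear here; this is an illustration, not the profiler's own code.

```python
import unicodedata
from collections import Counter

# Two-letter Unicode general category codes and the labels this report uses.
CATEGORY_LABELS = {
    "Lo": "Other Letter",
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Zs": "Space Separator",
    "So": "Other Symbol",
}

# The five sample values listed for 상호 in this report.
names = ["육군제6335부대", "제2150부대", "어린이회관", "수성주유소", "대구은행(주)"]

category_counts = Counter(
    CATEGORY_LABELS.get(unicodedata.category(ch), unicodedata.category(ch))
    for name in names
    for ch in name
)
total_chars = sum(len(name) for name in names)

for label, count in category_counts.most_common():
    print(f"{label}: {count} ({100 * count / total_chars:.1f}%)")
```

Hangul syllables fall under "Other Letter", which is why that category dominates both this small sample and the full column.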

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 육군제6335부대
2nd row: 제2150부대
3rd row: 어린이회관
4th row: 수성주유소
5th row: 대구은행(주)

Value | Count | Frequency (%)
구도일주유소 | 2 | 2.8%
육군제6335부대 | 1 | 1.4%
문화주유소 | 1 | 1.4%
연호주유소 | 1 | 1.4%
수성셀프주유소 | 1 | 1.4%
주)태산이앤엘 | 1 | 1.4%
하나주유소 | 1 | 1.4%
삼보주유소 | 1 | 1.4%
주)태영오일 | 1 | 1.4%
정다운주유소 | 1 | 1.4%
Other values (61) | 61 | 84.7%

Most occurring characters

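Value | Count | Frequency (%)
[unreadable] | 60 | 11.9%
[unreadable] | 60 | 11.9%
[unreadable] | 47 | 9.3%
[unreadable] | 15 | 3.0%
( | 13 | 2.6%
) | 13 | 2.6%
[unreadable] | 9 | 1.8%
[unreadable] | 9 | 1.8%
[unreadable] | 8 | 1.6%
[unreadable] | 8 | 1.6%
Other values (132) | 261 | 51.9%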

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 443 | 88.1%
Decimal Number | 18 | 3.6%
Open Punctuation | 13 | 2.6%
Close Punctuation | 13 | 2.6%
Uppercase Letter | 8 | 1.6%
Space Separator | 5 | 1.0%
Other Symbol | 3 | 0.6%

Most frequent character per category

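Other Letter
Value | Count | Frequency (%)
[unreadable] | 60 | 13.5%
[unreadable] | 60 | 13.5%
[unreadable] | 47 | 10.6%
[unreadable] | 15 | 3.4%
[unreadable] | 9 | 2.0%
[unreadable] | 9 | 2.0%
[unreadable] | 8 | 1.8%
[unreadable] | 8 | 1.8%
[unreadable] | 7 | 1.6%
[unreadable] | 7 | 1.6%
Other values (118) | 213 | 48.1%

Decimal Number
Value | Count | Frequency (%)
1 | 4 | 22.2%
6 | 4 | 22.2%
9 | 3 | 16.7%
2 | 2 | 11.1%
3 | 2 | 11.1%
5 | 2 | 11.1%
0 | 1 | 5.6%

Uppercase Letter
Value | Count | Frequency (%)
S | 4 | 50.0%
G | 2 | 25.0%
K | 2 | 25.0%

Open Punctuation
Value | Count | Frequency (%)
( | 13 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 13 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 5 | 100.0%

Other Symbol
Value | Count | Frequency (%)
[unreadable] | 3 | 100.0%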

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 446 | 88.7%
Common | 49 | 9.7%
Latin | 8 | 1.6%

Most frequent character per script

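Hangul
Value | Count | Frequency (%)
[unreadable] | 60 | 13.5%
[unreadable] | 60 | 13.5%
[unreadable] | 47 | 10.5%
[unreadable] | 15 | 3.4%
[unreadable] | 9 | 2.0%
[unreadable] | 9 | 2.0%
[unreadable] | 8 | 1.8%
[unreadable] | 8 | 1.8%
[unreadable] | 7 | 1.6%
[unreadable] | 7 | 1.6%
Other values (119) | 216 | 48.4%

Common
Value | Count | Frequency (%)
( | 13 | 26.5%
) | 13 | 26.5%
(space) | 5 | 10.2%
1 | 4 | 8.2%
6 | 4 | 8.2%
9 | 3 | 6.1%
2 | 2 | 4.1%
3 | 2 | 4.1%
5 | 2 | 4.1%
0 | 1 | 2.0%

Latin
Value | Count | Frequency (%)
S | 4 | 50.0%
G | 2 | 25.0%
K | 2 | 25.0%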

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 443 | 88.1%
ASCII | 57 | 11.3%
None | 3 | 0.6%

Most frequent character per block

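Hangul
Value | Count | Frequency (%)
[unreadable] | 60 | 13.5%
[unreadable] | 60 | 13.5%
[unreadable] | 47 | 10.6%
[unreadable] | 15 | 3.4%
[unreadable] | 9 | 2.0%
[unreadable] | 9 | 2.0%
[unreadable] | 8 | 1.8%
[unreadable] | 8 | 1.8%
[unreadable] | 7 | 1.6%
[unreadable] | 7 | 1.6%
Other values (118) | 213 | 48.1%

ASCII
Value | Count | Frequency (%)
( | 13 | 22.8%
) | 13 | 22.8%
(space) | 5 | 8.8%
S | 4 | 7.0%
1 | 4 | 7.0%
6 | 4 | 7.0%
9 | 3 | 5.3%
2 | 2 | 3.5%
G | 2 | 3.5%
3 | 2 | 3.5%
Other values (3) | 5 | 8.8%

None
Value | Count | Frequency (%)
[unreadable] | 3 | 100.0%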

소재지(지번) (location, lot-number address)
Text

UNIQUE

Distinct: 67
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B

Length

Max length: 32
Median length: 28
Mean length: 22.119403
Min length: 18

Characters and Unicode

Total characters: 1482
Distinct characters: 52
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2

Unique

Unique: 67
Unique (%): 100.0%

Sample

1st row: 대구광역시 수성구 가천동 사서함 96-10
2nd row: 대구광역시 수성구 가천동 사서함96-12-1
3rd row: 대구광역시 수성구 황금동 626번지 , 637-2
4th row: 대구광역시 수성구 상동 623-2번지
5th row: 대구광역시 수성구 수성동2가 118번지

Value | Count | Frequency (%)
대구광역시 | 67 | 24.5%
수성구 | 67 | 24.5%
만촌동 | 14 | 5.1%
지산동 | 7 | 2.6%
상동 | 6 | 2.2%
황금동 | 6 | 2.2%
범어동 | 6 | 2.2%
파동 | 4 | 1.5%
두산동 | 3 | 1.1%
신매동 | 3 | 1.1%
Other values (83) | 90 | 33.0%
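
The table above counts words rather than whole cell values, which is why 대구광역시 and 수성구 each occur 67 times even though every address is unique. A minimal sketch of that kind of count, assuming plain whitespace tokenization (the profiler's own tokenizer may differ) and using the five sample addresses listed for this variable:

```python
from collections import Counter

# The five sample addresses listed for 소재지(지번) in this report.
addresses = [
    "대구광역시 수성구 가천동 사서함 96-10",
    "대구광역시 수성구 가천동 사서함96-12-1",
    "대구광역시 수성구 황금동 626번지 , 637-2",
    "대구광역시 수성구 상동 623-2번지",
    "대구광역시 수성구 수성동2가 118번지",
]

# Split every address on whitespace and count each resulting token.
word_counts = Counter(word for address in addresses for word in address.split())
total_words = sum(word_counts.values())

for word, count in word_counts.most_common(3):
    print(f"{word} | {count} | {100 * count / total_words:.1f}%")
```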

Most occurring characters

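Value | Count | Frequency (%)
(space) | 273 | 18.4%
[unreadable] | 134 | 9.0%
[unreadable] | 72 | 4.9%
[unreadable] | 70 | 4.7%
[unreadable] | 69 | 4.7%
[unreadable] | 68 | 4.6%
[unreadable] | 67 | 4.5%
[unreadable] | 67 | 4.5%
[unreadable] | 67 | 4.5%
[unreadable] | 67 | 4.5%
Other values (42) | 528 | 35.6%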

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 865 | 58.4%
Decimal Number | 283 | 19.1%
Space Separator | 273 | 18.4%
Dash Punctuation | 58 | 3.9%
Other Punctuation | 1 | 0.1%
Close Punctuation | 1 | 0.1%
Open Punctuation | 1 | 0.1%

Most frequent character per category

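Other Letter
Value | Count | Frequency (%)
[unreadable] | 134 | 15.5%
[unreadable] | 72 | 8.3%
[unreadable] | 70 | 8.1%
[unreadable] | 69 | 8.0%
[unreadable] | 68 | 7.9%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 64 | 7.4%
Other values (27) | 120 | 13.9%

Decimal Number
Value | Count | Frequency (%)
1 | 60 | 21.2%
3 | 35 | 12.4%
2 | 33 | 11.7%
4 | 31 | 11.0%
6 | 25 | 8.8%
7 | 25 | 8.8%
5 | 20 | 7.1%
0 | 19 | 6.7%
9 | 19 | 6.7%
8 | 16 | 5.7%

Space Separator
Value | Count | Frequency (%)
(space) | 273 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 58 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
, | 1 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 1 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 1 | 100.0%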

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 865 | 58.4%
Common | 617 | 41.6%

Most frequent character per script

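Hangul
Value | Count | Frequency (%)
[unreadable] | 134 | 15.5%
[unreadable] | 72 | 8.3%
[unreadable] | 70 | 8.1%
[unreadable] | 69 | 8.0%
[unreadable] | 68 | 7.9%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 64 | 7.4%
Other values (27) | 120 | 13.9%

Common
Value | Count | Frequency (%)
(space) | 273 | 44.2%
1 | 60 | 9.7%
- | 58 | 9.4%
3 | 35 | 5.7%
2 | 33 | 5.3%
4 | 31 | 5.0%
6 | 25 | 4.1%
7 | 25 | 4.1%
5 | 20 | 3.2%
0 | 19 | 3.1%
Other values (5) | 38 | 6.2%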

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 865 | 58.4%
ASCII | 617 | 41.6%

Most frequent character per block

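ASCII
Value | Count | Frequency (%)
(space) | 273 | 44.2%
1 | 60 | 9.7%
- | 58 | 9.4%
3 | 35 | 5.7%
2 | 33 | 5.3%
4 | 31 | 5.0%
6 | 25 | 4.1%
7 | 25 | 4.1%
5 | 20 | 3.2%
0 | 19 | 3.1%
Other values (5) | 38 | 6.2%

Hangul
Value | Count | Frequency (%)
[unreadable] | 134 | 15.5%
[unreadable] | 72 | 8.3%
[unreadable] | 70 | 8.1%
[unreadable] | 69 | 8.0%
[unreadable] | 68 | 7.9%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 67 | 7.7%
[unreadable] | 64 | 7.4%
Other values (27) | 120 | 13.9%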

데이터기준일자 (data reference date)
Date

CONSTANT

Distinct: 1
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 668.0 B
Minimum: 2019-09-11 00:00:00
Maximum: 2019-09-11 00:00:00
Histogram with fixed size bins (bins=1)

Interactions


Correlations

Variable | 연번 | 상호 | 소재지(지번)
연번 | 1.000 | 1.000 | 1.000
상호 | 1.000 | 1.000 | 1.000
소재지(지번) | 1.000 | 1.000 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 상호 | 소재지(지번) | 데이터기준일자
0 | 1 | 육군제6335부대 | 대구광역시 수성구 가천동 사서함 96-10 | 2019-09-11
1 | 2 | 제2150부대 | 대구광역시 수성구 가천동 사서함96-12-1 | 2019-09-11
2 | 3 | 어린이회관 | 대구광역시 수성구 황금동 626번지 , 637-2 | 2019-09-11
3 | 4 | 수성주유소 | 대구광역시 수성구 상동 623-2번지 | 2019-09-11
4 | 5 | 대구은행(주) | 대구광역시 수성구 수성동2가 118번지 | 2019-09-11
5 | 6 | 한국광유(주)청기와주유소 | 대구광역시 수성구 만촌동 132-3번지 (1221-11) | 2019-09-11
6 | 7 | (주)삼우대구공장 | 대구광역시 수성구 사월동 447번지 | 2019-09-11
7 | 8 | 육군제6619부대 | 대구광역시 수성구 만촌동 503-1번지 | 2019-09-11
8 | 9 | 육군제6199부대본부사령실 | 대구광역시 수성구 만촌동 사서함503-17호 | 2019-09-11
9 | 10 | SK네트웍스(주)제일주유소 | 대구광역시 수성구 범물동 1275-7번지 | 2019-09-11

(index) | 연번 | 상호 | 소재지(지번) | 데이터기준일자
57 | 58 | 메트로팔레스주유소 | 대구광역시 수성구 만촌동 414-18번지 | 2019-09-11
58 | 59 | 가든주유소 | 대구광역시 수성구 범어동 97-11번지 | 2019-09-11
59 | 60 | 수성에스주유소 | 대구광역시 수성구 지산동 1161-24번지 | 2019-09-11
60 | 61 | 대성산업(주)대경주유소 | 대구광역시 수성구 신매동 365-1번지 | 2019-09-11
61 | 62 | 황금기름창고주유소 | 대구광역시 수성구 황금동 887-1번지 외1 | 2019-09-11
62 | 63 | 신화명품주유소 | 대구광역시 수성구 지산동 997-2번지 | 2019-09-11
63 | 64 | 대림주유소 | 대구광역시 수성구 지산동 1205-3번지 | 2019-09-11
64 | 65 | 경상제일주유소 | 대구광역시 수성구 범어동 428-4번지 | 2019-09-11
65 | 66 | 구도일주유소 | 대구광역시 수성구 범어동 165-17번지 | 2019-09-11
66 | 67 | 공군 제1방공유도탄여단 | 대구광역시 수성구 이천동 239번지 | 2019-09-11