Overview

Dataset statistics

Number of variables: 7
Number of observations: 36
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.2 KiB
Average record size in memory: 61.7 B

Variable types

Numeric: 2
Text: 2
DateTime: 1
Categorical: 2

Dataset

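Description: Installation status of facilities subject to soil contamination management in Hanam-si, Gyeonggi-do. Fields include the business name (상호), lot-number address (소재지지번주소), permit registration date (인허가등록일자), and total storage capacity in liters (총저장용량(리터)), among others.
URL: https://www.data.go.kr/data/15116118/fileData.do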

Alerts

데이터기준일자 has constant value "2023-07-03" (Constant)
연번 has unique values (Unique)
상호 has unique values (Unique)
소재지지번주소 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 18:22:02.223969
Analysis finished: 2023-12-12 18:22:03.128208
Duration: 0.9 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.5
Minimum: 1
Maximum: 36
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 456.0 B

Quantile statistics

Minimum: 1
5-th percentile: 2.75
Q1: 9.75
Median: 18.5
Q3: 27.25
95-th percentile: 34.25
Maximum: 36
Range: 35
Interquartile range (IQR): 17.5

Descriptive statistics

Standard deviation: 10.535654
Coefficient of variation (CV): 0.5694948
Kurtosis: -1.2
Mean: 18.5
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 666
Variance: 111
Monotonicity: Strictly increasing
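
Because 연번 is simply the integers 1 through 36, the figures above can be re-derived by hand. The sketch below is one way to do that with Python's standard library. It assumes the report uses the sample variance (n - 1 denominator) and linearly interpolated quantiles; both assumptions are consistent with the reported numbers but are not stated in the report itself.

```python
import statistics

# 연번 runs from 1 to 36 with no gaps (all unique, strictly increasing).
values = list(range(1, 37))

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation

# Quartiles with linear interpolation between the closest ranks.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation: the median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, variance, round(std_dev, 6), round(cv, 7))
print(q1, median, q3, iqr, mad)
```

Kurtosis and skewness are left out because the standard library has no direct function for them.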
Histogram with fixed size bins (bins=36)
Common Values

Value | Count | Frequency (%)
1 | 1 | 2.8%
20 | 1 | 2.8%
22 | 1 | 2.8%
23 | 1 | 2.8%
24 | 1 | 2.8%
25 | 1 | 2.8%
26 | 1 | 2.8%
27 | 1 | 2.8%
28 | 1 | 2.8%
29 | 1 | 2.8%
Other values (26) | 26 | 72.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 2.8%
2 | 1 | 2.8%
3 | 1 | 2.8%
4 | 1 | 2.8%
5 | 1 | 2.8%
6 | 1 | 2.8%
7 | 1 | 2.8%
8 | 1 | 2.8%
9 | 1 | 2.8%
10 | 1 | 2.8%

Maximum 10 values

Value | Count | Frequency (%)
36 | 1 | 2.8%
35 | 1 | 2.8%
34 | 1 | 2.8%
33 | 1 | 2.8%
32 | 1 | 2.8%
31 | 1 | 2.8%
30 | 1 | 2.8%
29 | 1 | 2.8%
28 | 1 | 2.8%
27 | 1 | 2.8%

상호
Text

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 23
Median length: 18
Mean length: 10.194444
Min length: 5

Characters and Unicode

Total characters: 367
Distinct characters: 105
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 100.0%

Sample

1st row: 초이주유소
2nd row: 신한국주유소
3rd row: 오륜주유소
4th row: 약수터주유소
5th row: (주)한경에너지 베스트원주유소
Value | Count | Frequency (%)
한국도로공사 | 2 | 3.9%
에이치디현대오일뱅크(주)직영 | 2 | 3.9%
주식회사 | 2 | 3.9%
초이주유소 | 1 | 2.0%
서하남배다리주유소 | 1 | 2.0%
동서울지사 | 1 | 2.0%
sk서하남ic주유소 | 1 | 2.0%
서하남고속주유소 | 1 | 2.0%
지에스칼텍스(주)황산주유소 | 1 | 2.0%
씨앤에스유통(주 | 1 | 2.0%
Other values (38) | 38 | 74.5%

Most occurring characters

ValueCountFrequency (%)
39
 
10.6%
30
 
8.2%
27
 
7.4%
15
 
4.1%
13
 
3.5%
( 11
 
3.0%
11
 
3.0%
) 11
 
3.0%
10
 
2.7%
9
 
2.5%
Other values (95) 191
52.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 326
88.8%
Space Separator 15
 
4.1%
Open Punctuation 11
 
3.0%
Close Punctuation 11
 
3.0%
Uppercase Letter 4
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
12.0%
30
 
9.2%
27
 
8.3%
13
 
4.0%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (88) 166
50.9%
Uppercase Letter
ValueCountFrequency (%)
I 1
25.0%
K 1
25.0%
S 1
25.0%
C 1
25.0%
Space Separator
ValueCountFrequency (%)
15
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 326
88.8%
Common 37
 
10.1%
Latin 4
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
12.0%
30
 
9.2%
27
 
8.3%
13
 
4.0%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (88) 166
50.9%
Latin
ValueCountFrequency (%)
I 1
25.0%
K 1
25.0%
S 1
25.0%
C 1
25.0%
Common
ValueCountFrequency (%)
15
40.5%
( 11
29.7%
) 11
29.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 326
88.8%
ASCII 41
 
11.2%

Most frequent character per block

Hangul
ValueCountFrequency (%)
39
 
12.0%
30
 
9.2%
27
 
8.3%
13
 
4.0%
11
 
3.4%
10
 
3.1%
9
 
2.8%
7
 
2.1%
7
 
2.1%
7
 
2.1%
Other values (88) 166
50.9%
ASCII
ValueCountFrequency (%)
15
36.6%
( 11
26.8%
) 11
26.8%
I 1
 
2.4%
K 1
 
2.4%
S 1
 
2.4%
C 1
 
2.4%

소재지지번주소
Text

UNIQUE

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 30
Median length: 28
Mean length: 18.777778
Min length: 12

Characters and Unicode

Total characters: 676
Distinct characters: 63
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 36
Unique (%): 100.0%

Sample

1st row: 경기도 하남시 초이동 62-1 (외 1필지)
2nd row: 경기도 하남시 감북동 458-1
3rd row: 경기도 하남시 감일동 349-4
4th row: 경기도 하남시 상산곡동 412-2
5th row: 경기도 하남시 신장동 408-1
Value | Count | Frequency (%)
경기도 | 36 | 23.7%
하남시 | 36 | 23.7%
감북동 | 5 | 3.3%
상산곡동 | 4 | 2.6%
망월동 | 4 | 2.6%
덕풍동 | 4 | 2.6%
춘궁동 | 3 | 2.0%
신장동 | 3 | 2.0%
풍산동 | 3 | 2.0%
천현동 | 3 | 2.0%
Other values (48) | 51 | 33.6%

Most occurring characters

ValueCountFrequency (%)
151
22.3%
37
 
5.5%
37
 
5.5%
36
 
5.3%
36
 
5.3%
36
 
5.3%
36
 
5.3%
36
 
5.3%
- 30
 
4.4%
1 29
 
4.3%
Other values (53) 212
31.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 351
51.9%
Space Separator 151
22.3%
Decimal Number 137
 
20.3%
Dash Punctuation 30
 
4.4%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%
Uppercase Letter 2
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
37
10.5%
37
10.5%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
9
 
2.6%
7
 
2.0%
7
 
2.0%
Other values (36) 74
21.1%
Decimal Number
ValueCountFrequency (%)
1 29
21.2%
3 20
14.6%
2 18
13.1%
4 15
10.9%
8 13
9.5%
9 10
 
7.3%
5 10
 
7.3%
7 9
 
6.6%
0 8
 
5.8%
6 5
 
3.6%
Uppercase Letter
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%
Space Separator
ValueCountFrequency (%)
151
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 30
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 351
51.9%
Common 323
47.8%
Latin 2
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
37
10.5%
37
10.5%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
9
 
2.6%
7
 
2.0%
7
 
2.0%
Other values (36) 74
21.1%
Common
ValueCountFrequency (%)
151
46.7%
- 30
 
9.3%
1 29
 
9.0%
3 20
 
6.2%
2 18
 
5.6%
4 15
 
4.6%
8 13
 
4.0%
9 10
 
3.1%
5 10
 
3.1%
7 9
 
2.8%
Other values (5) 18
 
5.6%
Latin
ValueCountFrequency (%)
T 1
50.0%
I 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 351
51.9%
ASCII 325
48.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
151
46.5%
- 30
 
9.2%
1 29
 
8.9%
3 20
 
6.2%
2 18
 
5.5%
4 15
 
4.6%
8 13
 
4.0%
9 10
 
3.1%
5 10
 
3.1%
7 9
 
2.8%
Other values (7) 20
 
6.2%
Hangul
ValueCountFrequency (%)
37
10.5%
37
10.5%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
36
10.3%
9
 
2.6%
7
 
2.0%
7
 
2.0%
Other values (36) 74
21.1%

인허가등록일자
DateTime

Distinct: 29
Distinct (%): 80.6%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B
Minimum: 1996-06-29 00:00:00
Maximum: 2021-03-15 00:00:00
Histogram with fixed size bins (bins=29)

대표업종
Categorical

Distinct: 3
Distinct (%): 8.3%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B
주유소 운영업: 28
<NA>: 7
레미콘 제조업: 1

Length

Max length: 7
Median length: 7
Mean length: 6.4166667
Min length: 4

Unique

Unique: 1
Unique (%): 2.8%

Sample

1st row: 주유소 운영업
2nd row: 주유소 운영업
3rd row: 주유소 운영업
4th row: 주유소 운영업
5th row: 주유소 운영업

Common Values

Value | Count | Frequency (%)
주유소 운영업 | 28 | 77.8%
<NA> | 7 | 19.4%
레미콘 제조업 | 1 | 2.8%
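
The value <NA> appears 7 times in 대표업종, yet the variable reports 0 missing values. One plausible reading, offered here as an assumption rather than something the report states, is that the profiled data held the literal four-character string "<NA>", so it was counted as a category. The reported minimum length of 4 and mean length of 6.4166667 fit that reading, as this small sketch shows.

```python
# Counts copied from the Common Values table for 대표업종.
counts = {"주유소 운영업": 28, "<NA>": 7, "레미콘 제조업": 1}

total = sum(counts.values())

# Mean string length when "<NA>" is treated as an ordinary 4-character string.
mean_length = sum(len(value) * n for value, n in counts.items()) / total

# Assumption: "<NA>" is a placeholder for a missing category, not a real one.
PLACEHOLDERS = {"<NA>"}
effective_missing = sum(n for value, n in counts.items() if value in PLACEHOLDERS)
missing_pct = 100 * effective_missing / total

print(total, round(mean_length, 7), effective_missing, round(missing_pct, 1))
```

If that reading is right, roughly one in five records has no usable business type, which the "Missing cells: 0" figure in the overview does not reflect.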

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
주유소 | 28 | 43.1%
운영업 | 28 | 43.1%
na | 7 | 10.8%
레미콘 | 1 | 1.5%
제조업 | 1 | 1.5%

총저장용량(리터)
Real number (ℝ)

Distinct: 24
Distinct (%): 66.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 321411.11
Minimum: 44800
Maximum: 1000000
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 456.0 B

Quantile statistics

Minimum: 44800
5-th percentile: 50000
Q1: 200000
Median: 255000
Q3: 367500
95-th percentile: 812500
Maximum: 1000000
Range: 955200
Interquartile range (IQR): 167500

Descriptive statistics

Standard deviation: 224334.66
Coefficient of variation (CV): 0.69796796
Kurtosis: 1.9765096
Mean: 321411.11
Median Absolute Deviation (MAD): 95000
Skewness: 1.480392
Sum: 11570800
Variance: 5.0326039 × 10^10
Monotonicity: Not monotonic
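
Several of these figures are derived from the others, so they can be cross-checked without the raw data. The sketch below uses only numbers copied from this report; the variable names are illustrative.

```python
import math

# Values copied from the report for 총저장용량(리터).
n = 36
total = 11_570_800
minimum, maximum = 44_800, 1_000_000
q1, q3 = 200_000, 367_500
std_dev = 224_334.66
variance = 5.0326039e10

mean = total / n               # mean = sum / number of observations
value_range = maximum - minimum
iqr = q3 - q1
cv = std_dev / mean            # coefficient of variation

# The variance should equal the squared standard deviation, up to rounding.
variance_matches = math.isclose(std_dev ** 2, variance, rel_tol=1e-6)

print(round(mean, 2), value_range, iqr, round(cv, 6), variance_matches)
```

The positive skewness (1.480392) together with a mean well above the median (321411.11 versus 255000) indicates that a few large facilities pull the average up.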
Histogram with fixed size bins (bins=24)
Common Values

Value | Count | Frequency (%)
200000 | 5 | 13.9%
260000 | 3 | 8.3%
350000 | 2 | 5.6%
160000 | 2 | 5.6%
250000 | 2 | 5.6%
230000 | 2 | 5.6%
500000 | 2 | 5.6%
50000 | 2 | 5.6%
340000 | 1 | 2.8%
320000 | 1 | 2.8%
Other values (14) | 14 | 38.9%

Minimum 10 values

Value | Count | Frequency (%)
44800 | 1 | 2.8%
50000 | 2 | 5.6%
118000 | 1 | 2.8%
140000 | 1 | 2.8%
158000 | 1 | 2.8%
160000 | 2 | 5.6%
200000 | 5 | 13.9%
220000 | 1 | 2.8%
230000 | 2 | 5.6%
250000 | 2 | 5.6%

Maximum 10 values

Value | Count | Frequency (%)
1000000 | 1 | 2.8%
850000 | 1 | 2.8%
800000 | 1 | 2.8%
690000 | 1 | 2.8%
600000 | 1 | 2.8%
500000 | 2 | 5.6%
400000 | 1 | 2.8%
390000 | 1 | 2.8%
360000 | 1 | 2.8%
350000 | 2 | 5.6%

데이터기준일자
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B
2023-07-03: 36

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-07-03
2nd row: 2023-07-03
3rd row: 2023-07-03
4th row: 2023-07-03
5th row: 2023-07-03

Common Values

Value | Count | Frequency (%)
2023-07-03 | 36 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-07-03 | 36 | 100.0%

Interactions

[Figures: four interaction plots]

Correlations

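Variable | 연번 | 상호 | 소재지지번주소 | 인허가등록일자 | 대표업종 | 총저장용량(리터)
연번 | 1.000 | 1.000 | 1.000 | 0.918 | 0.643 | 0.000
상호 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
소재지지번주소 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
인허가등록일자 | 0.918 | 1.000 | 1.000 | 1.000 | 1.000 | 0.914
대표업종 | 0.643 | 1.000 | 1.000 | 1.000 | 1.000 | 0.643
총저장용량(리터) | 0.000 | 1.000 | 1.000 | 0.914 | 0.643 | 1.000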
Variable | 연번 | 총저장용량(리터) | 대표업종
연번 | 1.000 | 0.191 | 0.408
총저장용량(리터) | 0.191 | 1.000 | 0.408
대표업종 | 0.408 | 0.408 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

(index) | 연번 | 상호 | 소재지지번주소 | 인허가등록일자 | 대표업종 | 총저장용량(리터) | 데이터기준일자
0 | 1 | 초이주유소 | 경기도 하남시 초이동 62-1 (외 1필지) | 1996-06-29 | 주유소 운영업 | 350000 | 2023-07-03
1 | 2 | 신한국주유소 | 경기도 하남시 감북동 458-1 | 1996-07-02 | 주유소 운영업 | 800000 | 2023-07-03
2 | 3 | 오륜주유소 | 경기도 하남시 감일동 349-4 | 1996-07-05 | 주유소 운영업 | 140000 | 2023-07-03
3 | 4 | 약수터주유소 | 경기도 하남시 상산곡동 412-2 | 1996-07-10 | 주유소 운영업 | 160000 | 2023-07-03
4 | 5 | (주)한경에너지 베스트원주유소 | 경기도 하남시 신장동 408-1 | 1996-07-10 | 주유소 운영업 | 200000 | 2023-07-03
5 | 6 | 에이치디현대오일뱅크(주)직영 동부셀프주유소 | 경기도 하남시 신장동 434-2 | 1996-07-10 | 주유소 운영업 | 200000 | 2023-07-03
6 | 7 | 동운주유소 | 경기도 하남시 풍산동 84-1 | 1996-07-10 | 주유소 운영업 | 250000 | 2023-07-03
7 | 8 | (주)프라임에너지 | 경기도 하남시 덕풍동 345-38 | 1996-07-10 | 주유소 운영업 | 160000 | 2023-07-03
8 | 9 | (사)한국고속도로휴게시설협회 하남만남주유소 | 경기도 하남시 천현동 181-20 | 1996-07-05 | 주유소 운영업 | 220000 | 2023-07-03
9 | 10 | 씨앤에스에너지(주) 덕풍주유소 | 경기도 하남시 덕풍동 318-29 | 1996-07-05 | 주유소 운영업 | 200000 | 2023-07-03

(index) | 연번 | 상호 | 소재지지번주소 | 인허가등록일자 | 대표업종 | 총저장용량(리터) | 데이터기준일자
26 | 27 | 서하남나들목주유소 | 경기도 하남시 감북동 | 2010-08-31 | 주유소 운영업 | 400000 | 2023-07-03
27 | 28 | 서하남배다리주유소 | 경기도 하남시 감북동 391-17 | 2012-11-13 | <NA> | 350000 | 2023-07-03
28 | 29 | 주식회사 구산에너지 | 경기도 하남시 망월동 835 | 2015-06-22 | 주유소 운영업 | 1000000 | 2023-07-03
29 | 30 | 꽃밭주유소 | 경기도 하남시 망월동 998 | 2017-09-11 | 주유소 운영업 | 500000 | 2023-07-03
30 | 31 | 유한회사 풍산 | 경기도 하남시 풍산동 318-2 | 2018-12-11 | 주유소 운영업 | 850000 | 2023-07-03
31 | 32 | 에스씨에너지(주) 하남풀페이주유소 | 경기도 하남시 초이동 620-2 | 2019-04-10 | <NA> | 250000 | 2023-07-03
32 | 33 | 주식회사 유니드에너지 | 경기도 하남시 덕풍동 834 | 2021-03-15 | 주유소 운영업 | 600000 | 2023-07-03
33 | 34 | 한국산업은행 | 경기도 하남시 망월동 836-1 한국산업은행 IT센터 | 2018-01-31 | <NA> | 118000 | 2023-07-03
34 | 35 | 광암에너지 | 경기도 하남시 광암동 90-2 | 2012-10-17 | <NA> | 320000 | 2023-07-03
35 | 36 | 흥국산업(주) | 경기도 하남시 초이동 535-1 흥국산업 주식회사 | 2016-04-04 | 레미콘 제조업 | 50000 | 2023-07-03
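
The 20 displayed rows happen to include both the earliest and the latest 인허가등록일자 reported for the full dataset. As a minimal sketch, the snippet below tallies registrations per year over only those displayed rows (not all 36 records) and measures the span between the two extremes. The list is transcribed from the sample rows above.

```python
from collections import Counter
from datetime import date

# 인허가등록일자 of the 20 displayed rows (first 10 and last 10 records only).
displayed_dates = [
    "1996-06-29", "1996-07-02", "1996-07-05", "1996-07-10", "1996-07-10",
    "1996-07-10", "1996-07-10", "1996-07-10", "1996-07-05", "1996-07-05",
    "2010-08-31", "2012-11-13", "2015-06-22", "2017-09-11", "2018-12-11",
    "2019-04-10", "2021-03-15", "2018-01-31", "2012-10-17", "2016-04-04",
]

parsed = [date.fromisoformat(text) for text in displayed_dates]

# Number of displayed facilities registered in each calendar year.
per_year = Counter(d.year for d in parsed)

earliest, latest = min(parsed), max(parsed)
span_days = (latest - earliest).days

for year in sorted(per_year):
    print(year, per_year[year])
print(earliest, latest, span_days)
```

All ten of the first rows were registered within a two-week window in mid-1996, which helps explain why only 29 of the 36 dates are distinct.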