Overview

Dataset statistics

Number of variables6
Number of observations37
Missing cells4
Missing cells (%)1.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory54.6 B

Variable types

Numeric3
Text3

Dataset

Description인천광역시 미추홀구 특정토양오염 관리대상 현황에 대한 데이터로 연번, 상호명, 도로명주소, 좌표값 등의 항목을 제공합니다.
Author인천광역시 미추홀구
URLhttps://www.data.go.kr/data/3081243/fileData.do

Alerts

사업자등록번호 has 4 (10.8%) missing valuesMissing
연번 has unique valuesUnique
상호명 has unique valuesUnique
도로명주소 has unique valuesUnique
위도 has unique valuesUnique
경도 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:23:25.883607
Analysis finished2023-12-12 10:23:27.637161
Duration1.75 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19
Minimum1
Maximum37
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-12T19:23:27.720867image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.8
Q110
median19
Q328
95-th percentile35.2
Maximum37
Range36
Interquartile range (IQR)18

Descriptive statistics

Standard deviation10.824355
Coefficient of variation (CV)0.56970291
Kurtosis-1.2
Mean19
Median Absolute Deviation (MAD)9
Skewness0
Sum703
Variance117.16667
MonotonicityStrictly increasing
2023-12-12T19:23:27.871163image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
1 1
 
2.7%
29 1
 
2.7%
22 1
 
2.7%
23 1
 
2.7%
24 1
 
2.7%
25 1
 
2.7%
26 1
 
2.7%
27 1
 
2.7%
28 1
 
2.7%
30 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
1 1
2.7%
2 1
2.7%
3 1
2.7%
4 1
2.7%
5 1
2.7%
6 1
2.7%
7 1
2.7%
8 1
2.7%
9 1
2.7%
10 1
2.7%
ValueCountFrequency (%)
37 1
2.7%
36 1
2.7%
35 1
2.7%
34 1
2.7%
33 1
2.7%
32 1
2.7%
31 1
2.7%
30 1
2.7%
29 1
2.7%
28 1
2.7%

상호명
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T19:23:28.085948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length21
Median length14
Mean length8.6486486
Min length3

Characters and Unicode

Total characters320
Distinct characters117
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st rowKH에너지㈜직영 청도1주유소
2nd row박문주유소
3rd rowSK에너지㈜남강주유소
4th row석암주유소
5th row동일석유(주)영남주유소
ValueCountFrequency (%)
kh에너지㈜직영 1
 
2.4%
주식회사 1
 
2.4%
명보주유소 1
 
2.4%
학익셀프주유소 1
 
2.4%
탑주유소 1
 
2.4%
인하주유소 1
 
2.4%
이건산업㈜ 1
 
2.4%
sk행복가득주유소 1
 
2.4%
㈜한진인천터미널주유소 1
 
2.4%
sk큰나무셀프주유소 1
 
2.4%
Other values (31) 31
75.6%
2023-12-12T19:23:28.475038image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
 
10.3%
32
 
10.0%
29
 
9.1%
15
 
4.7%
9
 
2.8%
7
 
2.2%
K 5
 
1.6%
5
 
1.6%
5
 
1.6%
5
 
1.6%
Other values (107) 175
54.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 288
90.0%
Other Symbol 15
 
4.7%
Uppercase Letter 10
 
3.1%
Space Separator 4
 
1.2%
Close Punctuation 1
 
0.3%
Open Punctuation 1
 
0.3%
Decimal Number 1
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
33
 
11.5%
32
 
11.1%
29
 
10.1%
9
 
3.1%
7
 
2.4%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
4
 
1.4%
Other values (99) 154
53.5%
Uppercase Letter
ValueCountFrequency (%)
K 5
50.0%
S 4
40.0%
H 1
 
10.0%
Other Symbol
ValueCountFrequency (%)
15
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Decimal Number
ValueCountFrequency (%)
1 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 303
94.7%
Latin 10
 
3.1%
Common 7
 
2.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
 
10.9%
32
 
10.6%
29
 
9.6%
15
 
5.0%
9
 
3.0%
7
 
2.3%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
Other values (100) 158
52.1%
Common
ValueCountFrequency (%)
4
57.1%
) 1
 
14.3%
( 1
 
14.3%
1 1
 
14.3%
Latin
ValueCountFrequency (%)
K 5
50.0%
S 4
40.0%
H 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 288
90.0%
ASCII 17
 
5.3%
None 15
 
4.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
33
 
11.5%
32
 
11.1%
29
 
10.1%
9
 
3.1%
7
 
2.4%
5
 
1.7%
5
 
1.7%
5
 
1.7%
5
 
1.7%
4
 
1.4%
Other values (99) 154
53.5%
None
ValueCountFrequency (%)
15
100.0%
ASCII
ValueCountFrequency (%)
K 5
29.4%
S 4
23.5%
4
23.5%
) 1
 
5.9%
( 1
 
5.9%
1 1
 
5.9%
H 1
 
5.9%

도로명주소
Text

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size428.0 B
2023-12-12T19:23:28.699039image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length22
Mean length18.810811
Min length17

Characters and Unicode

Total characters696
Distinct characters58
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)100.0%

Sample

1st row인천광역시 미추홀구 경원대로 813-9
2nd row인천광역시 미추홀구 석정로 146
3rd row인천광역시 미추홀구 인주대로 43
4th row인천광역시 미추홀구 구월로 35
5th row인천광역시 미추홀구 주안로 22
ValueCountFrequency (%)
인천광역시 37
25.3%
미추홀구 37
25.3%
인주대로 5
 
3.4%
석정로 3
 
2.1%
소성로 3
 
2.1%
인하로 3
 
2.1%
217 2
 
1.4%
경인로 2
 
1.4%
방축로 2
 
1.4%
매소홀로 2
 
1.4%
Other values (46) 50
34.2%
2023-12-12T19:23:29.241351image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
109
15.7%
48
 
6.9%
40
 
5.7%
38
 
5.5%
37
 
5.3%
37
 
5.3%
37
 
5.3%
37
 
5.3%
37
 
5.3%
37
 
5.3%
Other values (48) 239
34.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 468
67.2%
Decimal Number 115
 
16.5%
Space Separator 109
 
15.7%
Dash Punctuation 4
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
10.3%
40
8.5%
38
8.1%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
Other values (36) 83
17.7%
Decimal Number
ValueCountFrequency (%)
2 20
17.4%
4 18
15.7%
3 18
15.7%
1 16
13.9%
9 10
8.7%
8 9
7.8%
5 9
7.8%
6 9
7.8%
7 4
 
3.5%
0 2
 
1.7%
Space Separator
ValueCountFrequency (%)
109
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 468
67.2%
Common 228
32.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
10.3%
40
8.5%
38
8.1%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
Other values (36) 83
17.7%
Common
ValueCountFrequency (%)
109
47.8%
2 20
 
8.8%
4 18
 
7.9%
3 18
 
7.9%
1 16
 
7.0%
9 10
 
4.4%
8 9
 
3.9%
5 9
 
3.9%
6 9
 
3.9%
- 4
 
1.8%
Other values (2) 6
 
2.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 468
67.2%
ASCII 228
32.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
109
47.8%
2 20
 
8.8%
4 18
 
7.9%
3 18
 
7.9%
1 16
 
7.0%
9 10
 
4.4%
8 9
 
3.9%
5 9
 
3.9%
6 9
 
3.9%
- 4
 
1.8%
Other values (2) 6
 
2.6%
Hangul
ValueCountFrequency (%)
48
10.3%
40
8.5%
38
8.1%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
37
7.9%
Other values (36) 83
17.7%

위도
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.456614
Minimum37.438233
Maximum37.481295
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-12T19:23:29.413829image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.438233
5-th percentile37.438533
Q137.447081
median37.455594
Q337.465581
95-th percentile37.476684
Maximum37.481295
Range0.04306213
Interquartile range (IQR)0.01849949

Descriptive statistics

Standard deviation0.012766482
Coefficient of variation (CV)0.00034083385
Kurtosis-0.98969609
Mean37.456614
Median Absolute Deviation (MAD)0.00998715
Skewness0.25536469
Sum1385.8947
Variance0.00016298305
MonotonicityNot monotonic
2023-12-12T19:23:29.587191image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
37.45517142 1
 
2.7%
37.48129491 1
 
2.7%
37.4421968 1
 
2.7%
37.47561833 1
 
2.7%
37.44940027 1
 
2.7%
37.47990252 1
 
2.7%
37.44338699 1
 
2.7%
37.44250623 1
 
2.7%
37.43839941 1
 
2.7%
37.46558074 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
37.43823278 1
2.7%
37.43839941 1
2.7%
37.43856676 1
2.7%
37.43959923 1
2.7%
37.43975265 1
2.7%
37.4421968 1
2.7%
37.44250623 1
2.7%
37.44338699 1
2.7%
37.444423 1
2.7%
37.44708125 1
2.7%
ValueCountFrequency (%)
37.48129491 1
2.7%
37.47990252 1
2.7%
37.47587918 1
2.7%
37.47561833 1
2.7%
37.47503899 1
2.7%
37.47362682 1
2.7%
37.4681873 1
2.7%
37.46686457 1
2.7%
37.46644643 1
2.7%
37.46558074 1
2.7%

경도
Real number (ℝ)

UNIQUE 

Distinct37
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.66533
Minimum126.63056
Maximum126.70152
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size465.0 B
2023-12-12T19:23:29.777395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum126.63056
5-th percentile126.63953
Q1126.65235
median126.66544
Q3126.67887
95-th percentile126.69162
Maximum126.70152
Range0.0709561
Interquartile range (IQR)0.0265194

Descriptive statistics

Standard deviation0.017376784
Coefficient of variation (CV)0.00013718658
Kurtosis-0.63265815
Mean126.66533
Median Absolute Deviation (MAD)0.0134285
Skewness0.04292404
Sum4686.6172
Variance0.00030195261
MonotonicityNot monotonic
2023-12-12T19:23:29.970786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
126.6890938 1
 
2.7%
126.6575332 1
 
2.7%
126.6654415 1
 
2.7%
126.6569798 1
 
2.7%
126.6625188 1
 
2.7%
126.6617945 1
 
2.7%
126.650359 1
 
2.7%
126.7015187 1
 
2.7%
126.6814244 1
 
2.7%
126.6910039 1
 
2.7%
Other values (27) 27
73.0%
ValueCountFrequency (%)
126.6305626 1
2.7%
126.63437 1
2.7%
126.6408204 1
2.7%
126.6428675 1
2.7%
126.6445319 1
2.7%
126.6458259 1
2.7%
126.650359 1
2.7%
126.6509498 1
2.7%
126.651384 1
2.7%
126.6523506 1
2.7%
ValueCountFrequency (%)
126.7015187 1
2.7%
126.6940601 1
2.7%
126.6910039 1
2.7%
126.6890938 1
2.7%
126.685997 1
2.7%
126.6857896 1
2.7%
126.6831402 1
2.7%
126.6814244 1
2.7%
126.6802912 1
2.7%
126.67887 1
2.7%

사업자등록번호
Text

MISSING 

Distinct31
Distinct (%)93.9%
Missing4
Missing (%)10.8%
Memory size428.0 B
2023-12-12T19:23:30.214661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters396
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique30 ?
Unique (%)90.9%

Sample

1st row476-85-00878
2nd row121-10-61981
3rd row101-86-60120
4th row112-72-82146
5th row121-13-22198
ValueCountFrequency (%)
124-81-00718 3
 
9.1%
121-21-72145 1
 
3.0%
476-85-00878 1
 
3.0%
765-85-01855 1
 
3.0%
121-14-06275 1
 
3.0%
121-85-22889 1
 
3.0%
808-85-01583 1
 
3.0%
121-85-36291 1
 
3.0%
171-03-01714 1
 
3.0%
121-25-87549 1
 
3.0%
Other values (21) 21
63.6%
2023-12-12T19:23:30.672074image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 92
23.2%
- 66
16.7%
8 39
9.8%
0 35
 
8.8%
2 31
 
7.8%
5 28
 
7.1%
7 25
 
6.3%
3 25
 
6.3%
4 23
 
5.8%
6 18
 
4.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 330
83.3%
Dash Punctuation 66
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 92
27.9%
8 39
11.8%
0 35
 
10.6%
2 31
 
9.4%
5 28
 
8.5%
7 25
 
7.6%
3 25
 
7.6%
4 23
 
7.0%
6 18
 
5.5%
9 14
 
4.2%
Dash Punctuation
ValueCountFrequency (%)
- 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 396
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 92
23.2%
- 66
16.7%
8 39
9.8%
0 35
 
8.8%
2 31
 
7.8%
5 28
 
7.1%
7 25
 
6.3%
3 25
 
6.3%
4 23
 
5.8%
6 18
 
4.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 396
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 92
23.2%
- 66
16.7%
8 39
9.8%
0 35
 
8.8%
2 31
 
7.8%
5 28
 
7.1%
7 25
 
6.3%
3 25
 
6.3%
4 23
 
5.8%
6 18
 
4.5%

Interactions

2023-12-12T19:23:27.089100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.169570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.817343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:27.206699image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.297308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.906977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:27.319155image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.699224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T19:23:26.993003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T19:23:30.817114image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호명도로명주소위도경도사업자등록번호
연번1.0001.0001.0000.0000.2500.000
상호명1.0001.0001.0001.0001.0001.000
도로명주소1.0001.0001.0001.0001.0001.000
위도0.0001.0001.0001.0000.3170.853
경도0.2501.0001.0000.3171.0000.972
사업자등록번호0.0001.0001.0000.8530.9721.000
2023-12-12T19:23:30.929104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번위도경도
연번1.000-0.138-0.148
위도-0.1381.000-0.054
경도-0.148-0.0541.000

Missing values

2023-12-12T19:23:27.471687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:23:27.595827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호명도로명주소위도경도사업자등록번호
01KH에너지㈜직영 청도1주유소인천광역시 미추홀구 경원대로 813-937.455171126.689094476-85-00878
12박문주유소인천광역시 미추홀구 석정로 14637.468187126.651384121-10-61981
23SK에너지㈜남강주유소인천광역시 미추홀구 인주대로 4337.45971126.642867101-86-60120
34석암주유소인천광역시 미추홀구 구월로 3537.457757126.69406112-72-82146
45동일석유(주)영남주유소인천광역시 미추홀구 주안로 2237.463753126.672074<NA>
56공성주유소인천광역시 미추홀구 인주대로 5937.459048126.644532121-13-22198
67SK네트웍스㈜문학주유소인천광역시 미추홀구 매소홀로 57437.438233126.68579124-81-00718
78한국특수잉크공업㈜인천광역시 미추홀구 방축로 29437.475039126.675615<NA>
89제물포하이웨이주유소인천광역시 미추홀구 경인로 19437.465086126.664135<NA>
910에이치디현대오일뱅크㈜직영미추홀셀프주유소인천광역시 미추홀구 인주대로 39437.451043126.680291124-81-00718
연번상호명도로명주소위도경도사업자등록번호
2728SK큰나무셀프주유소인천광역시 미추홀구 매소홀로53637.438399126.681424171-03-01714
2829케이제이에너지㈜ 경인터미널인천광역시 미추홀구 봉수대로 5437.481295126.657533765-85-01855
2930최고주유소인천광역시 미추홀구 석정로 49837.465581126.691004714-01-01537
3031정해주유소개발㈜인천주유소인천광역시 미추홀구 인주대로 28637.451754126.668018121-25-87549
3132고속주유소인천광역시 미추홀구 아암대로 12837.450948126.630563121-81-81949
3233㈜강원인천광역시 미추홀구 매소홀로 24837.444423126.65095131-85-04457
3334㈜케이피일렉트릭인천광역시 미추홀구 방축로 32837.473627126.67887<NA>
3435㈜강우주신자동차운전전문학원인천광역시 미추홀구 노적산로 66-3637.439753126.655259121-81-75938
3536부흥유류상사인천광역시 미추홀구 인중로16번길19-2437.46395126.64082121-21-72145
3637동양석유인천광역시 미추홀구 인하로 166-237.448775126.666424658-20-00670