Overview

Dataset statistics

Number of variables6
Number of observations94
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.6 KiB
Average record size in memory50.4 B

Variable types

Numeric1
Text2
DateTime2
Categorical1

Dataset

Description경상남도 사천시의 특정토양 오염관리 대상에 관한 현황 정보(상호, 소재지, 완공일자 등)를 공공데이터로 제공합니다.
Author경상남도 사천시
URLhttps://www.data.go.kr/data/15005200/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
연번 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 연번High correlation
비고 is highly imbalanced (65.8%)Imbalance
연번 has unique valuesUnique
상호 has unique valuesUnique
소재지(지번) has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:00:44.789806
Analysis finished2023-12-12 19:00:45.623383
Duration0.83 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct94
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean47.5
Minimum1
Maximum94
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size978.0 B
2023-12-13T04:00:45.763777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.65
Q124.25
median47.5
Q370.75
95-th percentile89.35
Maximum94
Range93
Interquartile range (IQR)46.5

Descriptive statistics

Standard deviation27.279418
Coefficient of variation (CV)0.57430354
Kurtosis-1.2
Mean47.5
Median Absolute Deviation (MAD)23.5
Skewness0
Sum4465
Variance744.16667
MonotonicityStrictly increasing
2023-12-13T04:00:46.040292image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.1%
61 1
 
1.1%
70 1
 
1.1%
69 1
 
1.1%
68 1
 
1.1%
67 1
 
1.1%
66 1
 
1.1%
65 1
 
1.1%
64 1
 
1.1%
63 1
 
1.1%
Other values (84) 84
89.4%
ValueCountFrequency (%)
1 1
1.1%
2 1
1.1%
3 1
1.1%
4 1
1.1%
5 1
1.1%
6 1
1.1%
7 1
1.1%
8 1
1.1%
9 1
1.1%
10 1
1.1%
ValueCountFrequency (%)
94 1
1.1%
93 1
1.1%
92 1
1.1%
91 1
1.1%
90 1
1.1%
89 1
1.1%
88 1
1.1%
87 1
1.1%
86 1
1.1%
85 1
1.1%

상호
Text

UNIQUE 

Distinct94
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size884.0 B
2023-12-13T04:00:46.483657image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length15.5
Mean length8.212766
Min length4

Characters and Unicode

Total characters772
Distinct characters167
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)100.0%

Sample

1st row용장군주유소
2nd row송포새한주유소
3rd row동백제3주유소
4th row믿음주유소
5th row삼일주유소
ValueCountFrequency (%)
주식회사 3
 
2.8%
주)에어로코텍 2
 
1.9%
합동주유소 1
 
0.9%
브리티쉬아메리칸토바코코리아(주 1
 
0.9%
현대모비스(주)진주부품사업소 1
 
0.9%
육군제3718-1부대 1
 
0.9%
주)월성주유소 1
 
0.9%
대동기어(주 1
 
0.9%
유니슨(주 1
 
0.9%
한국앰코스페셜티카톤즈(유 1
 
0.9%
Other values (93) 93
87.7%
2023-12-13T04:00:47.194370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
98
 
12.7%
72
 
9.3%
65
 
8.4%
) 34
 
4.4%
( 34
 
4.4%
21
 
2.7%
18
 
2.3%
12
 
1.6%
10
 
1.3%
10
 
1.3%
Other values (157) 398
51.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 666
86.3%
Close Punctuation 34
 
4.4%
Open Punctuation 34
 
4.4%
Decimal Number 17
 
2.2%
Space Separator 12
 
1.6%
Uppercase Letter 8
 
1.0%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
98
 
14.7%
72
 
10.8%
65
 
9.8%
21
 
3.2%
18
 
2.7%
10
 
1.5%
10
 
1.5%
10
 
1.5%
9
 
1.4%
9
 
1.4%
Other values (143) 344
51.7%
Decimal Number
ValueCountFrequency (%)
1 5
29.4%
3 4
23.5%
2 3
17.6%
8 2
 
11.8%
9 2
 
11.8%
7 1
 
5.9%
Uppercase Letter
ValueCountFrequency (%)
Y 2
25.0%
K 2
25.0%
M 2
25.0%
S 2
25.0%
Close Punctuation
ValueCountFrequency (%)
) 34
100.0%
Open Punctuation
ValueCountFrequency (%)
( 34
100.0%
Space Separator
ValueCountFrequency (%)
12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 666
86.3%
Common 98
 
12.7%
Latin 8
 
1.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
98
 
14.7%
72
 
10.8%
65
 
9.8%
21
 
3.2%
18
 
2.7%
10
 
1.5%
10
 
1.5%
10
 
1.5%
9
 
1.4%
9
 
1.4%
Other values (143) 344
51.7%
Common
ValueCountFrequency (%)
) 34
34.7%
( 34
34.7%
12
 
12.2%
1 5
 
5.1%
3 4
 
4.1%
2 3
 
3.1%
8 2
 
2.0%
9 2
 
2.0%
7 1
 
1.0%
- 1
 
1.0%
Latin
ValueCountFrequency (%)
Y 2
25.0%
K 2
25.0%
M 2
25.0%
S 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 666
86.3%
ASCII 106
 
13.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
98
 
14.7%
72
 
10.8%
65
 
9.8%
21
 
3.2%
18
 
2.7%
10
 
1.5%
10
 
1.5%
10
 
1.5%
9
 
1.4%
9
 
1.4%
Other values (143) 344
51.7%
ASCII
ValueCountFrequency (%)
) 34
32.1%
( 34
32.1%
12
 
11.3%
1 5
 
4.7%
3 4
 
3.8%
2 3
 
2.8%
8 2
 
1.9%
9 2
 
1.9%
Y 2
 
1.9%
K 2
 
1.9%
Other values (4) 6
 
5.7%

소재지(지번)
Text

UNIQUE 

Distinct94
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size884.0 B
2023-12-13T04:00:47.853307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length29
Mean length21.946809
Min length17

Characters and Unicode

Total characters2063
Distinct characters93
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique94 ?
Unique (%)100.0%

Sample

1st row경상남도 사천시 용현면 금문리 49-10
2nd row경상남도 사천시 송포동 174-3
3rd row경상남도 사천시 대방동 162-1
4th row경상남도 사천시 벌리동 25-20
5th row경상남도 사천시 좌룡동 408-2
ValueCountFrequency (%)
경상남도 94
20.8%
사천시 94
20.8%
사남면 18
 
4.0%
사천읍 13
 
2.9%
축동면 9
 
2.0%
송포동 7
 
1.5%
곤양면 6
 
1.3%
유천리 6
 
1.3%
곤명면 6
 
1.3%
용현면 5
 
1.1%
Other values (152) 195
43.0%
2023-12-13T04:00:49.049447image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
453
22.0%
130
 
6.3%
117
 
5.7%
116
 
5.6%
95
 
4.6%
94
 
4.6%
94
 
4.6%
94
 
4.6%
1 78
 
3.8%
- 76
 
3.7%
Other values (83) 716
34.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1177
57.1%
Space Separator 453
 
22.0%
Decimal Number 357
 
17.3%
Dash Punctuation 76
 
3.7%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
130
11.0%
117
9.9%
116
9.9%
95
 
8.1%
94
 
8.0%
94
 
8.0%
94
 
8.0%
65
 
5.5%
50
 
4.2%
46
 
3.9%
Other values (71) 276
23.4%
Decimal Number
ValueCountFrequency (%)
1 78
21.8%
3 45
12.6%
2 39
10.9%
4 38
10.6%
6 31
 
8.7%
5 30
 
8.4%
8 27
 
7.6%
9 25
 
7.0%
7 23
 
6.4%
0 21
 
5.9%
Space Separator
ValueCountFrequency (%)
453
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 76
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1177
57.1%
Common 886
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
130
11.0%
117
9.9%
116
9.9%
95
 
8.1%
94
 
8.0%
94
 
8.0%
94
 
8.0%
65
 
5.5%
50
 
4.2%
46
 
3.9%
Other values (71) 276
23.4%
Common
ValueCountFrequency (%)
453
51.1%
1 78
 
8.8%
- 76
 
8.6%
3 45
 
5.1%
2 39
 
4.4%
4 38
 
4.3%
6 31
 
3.5%
5 30
 
3.4%
8 27
 
3.0%
9 25
 
2.8%
Other values (2) 44
 
5.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1177
57.1%
ASCII 886
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
453
51.1%
1 78
 
8.8%
- 76
 
8.6%
3 45
 
5.1%
2 39
 
4.4%
4 38
 
4.3%
6 31
 
3.5%
5 30
 
3.4%
8 27
 
3.0%
9 25
 
2.8%
Other values (2) 44
 
5.0%
Hangul
ValueCountFrequency (%)
130
11.0%
117
9.9%
116
9.9%
95
 
8.1%
94
 
8.0%
94
 
8.0%
94
 
8.0%
65
 
5.5%
50
 
4.2%
46
 
3.9%
Other values (71) 276
23.4%
Distinct93
Distinct (%)98.9%
Missing0
Missing (%)0.0%
Memory size884.0 B
Minimum1988-01-16 00:00:00
Maximum2022-06-27 00:00:00
2023-12-13T04:00:49.247107image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:00:49.448104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size884.0 B
<NA>
88 
휴업
 
6

Length

Max length4
Median length4
Mean length3.8723404
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row휴업
3rd row<NA>
4th row휴업
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 88
93.6%
휴업 6
 
6.4%

Length

2023-12-13T04:00:49.696946image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:00:49.871702image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 88
93.6%
휴업 6
 
6.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size884.0 B
Minimum2023-10-30 00:00:00
Maximum2023-10-30 00:00:00
2023-12-13T04:00:50.004744image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:00:50.154631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Interactions

2023-12-13T04:00:45.220783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T04:00:50.281112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호소재지(지번)완공일자
연번1.0001.0001.0000.932
상호1.0001.0001.0001.000
소재지(지번)1.0001.0001.0001.000
완공일자0.9321.0001.0001.000
2023-12-13T04:00:50.426652image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번비고
연번1.0001.000
비고1.0001.000

Missing values

2023-12-13T04:00:45.400961image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:00:45.543407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호소재지(지번)완공일자비고데이터기준일자
01용장군주유소경상남도 사천시 용현면 금문리 49-101992-10-26<NA>2023-10-30
12송포새한주유소경상남도 사천시 송포동 174-31995-05-26휴업2023-10-30
23동백제3주유소경상남도 사천시 대방동 162-11995-03-25<NA>2023-10-30
34믿음주유소경상남도 사천시 벌리동 25-201994-08-26휴업2023-10-30
45삼일주유소경상남도 사천시 좌룡동 408-21993-12-20<NA>2023-10-30
56남일주유소경상남도 사천시 향촌동 562-31994-06-16휴업2023-10-30
67노랑주유소경상남도 사천시 서포면 구랑리 811-10 노랑주유소1992-12-19<NA>2023-10-30
78용머리주유소경상남도 사천시 용현면 송지리 836-61994-10-15<NA>2023-10-30
89목화주유소경상남도 사천시 용현면 주문리 4-21991-04-16<NA>2023-10-30
910경서대로주유소경상남도 사천시 곤명면 신흥리 4-31993-08-17<NA>2023-10-30
연번상호소재지(지번)완공일자비고데이터기준일자
8485아주주유소경상남도 사천시 송포동 15-32004-02-10<NA>2023-10-30
8586SK포유주유소경상남도 사천시 선구동 314-22007-09-06<NA>2023-10-30
8687남척주유소경상남도 사천시 송포동 430-11990-05-26<NA>2023-10-30
8788사천농협주유소경상남도 사천시 사천읍 정의리 413-102006-12-04<NA>2023-10-30
8889삼천포수협늑도분급소경상남도 사천시 늑도동 29-12009-11-27<NA>2023-10-30
8990청양주유소경상남도 사천시 향촌동 996-81994-08-01<NA>2023-10-30
9091한국전기통신공사 사천전화국경상남도 사천시 사천읍 수석리 280-101990-07-27<NA>2023-10-30
9192사천신흥주유소경상남도 사천시 용현면 송지리 12622010-11-11<NA>2023-10-30
9293동금석유경상남도 사천시 향촌동 925-162005-06-07<NA>2023-10-30
9394남포주유소경상남도 사천시 이금동 99-12004-10-14<NA>2023-10-30