Overview

Dataset statistics

Number of variables5
Number of observations98
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.1 KiB
Average record size in memory42.3 B

Variable types

Numeric1
Text2
DateTime1
Categorical1

Dataset

Description경상남도 사천시의 특정토양 오염관리 대상에 관한 현황 정보(상호, 소재지, 완공일자 등)를 공공데이터로 제공합니다.
Author경상남도 사천시
URLhttps://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15005200

Alerts

연번 is highly overall correlated with 비고High correlation
비고 is highly overall correlated with 연번High correlation
비고 is highly imbalanced (70.9%)Imbalance
연번 has unique valuesUnique
상호 has unique valuesUnique
소재지(지번) has unique valuesUnique

Reproduction

Analysis started2023-12-11 00:57:27.979440
Analysis finished2023-12-11 00:57:28.688919
Duration0.71 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct98
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean49.5
Minimum1
Maximum98
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1014.0 B
2023-12-11T09:57:28.789394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.85
Q125.25
median49.5
Q373.75
95-th percentile93.15
Maximum98
Range97
Interquartile range (IQR)48.5

Descriptive statistics

Standard deviation28.434134
Coefficient of variation (CV)0.57442696
Kurtosis-1.2
Mean49.5
Median Absolute Deviation (MAD)24.5
Skewness0
Sum4851
Variance808.5
MonotonicityStrictly increasing
2023-12-11T09:57:28.960345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
75 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
67 1
 
1.0%
66 1
 
1.0%
Other values (88) 88
89.8%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%
90 1
1.0%
89 1
1.0%

상호
Text

UNIQUE 

Distinct98
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-11T09:57:29.192838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length16
Mean length7.244898
Min length4

Characters and Unicode

Total characters710
Distinct characters168
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)100.0%

Sample

1st row청양주유소
2nd row용장군주유소
3rd row송포새한주유소
4th row동백제3주유소
5th row현대주유소
ValueCountFrequency (%)
청양주유소 1
 
0.9%
용장군주유소 1
 
0.9%
sk포유주유소 1
 
0.9%
주)하나주유소 1
 
0.9%
용현농협주유소 1
 
0.9%
서포고속주유소 1
 
0.9%
윤창주유소 1
 
0.9%
사천농협주유소 1
 
0.9%
사남농협주유소 1
 
0.9%
합동주유소 1
 
0.9%
Other values (96) 96
90.6%
2023-12-11T09:57:29.563052image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
86
 
12.1%
76
 
10.7%
71
 
10.0%
( 21
 
3.0%
) 21
 
3.0%
17
 
2.4%
15
 
2.1%
11
 
1.5%
11
 
1.5%
10
 
1.4%
Other values (158) 371
52.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 622
87.6%
Open Punctuation 21
 
3.0%
Close Punctuation 21
 
3.0%
Decimal Number 19
 
2.7%
Uppercase Letter 10
 
1.4%
Other Symbol 9
 
1.3%
Space Separator 8
 
1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
86
 
13.8%
76
 
12.2%
71
 
11.4%
17
 
2.7%
15
 
2.4%
11
 
1.8%
11
 
1.8%
10
 
1.6%
9
 
1.4%
8
 
1.3%
Other values (143) 308
49.5%
Decimal Number
ValueCountFrequency (%)
1 6
31.6%
3 4
21.1%
2 3
15.8%
8 3
15.8%
9 2
 
10.5%
6 1
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
K 3
30.0%
M 2
20.0%
Y 2
20.0%
S 2
20.0%
B 1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Other Symbol
ValueCountFrequency (%)
9
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 631
88.9%
Common 69
 
9.7%
Latin 10
 
1.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
86
 
13.6%
76
 
12.0%
71
 
11.3%
17
 
2.7%
15
 
2.4%
11
 
1.7%
11
 
1.7%
10
 
1.6%
9
 
1.4%
9
 
1.4%
Other values (144) 316
50.1%
Common
ValueCountFrequency (%)
( 21
30.4%
) 21
30.4%
8
 
11.6%
1 6
 
8.7%
3 4
 
5.8%
2 3
 
4.3%
8 3
 
4.3%
9 2
 
2.9%
6 1
 
1.4%
Latin
ValueCountFrequency (%)
K 3
30.0%
M 2
20.0%
Y 2
20.0%
S 2
20.0%
B 1
 
10.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 622
87.6%
ASCII 79
 
11.1%
None 9
 
1.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
86
 
13.8%
76
 
12.2%
71
 
11.4%
17
 
2.7%
15
 
2.4%
11
 
1.8%
11
 
1.8%
10
 
1.6%
9
 
1.4%
8
 
1.3%
Other values (143) 308
49.5%
ASCII
ValueCountFrequency (%)
( 21
26.6%
) 21
26.6%
8
 
10.1%
1 6
 
7.6%
3 4
 
5.1%
2 3
 
3.8%
K 3
 
3.8%
8 3
 
3.8%
M 2
 
2.5%
Y 2
 
2.5%
Other values (4) 6
 
7.6%
None
ValueCountFrequency (%)
9
100.0%

소재지(지번)
Text

UNIQUE 

Distinct98
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
2023-12-11T09:57:29.878711image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length25
Mean length22.132653
Min length14

Characters and Unicode

Total characters2169
Distinct characters92
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique98 ?
Unique (%)100.0%

Sample

1st row경상남도 사천시 향촌동 996-8번지
2nd row경상남도 사천시 용현면 금문리 49-10번지
3rd row경상남도 사천시 송포동 174-3
4th row경상남도 사천시 대방동 162-1번지
5th row경상남도 사천시 곤명면 송림리 33-1번지
ValueCountFrequency (%)
사천시 97
21.2%
경상남도 92
20.1%
사남면 16
 
3.5%
사천읍 14
 
3.1%
축동면 9
 
2.0%
곤명면 8
 
1.8%
곤양면 7
 
1.5%
송포동 7
 
1.5%
용현면 5
 
1.1%
사주리 5
 
1.1%
Other values (154) 197
43.1%
2023-12-11T09:57:30.399977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
362
16.7%
133
 
6.1%
119
 
5.5%
112
 
5.2%
102
 
4.7%
97
 
4.5%
93
 
4.3%
93
 
4.3%
92
 
4.2%
92
 
4.2%
Other values (82) 874
40.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1347
62.1%
Decimal Number 375
 
17.3%
Space Separator 362
 
16.7%
Dash Punctuation 83
 
3.8%
Other Punctuation 1
 
< 0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
133
 
9.9%
119
 
8.8%
112
 
8.3%
102
 
7.6%
97
 
7.2%
93
 
6.9%
93
 
6.9%
92
 
6.8%
92
 
6.8%
65
 
4.8%
Other values (68) 349
25.9%
Decimal Number
ValueCountFrequency (%)
1 85
22.7%
2 47
12.5%
3 45
12.0%
4 41
10.9%
6 34
 
9.1%
8 31
 
8.3%
5 28
 
7.5%
9 23
 
6.1%
7 23
 
6.1%
0 18
 
4.8%
Space Separator
ValueCountFrequency (%)
362
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 83
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1347
62.1%
Common 821
37.9%
Latin 1
 
< 0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
133
 
9.9%
119
 
8.8%
112
 
8.3%
102
 
7.6%
97
 
7.2%
93
 
6.9%
93
 
6.9%
92
 
6.8%
92
 
6.8%
65
 
4.8%
Other values (68) 349
25.9%
Common
ValueCountFrequency (%)
362
44.1%
1 85
 
10.4%
- 83
 
10.1%
2 47
 
5.7%
3 45
 
5.5%
4 41
 
5.0%
6 34
 
4.1%
8 31
 
3.8%
5 28
 
3.4%
9 23
 
2.8%
Other values (3) 42
 
5.1%
Latin
ValueCountFrequency (%)
B 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1347
62.1%
ASCII 822
37.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
362
44.0%
1 85
 
10.3%
- 83
 
10.1%
2 47
 
5.7%
3 45
 
5.5%
4 41
 
5.0%
6 34
 
4.1%
8 31
 
3.8%
5 28
 
3.4%
9 23
 
2.8%
Other values (4) 43
 
5.2%
Hangul
ValueCountFrequency (%)
133
 
9.9%
119
 
8.8%
112
 
8.3%
102
 
7.6%
97
 
7.2%
93
 
6.9%
93
 
6.9%
92
 
6.8%
92
 
6.8%
65
 
4.8%
Other values (68) 349
25.9%
Distinct97
Distinct (%)99.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
Minimum1988-01-16 00:00:00
Maximum2018-12-26 00:00:00
2023-12-11T09:57:30.573025image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-11T09:57:30.721697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

비고
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size916.0 B
<NA>
93 
휴업
 
5

Length

Max length4
Median length4
Mean length3.8979592
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row<NA>
2nd row<NA>
3rd row<NA>
4th row<NA>
5th row휴업

Common Values

ValueCountFrequency (%)
<NA> 93
94.9%
휴업 5
 
5.1%

Length

2023-12-11T09:57:30.873764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-11T09:57:31.014080image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
na 93
94.9%
휴업 5
 
5.1%

Interactions

2023-12-11T09:57:28.356554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-11T09:57:31.087298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번상호소재지(지번)완공일자
연번1.0001.0001.0001.000
상호1.0001.0001.0001.000
소재지(지번)1.0001.0001.0001.000
완공일자1.0001.0001.0001.000
2023-12-11T09:57:31.189264image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번비고
연번1.0001.000
비고1.0001.000

Missing values

2023-12-11T09:57:28.477123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-11T09:57:28.623076image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호소재지(지번)완공일자비고
01청양주유소경상남도 사천시 향촌동 996-8번지1994-08-01<NA>
12용장군주유소경상남도 사천시 용현면 금문리 49-10번지1992-10-26<NA>
23송포새한주유소경상남도 사천시 송포동 174-31995-05-26<NA>
34동백제3주유소경상남도 사천시 대방동 162-1번지1995-03-25<NA>
45현대주유소경상남도 사천시 곤명면 송림리 33-1번지1994-11-18휴업
56믿음주유소경상남도 사천시 벌리동 25-20번지1994-08-26휴업
67남척석유(주)경상남도 사천시 서동 178-8번지1996-10-16<NA>
78삼일주유소경상남도 사천시 좌룡동 408-2번지1993-12-20<NA>
89남척주유소경상남도 사천시 송포동 430-1번지1990-05-26<NA>
910남일주유소경상남도 사천시 향촌동 562-3번지1994-06-16<NA>
연번상호소재지(지번)완공일자비고
8889월성주유소경상남도 사천시 사남면 월성리 15-1번지2011-01-07<NA>
8990㈜KB손해보험 인재니움 사천경상남도 사천시 곤양면대진리 산 78-1번지2011-05-06<NA>
9091육군제8611부대경상남도 사천시 곤양면 서정리 1149번지2015-11-26<NA>
9192현대모비스㈜진주부품사업소경상남도 축동면 인절미고갯길 2-212017-03-06<NA>
9293브리티쉬아메리칸토바코코리아㈜경상남도 사천시 사남면 유천리 889번지2017-02-10<NA>
9394(주)코텍 사천공장사천시 사남면 방지리 669-5번지외2018-07-11<NA>
9495지에이산업㈜사천시 사남면 방지리 736번지2018-08-16<NA>
9596에스앤케이항공㈜사천시 사남면 유천리 901, 904번지2018-08-17<NA>
9697SK네트웍스(주)사천역사주유소사천시 사천읍 사천대로 18462018-11-29<NA>
9798한국항공서비스㈜사천시 사천읍 항공로 642018-12-26<NA>