Overview

Dataset statistics

Number of variables: 4
Number of observations: 119
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 4.0 KiB
Average record size in memory: 34.1 B

Variable types

Numeric: 1
Text: 2
Categorical: 1

Dataset

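Description: Status data on noise- and vibration-emitting facilities in Sacheon-si, Gyeongsangnam-do (serial number, business name, lot-number address of the location); 120 facilities as of June 1, 2023.
Author: Sacheon-si, Gyeongsangnam-do (경상남도 사천시)
URL: https://www.data.go.kr/data/15114256/fileData.do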

Alerts

데이터기준일자 (data reference date) has constant value "2023-10-27" (Constant)
연번 (serial number) has unique values (Unique)
사업장명 (business name) has unique values (Unique)
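
The two alert types above can be illustrated with a minimal sketch. It is not ydata-profiling's own implementation; the helper name `column_alerts` is hypothetical, and the three records are taken from the Sample section of this report.

```python
def column_alerts(rows):
    """Return {column: alert} for columns that are constant or all-unique.

    Hypothetical helper for illustration; not part of ydata-profiling.
    """
    alerts = {}
    if not rows:
        return alerts
    for column in rows[0]:
        values = [row[column] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            # Every row holds the same value.
            alerts[column] = "Constant"
        elif distinct == len(values):
            # No value is repeated.
            alerts[column] = "Unique"
    return alerts


# First three records from the Sample section of this report.
rows = [
    {"연번": 1, "사업장명": "(주)신성산업", "데이터기준일자": "2023-10-27"},
    {"연번": 2, "사업장명": "유아이 주식회사", "데이터기준일자": "2023-10-27"},
    {"연번": 3, "사업장명": "에이원제경", "데이터기준일자": "2023-10-27"},
]
print(column_alerts(rows))
```

On the full dataset the address column would receive neither alert, since it has 116 distinct values in 119 rows.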

Reproduction

Analysis started: 2023-12-13 00:59:49.459591
Analysis finished: 2023-12-13 00:59:49.831515
Duration: 0.37 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번 (serial number)
Real number (ℝ)

UNIQUE 

Distinct: 119
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 60
Minimum: 1
Maximum: 119
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.2 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.9
Q1: 30.5
Median: 60
Q3: 89.5
95-th percentile: 113.1
Maximum: 119
Range: 118
Interquartile range (IQR): 59
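
The quantile figures above can be reproduced with the standard library, assuming the 연번 column holds exactly the integers 1 to 119 (an assumption consistent with the reported minimum, maximum, distinct count, and strictly increasing order). The sketch uses linear interpolation between data points (`method="inclusive"`); it is not the profiler's own code.

```python
import statistics

# Assumption: 연번 is the sequence 1, 2, ..., 119.
serial_numbers = list(range(1, 120))

# Quartiles with linear interpolation between neighbouring values.
q1, median, q3 = statistics.quantiles(serial_numbers, n=4, method="inclusive")

# Cutting into 20 equal groups gives the 5th and 95th percentiles
# as the first and last cut points.
twentieths = statistics.quantiles(serial_numbers, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(serial_numbers) - min(serial_numbers)

print(p5, q1, median, q3, p95, value_range, iqr)
```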

Descriptive statistics

Standard deviation: 34.496377
Coefficient of variation (CV): 0.57493961
Kurtosis: -1.2
Mean: 60
Median Absolute Deviation (MAD): 30
Skewness: 0
Sum: 7140
Variance: 1190
Monotonicity: Strictly increasing
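
Several of these figures can be checked by hand under the same assumption that 연번 is the sequence 1 to 119. The sketch below uses the sample (n - 1) variance, which matches the reported value of 1190; kurtosis and skewness are left out because their estimators vary between libraries.

```python
import statistics

# Assumption: 연번 is the sequence 1, 2, ..., 119.
serial_numbers = list(range(1, 120))

total = sum(serial_numbers)                      # 119 * 120 / 2
mean = statistics.mean(serial_numbers)
variance = statistics.variance(serial_numbers)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(serial_numbers)       # square root of the sample variance
cv = std_dev / mean                              # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
median = statistics.median(serial_numbers)
mad = statistics.median(abs(x - median) for x in serial_numbers)

print(total, mean, variance, round(std_dev, 6), round(cv, 8), mad)
```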
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 0.8%
2 | 1 | 0.8%
89 | 1 | 0.8%
88 | 1 | 0.8%
87 | 1 | 0.8%
86 | 1 | 0.8%
85 | 1 | 0.8%
84 | 1 | 0.8%
83 | 1 | 0.8%
82 | 1 | 0.8%
Other values (109) | 109 | 91.6%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 0.8%
2 | 1 | 0.8%
3 | 1 | 0.8%
4 | 1 | 0.8%
5 | 1 | 0.8%
6 | 1 | 0.8%
7 | 1 | 0.8%
8 | 1 | 0.8%
9 | 1 | 0.8%
10 | 1 | 0.8%

Maximum 10 values

Value | Count | Frequency (%)
119 | 1 | 0.8%
118 | 1 | 0.8%
117 | 1 | 0.8%
116 | 1 | 0.8%
115 | 1 | 0.8%
114 | 1 | 0.8%
113 | 1 | 0.8%
112 | 1 | 0.8%
111 | 1 | 0.8%
110 | 1 | 0.8%

사업장명 (business name)
Text

UNIQUE 

Distinct: 119
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 20
Median length: 13
Mean length: 6.8823529
Min length: 2

Characters and Unicode

Total characters: 819
Distinct characters: 163
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
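
As a minimal sketch of how such character properties can be read, the snippet below counts Unicode general categories with Python's `unicodedata` module over the five sample business names listed in this report. The helper name `category_counts` is hypothetical, and the profiler's own counting may differ in detail. The category codes map to the labels used in the tables: `Lo` is Other Letter, `Ps` Open Punctuation, `Pe` Close Punctuation, and `Zs` Space Separator.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in values."""
    return Counter(
        unicodedata.category(ch) for value in values for ch in value
    )


# The five sample rows of 사업장명 shown in this report.
sample_names = [
    "(주)신성산업",
    "유아이 주식회사",
    "에이원제경",
    "다자연영농조합법인",
    "삼호기업",
]

counts = category_counts(sample_names)
print(counts)
```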

Unique

Unique: 119
Unique (%): 100.0%

Sample

1st row: (주)신성산업
2nd row: 유아이 주식회사
3rd row: 에이원제경
4th row: 다자연영농조합법인
5th row: 삼호기업

Value | Count | Frequency (%)
주식회사 | 9 | 6.6%
주)부성 | 2 | 1.5%
제2공장 | 2 | 1.5%
두원중공업(주 | 2 | 1.5%
주)신성산업 | 1 | 0.7%
금양수산(주 | 1 | 0.7%
영진물산 | 1 | 0.7%
규장각 | 1 | 0.7%
주)에이치엔에프 | 1 | 0.7%
주)에이치에스씨푸드 | 1 | 0.7%
Other values (115) | 115 | 84.6%

Most occurring characters

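Value | Count | Frequency (%)
(character lost) | 69 | 8.4%
( | 60 | 7.3%
) | 60 | 7.3%
(character lost) | 35 | 4.3%
(character lost) | 30 | 3.7%
(character lost) | 20 | 2.4%
(character lost) | 18 | 2.2%
(character lost) | 17 | 2.1%
(character lost) | 17 | 2.1%
(character lost) | 16 | 2.0%
Other values (153) | 477 | 58.2%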

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 674 | 82.3%
Open Punctuation | 60 | 7.3%
Close Punctuation | 60 | 7.3%
Space Separator | 17 | 2.1%
Decimal Number | 6 | 0.7%
Other Punctuation | 2 | 0.2%

Most frequent character per category

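Other Letter

Value | Count | Frequency (%)
(character lost) | 69 | 10.2%
(character lost) | 35 | 5.2%
(character lost) | 30 | 4.5%
(character lost) | 20 | 3.0%
(character lost) | 18 | 2.7%
(character lost) | 17 | 2.5%
(character lost) | 16 | 2.4%
(character lost) | 15 | 2.2%
(character lost) | 13 | 1.9%
(character lost) | 12 | 1.8%
Other values (146) | 429 | 63.6%

Decimal Number

Value | Count | Frequency (%)
2 | 3 | 50.0%
1 | 2 | 33.3%
3 | 1 | 16.7%

Open Punctuation

Value | Count | Frequency (%)
( | 60 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 60 | 100.0%

Space Separator

Value | Count | Frequency (%)
(space) | 17 | 100.0%

Other Punctuation

Value | Count | Frequency (%)
. | 2 | 100.0%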

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 674 | 82.3%
Common | 145 | 17.7%

Most frequent character per script

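Hangul

Value | Count | Frequency (%)
(character lost) | 69 | 10.2%
(character lost) | 35 | 5.2%
(character lost) | 30 | 4.5%
(character lost) | 20 | 3.0%
(character lost) | 18 | 2.7%
(character lost) | 17 | 2.5%
(character lost) | 16 | 2.4%
(character lost) | 15 | 2.2%
(character lost) | 13 | 1.9%
(character lost) | 12 | 1.8%
Other values (146) | 429 | 63.6%

Common

Value | Count | Frequency (%)
( | 60 | 41.4%
) | 60 | 41.4%
(space) | 17 | 11.7%
2 | 3 | 2.1%
1 | 2 | 1.4%
. | 2 | 1.4%
3 | 1 | 0.7%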

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 674 | 82.3%
ASCII | 145 | 17.7%

Most frequent character per block

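Hangul

Value | Count | Frequency (%)
(character lost) | 69 | 10.2%
(character lost) | 35 | 5.2%
(character lost) | 30 | 4.5%
(character lost) | 20 | 3.0%
(character lost) | 18 | 2.7%
(character lost) | 17 | 2.5%
(character lost) | 16 | 2.4%
(character lost) | 15 | 2.2%
(character lost) | 13 | 1.9%
(character lost) | 12 | 1.8%
Other values (146) | 429 | 63.6%

ASCII

Value | Count | Frequency (%)
( | 60 | 41.4%
) | 60 | 41.4%
(space) | 17 | 11.7%
2 | 3 | 2.1%
1 | 2 | 1.4%
. | 2 | 1.4%
3 | 1 | 0.7%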

소재지(지번) (location, lot-number address)
Text

Distinct: 116
Distinct (%): 97.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 43
Median length: 29
Mean length: 22.831933
Min length: 16

Characters and Unicode

Total characters: 2717
Distinct characters: 109
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 113
Unique (%): 95.0%
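
The gap between the distinct count (116) and the unique count (113) follows from how the two are defined: distinct counts every different value once, while unique counts only values that occur exactly once. With 119 rows this is consistent with three addresses each shared by two rows (113 + 3 = 116 distinct values, 113 + 2 × 3 = 119 rows). The sketch below shows the two counts on synthetic placeholder values; the helper name and the `addr-…` strings are hypothetical, not data from the file.

```python
from collections import Counter


def distinct_and_unique(values):
    """Return (number of distinct values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique


# Synthetic stand-in: 113 values that occur once, plus 3 values that occur twice.
values = [f"addr-{i}" for i in range(113)]
values += ["shared-a", "shared-b", "shared-c"] * 2

print(len(values), distinct_and_unique(values))
```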

Sample

1st row: 경상남도 사천시 축동면 구호리 8-6
2nd row: 경상남도 사천시 용현면 신복리 678-7
3rd row: 경상남도 사천시 사천읍 두량리 1278-18
4th row: 경상남도 사천시 곤명면 금성리 1096 다자연영농조합법인
5th row: 경상남도 사천시 사천읍 장전리 317

Value | Count | Frequency (%)
경상남도 | 119 | 19.6%
사천시 | 119 | 19.6%
축동면 | 42 | 6.9%
탑리 | 12 | 2.0%
사천읍 | 12 | 2.0%
대방동 | 11 | 1.8%
용현면 | 11 | 1.8%
곤양면 | 10 | 1.6%
구호리 | 10 | 1.6%
(token lost) | 8 | 1.3%
Other values (183) | 254 | 41.8%
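
A word-frequency table of this kind can be approximated by splitting each address on whitespace and counting the tokens. The sketch below does this for the five sample addresses listed above; it is a simplification, and the profiler's actual tokenizer may treat punctuation differently.

```python
from collections import Counter

# The five sample rows of 소재지(지번) shown in this report.
sample_addresses = [
    "경상남도 사천시 축동면 구호리 8-6",
    "경상남도 사천시 용현면 신복리 678-7",
    "경상남도 사천시 사천읍 두량리 1278-18",
    "경상남도 사천시 곤명면 금성리 1096 다자연영농조합법인",
    "경상남도 사천시 사천읍 장전리 317",
]

# Split on whitespace and count every token across all addresses.
token_counts = Counter(
    token for address in sample_addresses for token in address.split()
)

for token, count in token_counts.most_common(3):
    print(token, count)
```

Because every address begins with the province and city, 경상남도 and 사천시 each appear once per row, which is why both show a count of 119 in the full table.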

Most occurring characters

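Value | Count | Frequency (%)
(space) | 612 | 22.5%
(character lost) | 150 | 5.5%
(character lost) | 134 | 4.9%
(character lost) | 129 | 4.7%
(character lost) | 120 | 4.4%
(character lost) | 119 | 4.4%
(character lost) | 119 | 4.4%
(character lost) | 119 | 4.4%
1 | 105 | 3.9%
(character lost) | 92 | 3.4%
Other values (99) | 1018 | 37.5%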

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1537 | 56.6%
Space Separator | 612 | 22.5%
Decimal Number | 473 | 17.4%
Dash Punctuation | 89 | 3.3%
Close Punctuation | 3 | 0.1%
Open Punctuation | 3 | 0.1%

Most frequent character per category

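Other Letter

Value | Count | Frequency (%)
(character lost) | 150 | 9.8%
(character lost) | 134 | 8.7%
(character lost) | 129 | 8.4%
(character lost) | 120 | 7.8%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 92 | 6.0%
(character lost) | 81 | 5.3%
(character lost) | 74 | 4.8%
Other values (85) | 400 | 26.0%

Decimal Number

Value | Count | Frequency (%)
1 | 105 | 22.2%
3 | 66 | 14.0%
5 | 48 | 10.1%
2 | 45 | 9.5%
7 | 44 | 9.3%
4 | 41 | 8.7%
6 | 39 | 8.2%
8 | 34 | 7.2%
9 | 30 | 6.3%
0 | 21 | 4.4%

Space Separator

Value | Count | Frequency (%)
(space) | 612 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 89 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 3 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 3 | 100.0%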

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1537 | 56.6%
Common | 1180 | 43.4%

Most frequent character per script

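Hangul

Value | Count | Frequency (%)
(character lost) | 150 | 9.8%
(character lost) | 134 | 8.7%
(character lost) | 129 | 8.4%
(character lost) | 120 | 7.8%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 92 | 6.0%
(character lost) | 81 | 5.3%
(character lost) | 74 | 4.8%
Other values (85) | 400 | 26.0%

Common

Value | Count | Frequency (%)
(space) | 612 | 51.9%
1 | 105 | 8.9%
- | 89 | 7.5%
3 | 66 | 5.6%
5 | 48 | 4.1%
2 | 45 | 3.8%
7 | 44 | 3.7%
4 | 41 | 3.5%
6 | 39 | 3.3%
8 | 34 | 2.9%
Other values (4) | 57 | 4.8%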

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1537 | 56.6%
ASCII | 1180 | 43.4%

Most frequent character per block

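ASCII

Value | Count | Frequency (%)
(space) | 612 | 51.9%
1 | 105 | 8.9%
- | 89 | 7.5%
3 | 66 | 5.6%
5 | 48 | 4.1%
2 | 45 | 3.8%
7 | 44 | 3.7%
4 | 41 | 3.5%
6 | 39 | 3.3%
8 | 34 | 2.9%
Other values (4) | 57 | 4.8%

Hangul

Value | Count | Frequency (%)
(character lost) | 150 | 9.8%
(character lost) | 134 | 8.7%
(character lost) | 129 | 8.4%
(character lost) | 120 | 7.8%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 119 | 7.7%
(character lost) | 92 | 6.0%
(character lost) | 81 | 5.3%
(character lost) | 74 | 4.8%
Other values (85) | 400 | 26.0%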

데이터기준일자 (data reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
2023-10-27: 119

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2023-10-27
2nd row: 2023-10-27
3rd row: 2023-10-27
4th row: 2023-10-27
5th row: 2023-10-27

Common Values

Value | Count | Frequency (%)
2023-10-27 | 119 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2023-10-27 | 119 | 100.0%

Interactions


Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

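First rows

(index) | 연번 (serial number) | 사업장명 (business name) | 소재지(지번) (location, lot-number address) | 데이터기준일자 (data reference date)
0 | 1 | (주)신성산업 | 경상남도 사천시 축동면 구호리 8-6 | 2023-10-27
1 | 2 | 유아이 주식회사 | 경상남도 사천시 용현면 신복리 678-7 | 2023-10-27
2 | 3 | 에이원제경 | 경상남도 사천시 사천읍 두량리 1278-18 | 2023-10-27
3 | 4 | 다자연영농조합법인 | 경상남도 사천시 곤명면 금성리 1096 다자연영농조합법인 | 2023-10-27
4 | 5 | 삼호기업 | 경상남도 사천시 사천읍 장전리 317 | 2023-10-27
5 | 6 | (주)우리에너지 | 경상남도 사천시 곤양면 대진리 16-13 | 2023-10-27
6 | 7 | 풍융 주식회사 | 경상남도 사천시 곤양면 흥사리 83-8 | 2023-10-27
7 | 8 | 광남금속 | 경상남도 사천시 축동면 탑리 1 | 2023-10-27
8 | 9 | 동주 | 경상남도 사천시 축동면 구호리 8-2 | 2023-10-27
9 | 10 | 에이치제이케이구조 | 경상남도 사천시 축동면 가산리 741-1 | 2023-10-27

Last rows

(index) | 연번 (serial number) | 사업장명 (business name) | 소재지(지번) (location, lot-number address) | 데이터기준일자 (data reference date)
109 | 110 | 동림제빙냉동 | 경상남도 사천시 서금동 117-1 | 2023-10-27
110 | 111 | 성창수산(주) | 경상남도 사천시 서금동 126 | 2023-10-27
111 | 112 | 삼경식품 | 경상남도 사천시 대방동 740-2 | 2023-10-27
112 | 113 | 삼홍물산 | 경상남도 사천시 이금동 123-4 | 2023-10-27
113 | 114 | 나인산업(주) | 경상남도 사천시 용현면 덕곡리 397-7 외2필지(390-16 397-11) | 2023-10-27
114 | 115 | (주)동림1공장 | 경상남도 사천시 대방동 764-5 | 2023-10-27
115 | 116 | 대창물산 | 경상남도 사천시 대방동 764 | 2023-10-27
116 | 117 | 선경산업(주) | 경상남도 사천시 용현면 신복리 486 | 2023-10-27
117 | 118 | 성일산업 | 경상남도 사천시 향촌동 505-4 | 2023-10-27
118 | 119 | 제일제빙냉동(주) | 경상남도 사천시 서동 346-33 | 2023-10-27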