Overview

Dataset statistics

Number of variables3
Number of observations328
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory8.1 KiB
Average record size in memory25.4 B

Variable types

Numeric1
Text2

Dataset

Description경상북도 구미시 사업장폐기물(수시배출) 배출자 신고 업체 현황입니다. 일련의 공사로 5톤이상 폐기물이 발생하였을 때 신고하는
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15081168/fileData.do

Alerts

연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 19:48:06.122328
Analysis finished2023-12-12 19:48:06.658800
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct328
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean164.5
Minimum1
Maximum328
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.0 KiB
2023-12-13T04:48:06.739533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile17.35
Q182.75
median164.5
Q3246.25
95-th percentile311.65
Maximum328
Range327
Interquartile range (IQR)163.5

Descriptive statistics

Standard deviation94.829672
Coefficient of variation (CV)0.57647217
Kurtosis-1.2
Mean164.5
Median Absolute Deviation (MAD)82
Skewness0
Sum53956
Variance8992.6667
MonotonicityStrictly increasing
2023-12-13T04:48:06.905866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
227 1
 
0.3%
225 1
 
0.3%
224 1
 
0.3%
223 1
 
0.3%
222 1
 
0.3%
221 1
 
0.3%
220 1
 
0.3%
219 1
 
0.3%
218 1
 
0.3%
Other values (318) 318
97.0%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
328 1
0.3%
327 1
0.3%
326 1
0.3%
325 1
0.3%
324 1
0.3%
323 1
0.3%
322 1
0.3%
321 1
0.3%
320 1
0.3%
319 1
0.3%

상호
Text

Distinct327
Distinct (%)99.7%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-13T04:48:07.159846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length20
Mean length9.777439
Min length3

Characters and Unicode

Total characters3207
Distinct characters317
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique326 ?
Unique (%)99.4%

Sample

1st row(주)재세
2nd row(주)덕영글로벌
3rd row(주)디와이비
4th row신화정밀(주)
5th row해동레미콘(주)
ValueCountFrequency (%)
주식회사 10
 
2.5%
구미공장 8
 
2.0%
구미지점 5
 
1.3%
구미점 5
 
1.3%
유한회사 3
 
0.8%
주)원익큐엔씨 3
 
0.8%
주)코리아스타텍 2
 
0.5%
클라리오스델코 2
 
0.5%
주)이코니 2
 
0.5%
구미2공장 2
 
0.5%
Other values (344) 352
89.3%
2023-12-13T04:48:07.561461image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
276
 
8.6%
) 276
 
8.6%
( 275
 
8.6%
101
 
3.1%
93
 
2.9%
83
 
2.6%
76
 
2.4%
75
 
2.3%
70
 
2.2%
66
 
2.1%
Other values (307) 1816
56.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2487
77.5%
Close Punctuation 276
 
8.6%
Open Punctuation 275
 
8.6%
Space Separator 66
 
2.1%
Uppercase Letter 51
 
1.6%
Decimal Number 46
 
1.4%
Lowercase Letter 3
 
0.1%
Other Punctuation 2
 
0.1%
Connector Punctuation 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
276
 
11.1%
101
 
4.1%
93
 
3.7%
83
 
3.3%
76
 
3.1%
75
 
3.0%
70
 
2.8%
44
 
1.8%
41
 
1.6%
39
 
1.6%
Other values (277) 1589
63.9%
Uppercase Letter
ValueCountFrequency (%)
L 10
19.6%
S 8
15.7%
G 8
15.7%
K 6
11.8%
H 4
 
7.8%
I 2
 
3.9%
E 2
 
3.9%
R 2
 
3.9%
D 2
 
3.9%
T 2
 
3.9%
Other values (5) 5
9.8%
Decimal Number
ValueCountFrequency (%)
2 18
39.1%
1 11
23.9%
3 8
17.4%
4 5
 
10.9%
5 2
 
4.3%
8 1
 
2.2%
6 1
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
y 1
33.3%
e 1
33.3%
s 1
33.3%
Close Punctuation
ValueCountFrequency (%)
) 276
100.0%
Open Punctuation
ValueCountFrequency (%)
( 275
100.0%
Space Separator
ValueCountFrequency (%)
66
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2487
77.5%
Common 666
 
20.8%
Latin 54
 
1.7%

Most frequent character per script

Hangul
ValueCountFrequency (%)
276
 
11.1%
101
 
4.1%
93
 
3.7%
83
 
3.3%
76
 
3.1%
75
 
3.0%
70
 
2.8%
44
 
1.8%
41
 
1.6%
39
 
1.6%
Other values (277) 1589
63.9%
Latin
ValueCountFrequency (%)
L 10
18.5%
S 8
14.8%
G 8
14.8%
K 6
11.1%
H 4
 
7.4%
I 2
 
3.7%
E 2
 
3.7%
R 2
 
3.7%
D 2
 
3.7%
T 2
 
3.7%
Other values (8) 8
14.8%
Common
ValueCountFrequency (%)
) 276
41.4%
( 275
41.3%
66
 
9.9%
2 18
 
2.7%
1 11
 
1.7%
3 8
 
1.2%
4 5
 
0.8%
. 2
 
0.3%
5 2
 
0.3%
_ 1
 
0.2%
Other values (2) 2
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2487
77.5%
ASCII 720
 
22.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
276
 
11.1%
101
 
4.1%
93
 
3.7%
83
 
3.3%
76
 
3.1%
75
 
3.0%
70
 
2.8%
44
 
1.8%
41
 
1.6%
39
 
1.6%
Other values (277) 1589
63.9%
ASCII
ValueCountFrequency (%)
) 276
38.3%
( 275
38.2%
66
 
9.2%
2 18
 
2.5%
1 11
 
1.5%
L 10
 
1.4%
S 8
 
1.1%
3 8
 
1.1%
G 8
 
1.1%
K 6
 
0.8%
Other values (20) 34
 
4.7%
Distinct196
Distinct (%)59.8%
Missing0
Missing (%)0.0%
Memory size2.7 KiB
2023-12-13T04:48:07.877596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length3
Mean length3.7621951
Min length2

Characters and Unicode

Total characters1234
Distinct characters179
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique181 ?
Unique (%)55.2%

Sample

1st row이성열
2nd row권기철
3rd row이중호
4th row대표이사
5th row곽주영
ValueCountFrequency (%)
대표이사 112
32.9%
정금용 6
 
1.8%
이사장 4
 
1.2%
박치웅 3
 
0.9%
김재윤 2
 
0.6%
남광희 2
 
0.6%
백의열 2
 
0.6%
한창호 2
 
0.6%
김용창 2
 
0.6%
박용해 2
 
0.6%
Other values (196) 203
59.7%
2023-12-13T04:48:08.345911image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
147
 
11.9%
124
 
10.0%
115
 
9.3%
113
 
9.2%
41
 
3.3%
23
 
1.9%
20
 
1.6%
18
 
1.5%
18
 
1.5%
17
 
1.4%
Other values (169) 598
48.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1191
96.5%
Uppercase Letter 15
 
1.2%
Space Separator 12
 
1.0%
Connector Punctuation 11
 
0.9%
Close Punctuation 2
 
0.2%
Open Punctuation 2
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
147
 
12.3%
124
 
10.4%
115
 
9.7%
113
 
9.5%
41
 
3.4%
23
 
1.9%
20
 
1.7%
18
 
1.5%
18
 
1.5%
17
 
1.4%
Other values (154) 555
46.6%
Uppercase Letter
ValueCountFrequency (%)
N 3
20.0%
A 2
13.3%
J 2
13.3%
M 2
13.3%
U 1
 
6.7%
I 1
 
6.7%
G 1
 
6.7%
S 1
 
6.7%
E 1
 
6.7%
H 1
 
6.7%
Space Separator
ValueCountFrequency (%)
12
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1191
96.5%
Common 28
 
2.3%
Latin 15
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
147
 
12.3%
124
 
10.4%
115
 
9.7%
113
 
9.5%
41
 
3.4%
23
 
1.9%
20
 
1.7%
18
 
1.5%
18
 
1.5%
17
 
1.4%
Other values (154) 555
46.6%
Latin
ValueCountFrequency (%)
N 3
20.0%
A 2
13.3%
J 2
13.3%
M 2
13.3%
U 1
 
6.7%
I 1
 
6.7%
G 1
 
6.7%
S 1
 
6.7%
E 1
 
6.7%
H 1
 
6.7%
Common
ValueCountFrequency (%)
12
42.9%
_ 11
39.3%
) 2
 
7.1%
( 2
 
7.1%
. 1
 
3.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1191
96.5%
ASCII 43
 
3.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
147
 
12.3%
124
 
10.4%
115
 
9.7%
113
 
9.5%
41
 
3.4%
23
 
1.9%
20
 
1.7%
18
 
1.5%
18
 
1.5%
17
 
1.4%
Other values (154) 555
46.6%
ASCII
ValueCountFrequency (%)
12
27.9%
_ 11
25.6%
N 3
 
7.0%
) 2
 
4.7%
( 2
 
4.7%
A 2
 
4.7%
J 2
 
4.7%
M 2
 
4.7%
U 1
 
2.3%
I 1
 
2.3%
Other values (5) 5
11.6%

Interactions

2023-12-13T04:48:06.395417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-13T04:48:06.512770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:48:06.613434image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번상호대표자
01(주)재세이성열
12(주)덕영글로벌권기철
23(주)디와이비이중호
34신화정밀(주)대표이사
45해동레미콘(주)곽주영
56아주엠씨엠(주)대표이사
67(주)대양기업이화자
78(주)TSR 8공장이민혁_ 류한광
89태산테크(주)권영득
910(주)제욱이문자
연번상호대표자
318319한국닛다무아(주)이즈미아쯔시
319320(주)팜한농 구미공장대표이사
320321(주)엔피케이(1_2공장)최상건_최윤혁
321322구미칠곡축산업협동조합(축산물유통센터)조합장
322323(주)쌍마김무섭
323324엘에스전선(주) 구미공장명노현
324325엘지이노텍(주)2.3공장대표이사
325326엘지이노텍(주) 1공장대표이사
326327코오롱글로텍(주)구미공장최석순
327328코오롱인더스트리(주)구미공장대표이사