Overview

Dataset statistics

Number of variables3
Number of observations392
Missing cells1
Missing cells (%)0.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory9.7 KiB
Average record size in memory25.3 B

Variable types

Numeric1
Text2

Dataset

Description경상북도 구미시 사업장폐기물(수시배출) 배출자 신고 업체 현황에 대한 데이터로 상호 및 대표자 이름을 제공하고 있습니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15060265/fileData.do

Alerts

순번 has unique valuesUnique
상호 has unique valuesUnique

Reproduction

Analysis started2023-12-11 23:46:03.577863
Analysis finished2023-12-11 23:46:04.124934
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct392
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean196.5
Minimum1
Maximum392
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-12T08:46:04.225667image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile20.55
Q198.75
median196.5
Q3294.25
95-th percentile372.45
Maximum392
Range391
Interquartile range (IQR)195.5

Descriptive statistics

Standard deviation113.3049
Coefficient of variation (CV)0.57661526
Kurtosis-1.2
Mean196.5
Median Absolute Deviation (MAD)98
Skewness0
Sum77028
Variance12838
MonotonicityStrictly increasing
2023-12-12T08:46:04.376112image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.3%
271 1
 
0.3%
269 1
 
0.3%
268 1
 
0.3%
267 1
 
0.3%
266 1
 
0.3%
265 1
 
0.3%
264 1
 
0.3%
263 1
 
0.3%
262 1
 
0.3%
Other values (382) 382
97.4%
ValueCountFrequency (%)
1 1
0.3%
2 1
0.3%
3 1
0.3%
4 1
0.3%
5 1
0.3%
6 1
0.3%
7 1
0.3%
8 1
0.3%
9 1
0.3%
10 1
0.3%
ValueCountFrequency (%)
392 1
0.3%
391 1
0.3%
390 1
0.3%
389 1
0.3%
388 1
0.3%
387 1
0.3%
386 1
0.3%
385 1
0.3%
384 1
0.3%
383 1
0.3%

상호
Text

UNIQUE 

Distinct392
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size3.2 KiB
2023-12-12T08:46:04.640172image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length31
Median length20
Mean length9.3673469
Min length3

Characters and Unicode

Total characters3672
Distinct characters334
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique392 ?
Unique (%)100.0%

Sample

1st row(주)레몬 구미2사업장
2nd row(주)피플웍스
3rd row구미시설공단
4th row(주)윤금사 구미지점
5th row(주)원익 구미지점
ValueCountFrequency (%)
주식회사 10
 
2.2%
구미공장 10
 
2.2%
구미점 5
 
1.1%
4
 
0.9%
구미지점 4
 
0.9%
유한회사 3
 
0.6%
주)원익큐엔씨 3
 
0.6%
주)레몬 2
 
0.4%
ls전선 2
 
0.4%
주)에스에스유통 2
 
0.4%
Other values (412) 418
90.3%
2023-12-12T08:46:05.050755image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
312
 
8.5%
( 310
 
8.4%
) 310
 
8.4%
115
 
3.1%
104
 
2.8%
95
 
2.6%
87
 
2.4%
85
 
2.3%
78
 
2.1%
72
 
2.0%
Other values (324) 2104
57.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2876
78.3%
Open Punctuation 310
 
8.4%
Close Punctuation 310
 
8.4%
Space Separator 72
 
2.0%
Uppercase Letter 54
 
1.5%
Decimal Number 45
 
1.2%
Lowercase Letter 3
 
0.1%
Other Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
312
 
10.8%
115
 
4.0%
104
 
3.6%
95
 
3.3%
87
 
3.0%
85
 
3.0%
78
 
2.7%
51
 
1.8%
50
 
1.7%
47
 
1.6%
Other values (296) 1852
64.4%
Uppercase Letter
ValueCountFrequency (%)
L 10
18.5%
S 8
14.8%
G 8
14.8%
K 6
11.1%
H 4
 
7.4%
I 3
 
5.6%
C 3
 
5.6%
T 2
 
3.7%
E 2
 
3.7%
J 2
 
3.7%
Other values (5) 6
11.1%
Decimal Number
ValueCountFrequency (%)
2 20
44.4%
1 12
26.7%
3 6
 
13.3%
4 5
 
11.1%
5 1
 
2.2%
6 1
 
2.2%
Lowercase Letter
ValueCountFrequency (%)
y 1
33.3%
e 1
33.3%
s 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 310
100.0%
Close Punctuation
ValueCountFrequency (%)
) 310
100.0%
Space Separator
ValueCountFrequency (%)
72
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2876
78.3%
Common 739
 
20.1%
Latin 57
 
1.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
312
 
10.8%
115
 
4.0%
104
 
3.6%
95
 
3.3%
87
 
3.0%
85
 
3.0%
78
 
2.7%
51
 
1.8%
50
 
1.7%
47
 
1.6%
Other values (296) 1852
64.4%
Latin
ValueCountFrequency (%)
L 10
17.5%
S 8
14.0%
G 8
14.0%
K 6
10.5%
H 4
 
7.0%
I 3
 
5.3%
C 3
 
5.3%
T 2
 
3.5%
E 2
 
3.5%
J 2
 
3.5%
Other values (8) 9
15.8%
Common
ValueCountFrequency (%)
( 310
41.9%
) 310
41.9%
72
 
9.7%
2 20
 
2.7%
1 12
 
1.6%
3 6
 
0.8%
4 5
 
0.7%
. 2
 
0.3%
5 1
 
0.1%
6 1
 
0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2876
78.3%
ASCII 796
 
21.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
312
 
10.8%
115
 
4.0%
104
 
3.6%
95
 
3.3%
87
 
3.0%
85
 
3.0%
78
 
2.7%
51
 
1.8%
50
 
1.7%
47
 
1.6%
Other values (296) 1852
64.4%
ASCII
ValueCountFrequency (%)
( 310
38.9%
) 310
38.9%
72
 
9.0%
2 20
 
2.5%
1 12
 
1.5%
L 10
 
1.3%
S 8
 
1.0%
G 8
 
1.0%
3 6
 
0.8%
K 6
 
0.8%
Other values (18) 34
 
4.3%
Distinct254
Distinct (%)65.0%
Missing1
Missing (%)0.3%
Memory size3.2 KiB
2023-12-12T08:46:05.387835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length3
Mean length3.5294118
Min length2

Characters and Unicode

Total characters1380
Distinct characters190
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique237 ?
Unique (%)60.6%

Sample

1st row김효규
2nd row민경백
3rd row구미시설공단 이사장
4th row윤희성
5th row대표이사
ValueCountFrequency (%)
대표이사 116
29.2%
정금용 6
 
1.5%
이사장 4
 
1.0%
박치웅 3
 
0.8%
조복제 2
 
0.5%
한창호 2
 
0.5%
김영준 2
 
0.5%
김재윤 2
 
0.5%
김효규 2
 
0.5%
유병선 2
 
0.5%
Other values (249) 256
64.5%
2023-12-12T08:46:05.920903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
155
 
11.2%
126
 
9.1%
118
 
8.6%
116
 
8.4%
53
 
3.8%
28
 
2.0%
27
 
2.0%
27
 
2.0%
22
 
1.6%
20
 
1.4%
Other values (180) 688
49.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1354
98.1%
Uppercase Letter 15
 
1.1%
Space Separator 6
 
0.4%
Connector Punctuation 3
 
0.2%
Close Punctuation 1
 
0.1%
Open Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
155
 
11.4%
126
 
9.3%
118
 
8.7%
116
 
8.6%
53
 
3.9%
28
 
2.1%
27
 
2.0%
27
 
2.0%
22
 
1.6%
20
 
1.5%
Other values (166) 662
48.9%
Uppercase Letter
ValueCountFrequency (%)
N 3
20.0%
J 2
13.3%
M 2
13.3%
A 2
13.3%
H 1
 
6.7%
E 1
 
6.7%
I 1
 
6.7%
G 1
 
6.7%
U 1
 
6.7%
S 1
 
6.7%
Space Separator
ValueCountFrequency (%)
6
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1354
98.1%
Latin 15
 
1.1%
Common 11
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
155
 
11.4%
126
 
9.3%
118
 
8.7%
116
 
8.6%
53
 
3.9%
28
 
2.1%
27
 
2.0%
27
 
2.0%
22
 
1.6%
20
 
1.5%
Other values (166) 662
48.9%
Latin
ValueCountFrequency (%)
N 3
20.0%
J 2
13.3%
M 2
13.3%
A 2
13.3%
H 1
 
6.7%
E 1
 
6.7%
I 1
 
6.7%
G 1
 
6.7%
U 1
 
6.7%
S 1
 
6.7%
Common
ValueCountFrequency (%)
6
54.5%
_ 3
27.3%
) 1
 
9.1%
( 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1354
98.1%
ASCII 26
 
1.9%

Most frequent character per block

Hangul
ValueCountFrequency (%)
155
 
11.4%
126
 
9.3%
118
 
8.7%
116
 
8.6%
53
 
3.9%
28
 
2.1%
27
 
2.0%
27
 
2.0%
22
 
1.6%
20
 
1.5%
Other values (166) 662
48.9%
ASCII
ValueCountFrequency (%)
6
23.1%
N 3
11.5%
_ 3
11.5%
J 2
 
7.7%
M 2
 
7.7%
A 2
 
7.7%
H 1
 
3.8%
) 1
 
3.8%
( 1
 
3.8%
E 1
 
3.8%
Other values (4) 4
15.4%

Interactions

2023-12-12T08:46:03.849219image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Missing values

2023-12-12T08:46:03.999804image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T08:46:04.086607image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번상호대표자
01(주)레몬 구미2사업장김효규
12(주)피플웍스민경백
23구미시설공단구미시설공단 이사장
34(주)윤금사 구미지점윤희성
45(주)원익 구미지점대표이사
56(주)대경테크김정우
67금영통상김기수
78피앤와이환경산업박서윤
89(주)무일화성 구미공장정상진
910(학)순천향대학교 부속 구미병원김성구
순번상호대표자
382383넥스텍스(주)백보현
383384(주)팜한농 구미공장대표이사
384385구미칠곡축산업협동조합(축산물유통센터)조합장
385386(주)쌍마김무섭
386387(주)하이닉스반도체박종섭
387388엘에스전선(주) 구미공장명노현
388389엘지이노텍(주)2.3공장대표이사
389390엘지이노텍(주) 1공장대표이사
390391코오롱글로텍(주)구미공장최석순
391392코오롱인더스트리(주)구미공장대표이사