Overview

Dataset statistics

Number of variables6
Number of observations33
Missing cells22
Missing cells (%)11.1%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.7 KiB
Average record size in memory53.0 B

Variable types

Numeric1
Categorical1
Text4

Dataset

Description경상북도 구미시 골재채취업체 등록현황 데이터로 업종,회사명,법인번호,소재지,연락처 등을 제공하고 있습니다.
Author경상북도 구미시
URLhttps://www.data.go.kr/data/15016576/fileData.do

Alerts

연락처 has 22 (66.7%) missing valuesMissing
연번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 14:22:33.137346
Analysis finished2023-12-12 14:22:33.674500
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

연번
Real number (ℝ)

UNIQUE 

Distinct33
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17
Minimum1
Maximum33
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size429.0 B
2023-12-12T23:22:33.778053image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.6
Q19
median17
Q325
95-th percentile31.4
Maximum33
Range32
Interquartile range (IQR)16

Descriptive statistics

Standard deviation9.6695398
Coefficient of variation (CV)0.56879646
Kurtosis-1.2
Mean17
Median Absolute Deviation (MAD)8
Skewness0
Sum561
Variance93.5
MonotonicityStrictly increasing
2023-12-12T23:22:33.906137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
1 1
 
3.0%
26 1
 
3.0%
20 1
 
3.0%
21 1
 
3.0%
22 1
 
3.0%
23 1
 
3.0%
24 1
 
3.0%
25 1
 
3.0%
27 1
 
3.0%
2 1
 
3.0%
Other values (23) 23
69.7%
ValueCountFrequency (%)
1 1
3.0%
2 1
3.0%
3 1
3.0%
4 1
3.0%
5 1
3.0%
6 1
3.0%
7 1
3.0%
8 1
3.0%
9 1
3.0%
10 1
3.0%
ValueCountFrequency (%)
33 1
3.0%
32 1
3.0%
31 1
3.0%
30 1
3.0%
29 1
3.0%
28 1
3.0%
27 1
3.0%
26 1
3.0%
25 1
3.0%
24 1
3.0%

업종
Categorical

Distinct3
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size396.0 B
육상골재채취업
19 
골재선별파쇄업
12 
산림골재업

Length

Max length7
Median length7
Mean length6.8787879
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row육상골재채취업
2nd row산림골재업
3rd row골재선별파쇄업
4th row육상골재채취업
5th row육상골재채취업

Common Values

ValueCountFrequency (%)
육상골재채취업 19
57.6%
골재선별파쇄업 12
36.4%
산림골재업 2
 
6.1%

Length

2023-12-12T23:22:34.033291image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T23:22:34.137372image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
육상골재채취업 19
57.6%
골재선별파쇄업 12
36.4%
산림골재업 2
 
6.1%
Distinct25
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-12T23:22:34.289650image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length5.1212121
Min length3

Characters and Unicode

Total characters169
Distinct characters51
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)54.5%

Sample

1st row㈜대동
2nd row㈜대양
3rd row㈜대양
4th row㈜대양
5th row㈜대원산업개발
ValueCountFrequency (%)
㈜대양 3
 
9.1%
㈜대영산업개발 2
 
6.1%
삼봉산업㈜ 2
 
6.1%
㈜예스골재 2
 
6.1%
㈜대한토건 2
 
6.1%
㈜대원산업개발 2
 
6.1%
태경산업㈜ 2
 
6.1%
㈜미래개발 1
 
3.0%
㈜대동 1
 
3.0%
㈜고려 1
 
3.0%
Other values (15) 15
45.5%
2023-12-12T23:22:34.630659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
33
19.5%
14
 
8.3%
13
 
7.7%
13
 
7.7%
13
 
7.7%
10
 
5.9%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
Other values (41) 59
34.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 134
79.3%
Other Symbol 33
 
19.5%
Space Separator 2
 
1.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
14
 
10.4%
13
 
9.7%
13
 
9.7%
13
 
9.7%
10
 
7.5%
4
 
3.0%
4
 
3.0%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (39) 54
40.3%
Other Symbol
ValueCountFrequency (%)
33
100.0%
Space Separator
ValueCountFrequency (%)
2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 167
98.8%
Common 2
 
1.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
33
19.8%
14
 
8.4%
13
 
7.8%
13
 
7.8%
13
 
7.8%
10
 
6.0%
4
 
2.4%
4
 
2.4%
3
 
1.8%
3
 
1.8%
Other values (40) 57
34.1%
Common
ValueCountFrequency (%)
2
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 134
79.3%
None 33
 
19.5%
ASCII 2
 
1.2%

Most frequent character per block

None
ValueCountFrequency (%)
33
100.0%
Hangul
ValueCountFrequency (%)
14
 
10.4%
13
 
9.7%
13
 
9.7%
13
 
9.7%
10
 
7.5%
4
 
3.0%
4
 
3.0%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (39) 54
40.3%
ASCII
ValueCountFrequency (%)
2
100.0%
Distinct25
Distinct (%)75.8%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-12T23:22:34.851396image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length45
Median length34
Mean length23.969697
Min length17

Characters and Unicode

Total characters791
Distinct characters77
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)54.5%

Sample

1st row경상북도 구미시 도개면 강동로 2651
2nd row경상북도 구미시 도개면 도군로 477-25
3rd row경상북도 구미시 도개면 도군로 477-25
4th row경상북도 구미시 도개면 도군로 477-25
5th row경상북도 구미시 금오대로 382
ValueCountFrequency (%)
경상북도 33
19.5%
구미시 33
19.5%
선산읍 6
 
3.6%
고아읍 5
 
3.0%
도개면 4
 
2.4%
2층 4
 
2.4%
강동로 4
 
2.4%
도군로 3
 
1.8%
477-25 3
 
1.8%
650 2
 
1.2%
Other values (56) 72
42.6%
2023-12-12T23:22:35.192239image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
136
 
17.2%
40
 
5.1%
39
 
4.9%
37
 
4.7%
34
 
4.3%
34
 
4.3%
33
 
4.2%
33
 
4.2%
2 29
 
3.7%
1 29
 
3.7%
Other values (67) 347
43.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 474
59.9%
Decimal Number 147
 
18.6%
Space Separator 136
 
17.2%
Other Punctuation 12
 
1.5%
Dash Punctuation 11
 
1.4%
Open Punctuation 6
 
0.8%
Close Punctuation 5
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
40
 
8.4%
39
 
8.2%
37
 
7.8%
34
 
7.2%
34
 
7.2%
33
 
7.0%
33
 
7.0%
25
 
5.3%
18
 
3.8%
16
 
3.4%
Other values (52) 165
34.8%
Decimal Number
ValueCountFrequency (%)
2 29
19.7%
1 29
19.7%
3 19
12.9%
4 17
11.6%
0 13
8.8%
5 11
 
7.5%
6 10
 
6.8%
7 9
 
6.1%
8 6
 
4.1%
9 4
 
2.7%
Space Separator
ValueCountFrequency (%)
136
100.0%
Other Punctuation
ValueCountFrequency (%)
, 12
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 6
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 474
59.9%
Common 317
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
 
8.4%
39
 
8.2%
37
 
7.8%
34
 
7.2%
34
 
7.2%
33
 
7.0%
33
 
7.0%
25
 
5.3%
18
 
3.8%
16
 
3.4%
Other values (52) 165
34.8%
Common
ValueCountFrequency (%)
136
42.9%
2 29
 
9.1%
1 29
 
9.1%
3 19
 
6.0%
4 17
 
5.4%
0 13
 
4.1%
, 12
 
3.8%
5 11
 
3.5%
- 11
 
3.5%
6 10
 
3.2%
Other values (5) 30
 
9.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 474
59.9%
ASCII 317
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
136
42.9%
2 29
 
9.1%
1 29
 
9.1%
3 19
 
6.0%
4 17
 
5.4%
0 13
 
4.1%
, 12
 
3.8%
5 11
 
3.5%
- 11
 
3.5%
6 10
 
3.2%
Other values (5) 30
 
9.5%
Hangul
ValueCountFrequency (%)
40
 
8.4%
39
 
8.2%
37
 
7.8%
34
 
7.2%
34
 
7.2%
33
 
7.0%
33
 
7.0%
25
 
5.3%
18
 
3.8%
16
 
3.4%
Other values (52) 165
34.8%

연락처
Text

MISSING 

Distinct7
Distinct (%)63.6%
Missing22
Missing (%)66.7%
Memory size396.0 B
2023-12-12T23:22:35.342953image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters132
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4 ?
Unique (%)36.4%

Sample

1st row054-472-0018
2nd row054-474-7070
3rd row054-474-7070
4th row054-474-7070
5th row054-475-3500
ValueCountFrequency (%)
054-474-7070 3
27.3%
054-475-3500 2
18.2%
054-481-0039 2
18.2%
054-472-0018 1
 
9.1%
054-443-6616 1
 
9.1%
054-975-0023 1
 
9.1%
054-457-6874 1
 
9.1%
2023-12-12T23:22:35.609885image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 29
22.0%
4 26
19.7%
- 22
16.7%
5 17
12.9%
7 15
11.4%
3 6
 
4.5%
8 4
 
3.0%
1 4
 
3.0%
6 4
 
3.0%
9 3
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 110
83.3%
Dash Punctuation 22
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 29
26.4%
4 26
23.6%
5 17
15.5%
7 15
13.6%
3 6
 
5.5%
8 4
 
3.6%
1 4
 
3.6%
6 4
 
3.6%
9 3
 
2.7%
2 2
 
1.8%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 132
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 29
22.0%
4 26
19.7%
- 22
16.7%
5 17
12.9%
7 15
11.4%
3 6
 
4.5%
8 4
 
3.0%
1 4
 
3.0%
6 4
 
3.0%
9 3
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 132
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 29
22.0%
4 26
19.7%
- 22
16.7%
5 17
12.9%
7 15
11.4%
3 6
 
4.5%
8 4
 
3.0%
1 4
 
3.0%
6 4
 
3.0%
9 3
 
2.3%
Distinct30
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size396.0 B
2023-12-12T23:22:35.812529image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length10
Mean length10
Min length9

Characters and Unicode

Total characters330
Distinct characters16
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique27 ?
Unique (%)81.8%

Sample

1st row구미시2009-01
2nd row구미시1995-136
3rd row구미시2012-03
4th row구미시2012-04
5th row구미시2015-04
ValueCountFrequency (%)
구미시2015-04 2
 
6.1%
구미시2022-02 2
 
6.1%
구미시2020-10 2
 
6.1%
구미시2020-05 1
 
3.0%
구미시2009-01 1
 
3.0%
구미시2020-06 1
 
3.0%
구미시2021-04 1
 
3.0%
구미시2017-04 1
 
3.0%
구미시2016-04 1
 
3.0%
구미시2016-03 1
 
3.0%
Other values (20) 20
60.6%
2023-12-12T23:22:36.169385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 73
22.1%
2 57
17.3%
- 33
10.0%
32
9.7%
32
9.7%
32
9.7%
1 26
 
7.9%
5 7
 
2.1%
4 7
 
2.1%
6 7
 
2.1%
Other values (6) 24
 
7.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 199
60.3%
Other Letter 98
29.7%
Dash Punctuation 33
 
10.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 73
36.7%
2 57
28.6%
1 26
 
13.1%
5 7
 
3.5%
4 7
 
3.5%
6 7
 
3.5%
9 6
 
3.0%
3 6
 
3.0%
7 6
 
3.0%
8 4
 
2.0%
Other Letter
ValueCountFrequency (%)
32
32.7%
32
32.7%
32
32.7%
1
 
1.0%
1
 
1.0%
Dash Punctuation
ValueCountFrequency (%)
- 33
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 232
70.3%
Hangul 98
29.7%

Most frequent character per script

Common
ValueCountFrequency (%)
0 73
31.5%
2 57
24.6%
- 33
14.2%
1 26
 
11.2%
5 7
 
3.0%
4 7
 
3.0%
6 7
 
3.0%
9 6
 
2.6%
3 6
 
2.6%
7 6
 
2.6%
Hangul
ValueCountFrequency (%)
32
32.7%
32
32.7%
32
32.7%
1
 
1.0%
1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 232
70.3%
Hangul 98
29.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 73
31.5%
2 57
24.6%
- 33
14.2%
1 26
 
11.2%
5 7
 
3.0%
4 7
 
3.0%
6 7
 
3.0%
9 6
 
2.6%
3 6
 
2.6%
7 6
 
2.6%
Hangul
ValueCountFrequency (%)
32
32.7%
32
32.7%
32
32.7%
1
 
1.0%
1
 
1.0%

Interactions

2023-12-12T23:22:33.400719image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T23:22:36.263567image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종회사명소재지연락처등록번호
연번1.0000.5380.9410.9410.7090.992
업종0.5381.0000.0000.0000.0000.660
회사명0.9410.0001.0001.0001.0001.000
소재지0.9410.0001.0001.0001.0001.000
연락처0.7090.0001.0001.0001.0001.000
등록번호0.9920.6601.0001.0001.0001.000
2023-12-12T23:22:36.354373image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
연번업종
연번1.0000.323
업종0.3231.000

Missing values

2023-12-12T23:22:33.522342image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T23:22:33.628945image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

연번업종회사명소재지연락처등록번호
01육상골재채취업㈜대동경상북도 구미시 도개면 강동로 2651054-472-0018구미시2009-01
12산림골재업㈜대양경상북도 구미시 도개면 도군로 477-25054-474-7070구미시1995-136
23골재선별파쇄업㈜대양경상북도 구미시 도개면 도군로 477-25054-474-7070구미시2012-03
34육상골재채취업㈜대양경상북도 구미시 도개면 도군로 477-25054-474-7070구미시2012-04
45육상골재채취업㈜대원산업개발경상북도 구미시 금오대로 382<NA>구미시2015-04
56골재선별파쇄업㈜대원산업개발경상북도 구미시 금오대로 382<NA>구미시2015-04
67골재선별파쇄업태경산업㈜경상북도 구미시 흥안로 2길 14-43(옥계동)054-475-3500구미시2016-02
78육상골재채취업태경산업㈜경상북도 구미시 흥안로 2길 14-43(옥계동)054-475-3500구미시2017-05
89육상골재채취업㈜한성개발경상북도 구미시 구미대로 144, 3층054-443-6616구미시2017-06
910육상골재채취업㈜대영산업개발경상북도 구미시 선산읍 동교길 43-1<NA>구미시2018-01
연번업종회사명소재지연락처등록번호
2324육상골재채취업㈜예스골재경상북도 구미시 장천면 강동로 650<NA>구미시2022-02
2425골재선별파쇄업㈜예스골재경상북도 구미시 장천면 강동로 650<NA>구미시2022-02
2526육상골재채취업㈜감성골재경상북도 구미시 선산읍 선주로 95<NA>구미시2022-03
2627골재선별파쇄업세아산업㈜경상북도 구미시 구미대로 409, 10층 1005호(신평동, 구미신평지엘리베라움)054-975-0023구미시2007-02
2728골재선별파쇄업㈜백상경상북도 구미시 선산읍 선상동로 354, 2층<NA>구미시2015-02
2829산림골재업삼봉산업㈜경상북도 구미시 옥성면 옥관구평길 111054-481-0039구미시2016-03
2930골재선별파쇄업삼봉산업㈜경상북도 구미시 옥성면 옥관구평길 111054-481-0039구미시2016-04
3031골재선별파쇄업㈜에스에스물류경상북도 구미시 1공단로7길 86-1, 3층 302호(공단동)<NA>구미시2017-04
3132골재선별파쇄업㈜진율산업개발경상북도 구미시 고아읍 문장로22길 22, 지하1층 102호054-457-6874구미시2021-04
3233골재선별파쇄업㈜영실경상북도 구미시 고아읍 송평로7, 2호<NA>전주2007-07