Overview

Dataset statistics

Number of variables6
Number of observations40
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.0 KiB
Average record size in memory51.3 B

Variable types

Text3
Categorical3

Dataset

Description충청북도 충주시 골재채취업 등록 현황에 대한 정보 제공(업체명, 업종, 주소, 도로명주소, 관리부서 전화번호, 기준일 등)
URLhttps://www.data.go.kr/data/15047465/fileData.do

Alerts

업종선택 is highly overall correlated with 관리부서 전화번호 and 1 other fieldsHigh correlation
관리부서 전화번호 is highly overall correlated with 업종선택 and 1 other fieldsHigh correlation
등록기준일 is highly overall correlated with 업종선택 and 1 other fieldsHigh correlation
관리부서 전화번호 is highly imbalanced (83.1%)Imbalance
등록기준일 is highly imbalanced (83.1%)Imbalance

Reproduction

Analysis started2023-12-12 19:27:26.972390
Analysis finished2023-12-12 19:27:27.535617
Duration0.56 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct27
Distinct (%)67.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-13T04:27:27.692175image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length5
Mean length4.75
Min length2

Characters and Unicode

Total characters190
Distinct characters55
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18 ?
Unique (%)45.0%

Sample

1st row㈜이레산업
2nd row㈜이레산업
3rd row㈜명진개발
4th row㈜명진개발
5th row㈜명진개발
ValueCountFrequency (%)
㈜명진 3
 
7.5%
㈜명진개발 3
 
7.5%
신진개발㈜ 3
 
7.5%
㈜덕산 3
 
7.5%
㈜삼일산업 2
 
5.0%
이음건설㈜ 2
 
5.0%
㈜대성물산 2
 
5.0%
㈜이레산업 2
 
5.0%
㈜대화개발 2
 
5.0%
㈜노은환경개발 1
 
2.5%
Other values (17) 17
42.5%
2023-12-13T04:27:28.032068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
40
21.1%
19
 
10.0%
13
 
6.8%
10
 
5.3%
10
 
5.3%
9
 
4.7%
7
 
3.7%
7
 
3.7%
6
 
3.2%
4
 
2.1%
Other values (45) 65
34.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 150
78.9%
Other Symbol 40
 
21.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
12.7%
13
 
8.7%
10
 
6.7%
10
 
6.7%
9
 
6.0%
7
 
4.7%
7
 
4.7%
6
 
4.0%
4
 
2.7%
3
 
2.0%
Other values (44) 62
41.3%
Other Symbol
ValueCountFrequency (%)
40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 190
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
40
21.1%
19
 
10.0%
13
 
6.8%
10
 
5.3%
10
 
5.3%
9
 
4.7%
7
 
3.7%
7
 
3.7%
6
 
3.2%
4
 
2.1%
Other values (45) 65
34.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 150
78.9%
None 40
 
21.1%

Most frequent character per block

None
ValueCountFrequency (%)
40
100.0%
Hangul
ValueCountFrequency (%)
19
 
12.7%
13
 
8.7%
10
 
6.7%
10
 
6.7%
9
 
6.0%
7
 
4.7%
7
 
4.7%
6
 
4.0%
4
 
2.7%
3
 
2.0%
Other values (44) 62
41.3%

업종선택
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
육상골재
18 
선별파쇄
13 
산림골재
선별세척
 
1

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique1 ?
Unique (%)2.5%

Sample

1st row산림골재
2nd row선별파쇄
3rd row산림골재
4th row육상골재
5th row선별파쇄

Common Values

ValueCountFrequency (%)
육상골재 18
45.0%
선별파쇄 13
32.5%
산림골재 8
20.0%
선별세척 1
 
2.5%

Length

2023-12-13T04:27:28.173371image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:27:28.283293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
육상골재 18
45.0%
선별파쇄 13
32.5%
산림골재 8
20.0%
선별세척 1
 
2.5%

주소
Text

Distinct24
Distinct (%)60.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-13T04:27:28.511422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length30
Mean length23.975
Min length17

Characters and Unicode

Total characters959
Distinct characters97
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)32.5%

Sample

1st row충청북도 충주시 노은면 연하리 산29-2
2nd row충청북도 충주시 노은면 연하리 산29-2
3rd row충청북도 충주시 소태면 구룡리 산 36-3
4th row충청북도 충주시 소태면 구룡리 산 36-3
5th row충청북도 충주시 소태면 구룡리 산 36-3
ValueCountFrequency (%)
충청북도 40
19.6%
충주시 40
19.6%
대소원면 10
 
4.9%
주덕읍 5
 
2.5%
연수로 4
 
2.0%
1길12(연수동 4
 
2.0%
아이파크 4
 
2.0%
102-1502 4
 
2.0%
노은면 4
 
2.0%
구룡리 3
 
1.5%
Other values (57) 86
42.2%
2023-12-13T04:27:28.889391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164
17.1%
80
 
8.3%
1 52
 
5.4%
45
 
4.7%
40
 
4.2%
40
 
4.2%
40
 
4.2%
40
 
4.2%
2 27
 
2.8%
5 25
 
2.6%
Other values (87) 406
42.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 574
59.9%
Decimal Number 179
 
18.7%
Space Separator 164
 
17.1%
Dash Punctuation 21
 
2.2%
Close Punctuation 9
 
0.9%
Open Punctuation 9
 
0.9%
Other Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
13.9%
45
 
7.8%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
22
 
3.8%
17
 
3.0%
16
 
2.8%
14
 
2.4%
Other values (72) 220
38.3%
Decimal Number
ValueCountFrequency (%)
1 52
29.1%
2 27
15.1%
5 25
14.0%
3 25
14.0%
0 15
 
8.4%
4 12
 
6.7%
7 8
 
4.5%
9 5
 
2.8%
6 5
 
2.8%
8 5
 
2.8%
Space Separator
ValueCountFrequency (%)
164
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 574
59.9%
Common 385
40.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
13.9%
45
 
7.8%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
22
 
3.8%
17
 
3.0%
16
 
2.8%
14
 
2.4%
Other values (72) 220
38.3%
Common
ValueCountFrequency (%)
164
42.6%
1 52
 
13.5%
2 27
 
7.0%
5 25
 
6.5%
3 25
 
6.5%
- 21
 
5.5%
0 15
 
3.9%
4 12
 
3.1%
) 9
 
2.3%
( 9
 
2.3%
Other values (5) 26
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 574
59.9%
ASCII 385
40.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164
42.6%
1 52
 
13.5%
2 27
 
7.0%
5 25
 
6.5%
3 25
 
6.5%
- 21
 
5.5%
0 15
 
3.9%
4 12
 
3.1%
) 9
 
2.3%
( 9
 
2.3%
Other values (5) 26
 
6.8%
Hangul
ValueCountFrequency (%)
80
 
13.9%
45
 
7.8%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
22
 
3.8%
17
 
3.0%
16
 
2.8%
14
 
2.4%
Other values (72) 220
38.3%
Distinct23
Distinct (%)57.5%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-12-13T04:27:29.142099image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length41
Median length36
Mean length23.775
Min length17

Characters and Unicode

Total characters951
Distinct characters96
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique13 ?
Unique (%)32.5%

Sample

1st row충청북도 충주시 노은면 감노로 1495-40
2nd row충청북도 충주시 노은면 감노로 1495-40
3rd row충청북도 충주시 소태면 구룡로 1298
4th row충청북도 충주시 소태면 구룡로 1298
5th row충청북도 충주시 소태면 구룡로 1298
ValueCountFrequency (%)
충청북도 40
19.9%
충주시 40
19.9%
대소원면 12
 
6.0%
주덕읍 5
 
2.5%
월은1길 4
 
2.0%
37 4
 
2.0%
연수로 4
 
2.0%
1길12(연수동 4
 
2.0%
아이파크 4
 
2.0%
102-1502 4
 
2.0%
Other values (51) 80
39.8%
2023-12-13T04:27:29.532224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
161
16.9%
80
 
8.4%
1 57
 
6.0%
45
 
4.7%
40
 
4.2%
40
 
4.2%
40
 
4.2%
40
 
4.2%
2 25
 
2.6%
25
 
2.6%
Other values (86) 398
41.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 569
59.8%
Decimal Number 186
 
19.6%
Space Separator 161
 
16.9%
Dash Punctuation 14
 
1.5%
Close Punctuation 9
 
0.9%
Open Punctuation 9
 
0.9%
Other Punctuation 3
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
80
 
14.1%
45
 
7.9%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
25
 
4.4%
22
 
3.9%
19
 
3.3%
15
 
2.6%
Other values (71) 203
35.7%
Decimal Number
ValueCountFrequency (%)
1 57
30.6%
2 25
13.4%
5 21
 
11.3%
3 19
 
10.2%
0 18
 
9.7%
4 14
 
7.5%
7 11
 
5.9%
9 9
 
4.8%
8 6
 
3.2%
6 6
 
3.2%
Space Separator
ValueCountFrequency (%)
161
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 9
100.0%
Open Punctuation
ValueCountFrequency (%)
( 9
100.0%
Other Punctuation
ValueCountFrequency (%)
, 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 569
59.8%
Common 382
40.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
80
 
14.1%
45
 
7.9%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
25
 
4.4%
22
 
3.9%
19
 
3.3%
15
 
2.6%
Other values (71) 203
35.7%
Common
ValueCountFrequency (%)
161
42.1%
1 57
 
14.9%
2 25
 
6.5%
5 21
 
5.5%
3 19
 
5.0%
0 18
 
4.7%
4 14
 
3.7%
- 14
 
3.7%
7 11
 
2.9%
) 9
 
2.4%
Other values (5) 33
 
8.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 569
59.8%
ASCII 382
40.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
161
42.1%
1 57
 
14.9%
2 25
 
6.5%
5 21
 
5.5%
3 19
 
5.0%
0 18
 
4.7%
4 14
 
3.7%
- 14
 
3.7%
7 11
 
2.9%
) 9
 
2.4%
Other values (5) 33
 
8.6%
Hangul
ValueCountFrequency (%)
80
 
14.1%
45
 
7.9%
40
 
7.0%
40
 
7.0%
40
 
7.0%
40
 
7.0%
25
 
4.4%
22
 
3.9%
19
 
3.3%
15
 
2.6%
Other values (71) 203
35.7%

관리부서 전화번호
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
043-850-6141
39 
<NA>
 
1

Length

Max length12
Median length12
Mean length11.8
Min length4

Unique

Unique1 ?
Unique (%)2.5%

Sample

1st row043-850-6141
2nd row043-850-6141
3rd row043-850-6141
4th row043-850-6141
5th row043-850-6141

Common Values

ValueCountFrequency (%)
043-850-6141 39
97.5%
<NA> 1
 
2.5%

Length

2023-12-13T04:27:29.688560image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:27:29.820001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
043-850-6141 39
97.5%
na 1
 
2.5%

등록기준일
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size452.0 B
2023-04-30
39 
<NA>
 
1

Length

Max length10
Median length10
Mean length9.85
Min length4

Unique

Unique1 ?
Unique (%)2.5%

Sample

1st row2023-04-30
2nd row2023-04-30
3rd row2023-04-30
4th row2023-04-30
5th row2023-04-30

Common Values

ValueCountFrequency (%)
2023-04-30 39
97.5%
<NA> 1
 
2.5%

Length

2023-12-13T04:27:29.938620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:27:30.042614image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2023-04-30 39
97.5%
na 1
 
2.5%

Correlations

2023-12-13T04:27:30.121185image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업체명업종선택주소도로명주소
업체명1.0000.0001.0001.000
업종선택0.0001.0000.4960.355
주소1.0000.4961.0001.000
도로명주소1.0000.3551.0001.000
2023-12-13T04:27:30.231664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종선택관리부서 전화번호등록기준일
업종선택1.0001.0001.000
관리부서 전화번호1.0001.0001.000
등록기준일1.0001.0001.000
2023-12-13T04:27:30.328427image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
업종선택관리부서 전화번호등록기준일
업종선택1.0001.0001.000
관리부서 전화번호1.0001.0001.000
등록기준일1.0001.0001.000

Missing values

2023-12-13T04:27:27.347465image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:27:27.479077image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명업종선택주소도로명주소관리부서 전화번호등록기준일
0㈜이레산업산림골재충청북도 충주시 노은면 연하리 산29-2충청북도 충주시 노은면 감노로 1495-40043-850-61412023-04-30
1㈜이레산업선별파쇄충청북도 충주시 노은면 연하리 산29-2충청북도 충주시 노은면 감노로 1495-40043-850-61412023-04-30
2㈜명진개발산림골재충청북도 충주시 소태면 구룡리 산 36-3충청북도 충주시 소태면 구룡로 1298043-850-61412023-04-30
3㈜명진개발육상골재충청북도 충주시 소태면 구룡리 산 36-3충청북도 충주시 소태면 구룡로 1298043-850-61412023-04-30
4㈜명진개발선별파쇄충청북도 충주시 소태면 구룡리 산 36-3충청북도 충주시 소태면 구룡로 1298043-850-61412023-04-30
5신진개발㈜산림골재충청북도 충주시 대소원면 산독정길 143-1충청북도 충주시 대소원면 산정독정길 143-1043-850-61412023-04-30
6신진개발㈜육상골재충청북도 충주시 대소원면 산독정길 143-1충청북도 충주시 대소원면 산정독정길 143-1043-850-61412023-04-30
7신진개발㈜선별파쇄충청북도 충주시 대소원면 산독정길 143-1충청북도 충주시 대소원면 산정독정길 143-1043-850-61412023-04-30
8해광산업㈜선별파쇄충청북도 충주시 노은면 하너미로 124충청북도 충주시 노은면 하너미로 124043-850-61412023-04-30
9㈜충주산업육상골재충청북도 충주시 대소원면 중원대로 4052충청북도 충주시 대소원면 중원대로 4052043-850-61412023-04-30
업체명업종선택주소도로명주소관리부서 전화번호등록기준일
30㈜일심산업육상골재충청북도 충주시 동량면 수회길 110충청북도 충주시 동량면 수회길 110043-850-61412023-04-30
31태산그린산업㈜육상골재충청북도 충주시 대소원면 월은1길 37충청북도 충주시 대소원면 월은1길 37043-850-61412023-04-30
32㈜영성육상골재충청북도 충주시 형설로 32, 103동 103호(호암동, 세영더조은아파트)충청북도 충주시 형설로 32, 103동 103호(호암동, 세영더조은아파트)043-850-61412023-04-30
33㈜현성산업육상골재충청북도 충주시 상방3길 55(봉방동)충청북도 충주시 상방3길 55(봉방동)043-850-61412023-04-30
34이음건설㈜육상골재충청북도 충주시 감노로 1574충청북도 충주시 감노로 1574043-850-61412023-04-30
35이음건설㈜선별파쇄충청북도 충주시 감노로 1574충청북도 충주시 감노로 1574043-850-61412023-04-30
36아이리스㈜선별세척충청북도 충주시 산척면 영덕리 73-1충청북도 충주시 산척면 인등로 177043-850-61412023-04-30
37상앤엠㈜선별파쇄충청북도 충주시 대소원면 외동길 217충청북도 충주시 대소원면 외동길 217043-850-61412023-04-30
38㈜삼일산업산림골재충청북도 충주시 대소원면 쇠실로 595-46충청북도 충주시 대소원면 쇠실로 595-46043-850-61412023-04-30
39㈜삼일산업선별파쇄충청북도 충주시 대소원면 쇠실로 595-46충청북도 충주시 대소원면 쇠실로 595-46<NA><NA>