Overview

Dataset statistics

Number of variables6
Number of observations22
Missing cells1
Missing cells (%)0.8%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.2 KiB
Average record size in memory54.0 B

Variable types

Text5
Categorical1

Dataset

Description계룡시 관내 대기 배출 시설 설치현황(사업장명,사업자등록번호,도로명소재지,전화번호,대표업종,종)에 대한 공공데이터 제공신청입니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=339&beforeMenuCd=DOM_000000201001001000&publicdatapk=15083581

Alerts

전화번호 has 1 (4.5%) missing valuesMissing
사업장명 has unique valuesUnique
도로명소재지 has unique valuesUnique

Reproduction

Analysis started2024-01-09 20:54:25.076340
Analysis finished2024-01-09 20:54:25.503069
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2024-01-10T05:54:25.615146image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length23
Median length14
Mean length10
Min length4

Characters and Unicode

Total characters220
Distinct characters108
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row주안레미콘(주)
2nd row농업회사법인신도안종합식품(주)
3rd row계룡현대서비스
4th row계룡대근무지원단
5th row길산스틸(주)계룡공장
ValueCountFrequency (%)
계룡대 2
 
6.5%
주안레미콘(주 1
 
3.2%
대전충청지역본부 1
 
3.2%
공공시설사업소(계룡문화예술의 1
 
3.2%
계룡시 1
 
3.2%
신유승모터스 1
 
3.2%
계룡점 1
 
3.2%
홈플러스(주 1
 
3.2%
계룡웰빙클럽 1
 
3.2%
계룡시청 1
 
3.2%
Other values (20) 20
64.5%
2024-01-10T05:54:25.913940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
13
 
5.9%
13
 
5.9%
( 11
 
5.0%
) 11
 
5.0%
10
 
4.5%
9
 
4.1%
9
 
4.1%
7
 
3.2%
5
 
2.3%
4
 
1.8%
Other values (98) 128
58.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 189
85.9%
Open Punctuation 11
 
5.0%
Close Punctuation 11
 
5.0%
Space Separator 9
 
4.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
13
 
6.9%
13
 
6.9%
10
 
5.3%
9
 
4.8%
7
 
3.7%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (95) 116
61.4%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 189
85.9%
Common 31
 
14.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
13
 
6.9%
13
 
6.9%
10
 
5.3%
9
 
4.8%
7
 
3.7%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (95) 116
61.4%
Common
ValueCountFrequency (%)
( 11
35.5%
) 11
35.5%
9
29.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 189
85.9%
ASCII 31
 
14.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
13
 
6.9%
13
 
6.9%
10
 
5.3%
9
 
4.8%
7
 
3.7%
5
 
2.6%
4
 
2.1%
4
 
2.1%
4
 
2.1%
4
 
2.1%
Other values (95) 116
61.4%
ASCII
ValueCountFrequency (%)
( 11
35.5%
) 11
35.5%
9
29.0%
Distinct20
Distinct (%)90.9%
Missing0
Missing (%)0.0%
Memory size308.0 B
2024-01-10T05:54:26.079260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters264
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)86.4%

Sample

1st row308-81-26392
2nd row308-81-09092
3rd row308-02-83874
4th row308-83-04413
5th row306-81-28318
ValueCountFrequency (%)
308-83-03559 3
 
13.6%
308-81-26392 1
 
4.5%
308-81-43426 1
 
4.5%
220-81-60348 1
 
4.5%
370-18-01499 1
 
4.5%
601-87-00459 1
 
4.5%
308-03-99596 1
 
4.5%
550-87-01034 1
 
4.5%
452-86-00224 1
 
4.5%
305-82-08484 1
 
4.5%
Other values (10) 10
45.5%
2024-01-10T05:54:26.343710image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 44
16.7%
- 44
16.7%
8 40
15.2%
3 36
13.6%
5 18
6.8%
2 18
6.8%
4 18
6.8%
1 16
 
6.1%
9 14
 
5.3%
6 10
 
3.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 220
83.3%
Dash Punctuation 44
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 44
20.0%
8 40
18.2%
3 36
16.4%
5 18
8.2%
2 18
8.2%
4 18
8.2%
1 16
 
7.3%
9 14
 
6.4%
6 10
 
4.5%
7 6
 
2.7%
Dash Punctuation
ValueCountFrequency (%)
- 44
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 264
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 44
16.7%
- 44
16.7%
8 40
15.2%
3 36
13.6%
5 18
6.8%
2 18
6.8%
4 18
6.8%
1 16
 
6.1%
9 14
 
5.3%
6 10
 
3.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 264
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 44
16.7%
- 44
16.7%
8 40
15.2%
3 36
13.6%
5 18
6.8%
2 18
6.8%
4 18
6.8%
1 16
 
6.1%
9 14
 
5.3%
6 10
 
3.8%

도로명소재지
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2024-01-10T05:54:26.523126image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length33
Median length30
Mean length23.863636
Min length19

Characters and Unicode

Total characters525
Distinct characters68
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row충청남도 계룡시 엄사면 계백로 2906-27
2nd row충청남도 계룡시 엄사면 도곡로 193
3rd row충청남도 계룡시 두마면 왕대로 99
4th row충청남도 계룡시 신도안면 부남리 사서함 78호
5th row충청남도 계룡시 두마면 제1산단로 25-46
ValueCountFrequency (%)
충청남도 22
19.1%
계룡시 22
19.1%
두마면 12
 
10.4%
엄사면 6
 
5.2%
제1산단로 6
 
5.2%
계룡대로 2
 
1.7%
입암길 2
 
1.7%
신도안면 2
 
1.7%
금암동 2
 
1.7%
42-45 1
 
0.9%
Other values (38) 38
33.0%
2024-01-10T05:54:26.812451image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
95
18.1%
29
 
5.5%
27
 
5.1%
26
 
5.0%
23
 
4.4%
23
 
4.4%
23
 
4.4%
22
 
4.2%
1 22
 
4.2%
20
 
3.8%
Other values (58) 215
41.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 329
62.7%
Space Separator 95
 
18.1%
Decimal Number 81
 
15.4%
Dash Punctuation 10
 
1.9%
Close Punctuation 5
 
1.0%
Open Punctuation 5
 
1.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
29
 
8.8%
27
 
8.2%
26
 
7.9%
23
 
7.0%
23
 
7.0%
23
 
7.0%
22
 
6.7%
20
 
6.1%
14
 
4.3%
12
 
3.6%
Other values (44) 110
33.4%
Decimal Number
ValueCountFrequency (%)
1 22
27.2%
2 14
17.3%
4 12
14.8%
5 6
 
7.4%
9 6
 
7.4%
6 5
 
6.2%
7 5
 
6.2%
0 5
 
6.2%
3 3
 
3.7%
8 3
 
3.7%
Space Separator
ValueCountFrequency (%)
95
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 5
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 329
62.7%
Common 196
37.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
29
 
8.8%
27
 
8.2%
26
 
7.9%
23
 
7.0%
23
 
7.0%
23
 
7.0%
22
 
6.7%
20
 
6.1%
14
 
4.3%
12
 
3.6%
Other values (44) 110
33.4%
Common
ValueCountFrequency (%)
95
48.5%
1 22
 
11.2%
2 14
 
7.1%
4 12
 
6.1%
- 10
 
5.1%
5 6
 
3.1%
9 6
 
3.1%
6 5
 
2.6%
7 5
 
2.6%
) 5
 
2.6%
Other values (4) 16
 
8.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 329
62.7%
ASCII 196
37.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
95
48.5%
1 22
 
11.2%
2 14
 
7.1%
4 12
 
6.1%
- 10
 
5.1%
5 6
 
3.1%
9 6
 
3.1%
6 5
 
2.6%
7 5
 
2.6%
) 5
 
2.6%
Other values (4) 16
 
8.2%
Hangul
ValueCountFrequency (%)
29
 
8.8%
27
 
8.2%
26
 
7.9%
23
 
7.0%
23
 
7.0%
23
 
7.0%
22
 
6.7%
20
 
6.1%
14
 
4.3%
12
 
3.6%
Other values (44) 110
33.4%

전화번호
Text

MISSING 

Distinct20
Distinct (%)95.2%
Missing1
Missing (%)4.5%
Memory size308.0 B
2024-01-10T05:54:26.982353image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length11
Min length1

Characters and Unicode

Total characters231
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique19 ?
Unique (%)90.5%

Sample

1st row042-841-1361
2nd row042 8417474
3rd row042-841-1111
4th row042-936-5204
5th row042-551-9994
ValueCountFrequency (%)
042 1
 
5.0%
8417474 1
 
5.0%
070-7728-9072 1
 
5.0%
042-550-8000 1
 
5.0%
042-482-0399 1
 
5.0%
042-841-1580 1
 
5.0%
042-841-9220 1
 
5.0%
042-825-5587 1
 
5.0%
042-229-3518 1
 
5.0%
042-841-1361 1
 
5.0%
Other values (10) 10
50.0%
2024-01-10T05:54:27.263915image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 38
16.5%
- 36
15.6%
4 35
15.2%
2 32
13.9%
1 20
8.7%
8 17
7.4%
5 16
6.9%
9 13
 
5.6%
7 10
 
4.3%
3 6
 
2.6%
Other values (2) 8
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 192
83.1%
Dash Punctuation 36
 
15.6%
Space Separator 3
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 38
19.8%
4 35
18.2%
2 32
16.7%
1 20
10.4%
8 17
8.9%
5 16
8.3%
9 13
 
6.8%
7 10
 
5.2%
3 6
 
3.1%
6 5
 
2.6%
Dash Punctuation
ValueCountFrequency (%)
- 36
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 231
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 38
16.5%
- 36
15.6%
4 35
15.2%
2 32
13.9%
1 20
8.7%
8 17
7.4%
5 16
6.9%
9 13
 
5.6%
7 10
 
4.3%
3 6
 
2.6%
Other values (2) 8
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 231
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 38
16.5%
- 36
15.6%
4 35
15.2%
2 32
13.9%
1 20
8.7%
8 17
7.4%
5 16
6.9%
9 13
 
5.6%
7 10
 
4.3%
3 6
 
2.6%
Other values (2) 8
 
3.5%
Distinct17
Distinct (%)77.3%
Missing0
Missing (%)0.0%
Memory size308.0 B
2024-01-10T05:54:27.428668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length24
Median length13
Mean length9.8181818
Min length3

Characters and Unicode

Total characters216
Distinct characters72
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique14 ?
Unique (%)63.6%

Sample

1st row시멘트 석회 플라스터 및 그 제품 제조업
2nd row전분제품 및 당류 제조업
3rd row자동차 종합 수리업
4th row국방 행정
5th row그외 기타 금속가공업
ValueCountFrequency (%)
제조업 7
 
11.1%
종합 5
 
7.9%
5
 
7.9%
자동차 4
 
6.3%
수리업 4
 
6.3%
행정 3
 
4.8%
부동산업 2
 
3.2%
국방 2
 
3.2%
기타 2
 
3.2%
배관공급업 1
 
1.6%
Other values (28) 28
44.4%
2024-01-10T05:54:27.691083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
43
19.9%
19
 
8.8%
11
 
5.1%
9
 
4.2%
6
 
2.8%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
5
 
2.3%
Other values (62) 103
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 173
80.1%
Space Separator 43
 
19.9%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
19
 
11.0%
11
 
6.4%
9
 
5.2%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
4
 
2.3%
Other values (61) 99
57.2%
Space Separator
ValueCountFrequency (%)
43
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 173
80.1%
Common 43
 
19.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
19
 
11.0%
11
 
6.4%
9
 
5.2%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
4
 
2.3%
Other values (61) 99
57.2%
Common
ValueCountFrequency (%)
43
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 173
80.1%
ASCII 43
 
19.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
43
100.0%
Hangul
ValueCountFrequency (%)
19
 
11.0%
11
 
6.4%
9
 
5.2%
6
 
3.5%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
5
 
2.9%
4
 
2.3%
Other values (61) 99
57.2%


Categorical

Distinct2
Distinct (%)9.1%
Missing0
Missing (%)0.0%
Memory size308.0 B
5종
16 
4종

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5종
2nd row5종
3rd row5종
4th row5종
5th row4종

Common Values

ValueCountFrequency (%)
5종 16
72.7%
4종 6
 
27.3%

Length

2024-01-10T05:54:27.799514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:54:27.880635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5종 16
72.7%
4종 6
 
27.3%

Correlations

2024-01-10T05:54:27.939343image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장명사업자등록번호도로명소재지전화번호대표업종
사업장명1.0001.0001.0001.0001.0001.000
사업자등록번호1.0001.0001.0000.9750.9611.000
도로명소재지1.0001.0001.0001.0001.0001.000
전화번호1.0000.9751.0001.0000.9561.000
대표업종1.0000.9611.0000.9561.0000.000
1.0001.0001.0001.0000.0001.000

Missing values

2024-01-10T05:54:25.385939image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:54:25.468306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명사업자등록번호도로명소재지전화번호대표업종
0주안레미콘(주)308-81-26392충청남도 계룡시 엄사면 계백로 2906-27042-841-1361시멘트 석회 플라스터 및 그 제품 제조업5종
1농업회사법인신도안종합식품(주)308-81-09092충청남도 계룡시 엄사면 도곡로 193042 8417474전분제품 및 당류 제조업5종
2계룡현대서비스308-02-83874충청남도 계룡시 두마면 왕대로 99042-841-1111자동차 종합 수리업5종
3계룡대근무지원단308-83-04413충청남도 계룡시 신도안면 부남리 사서함 78호042-936-5204국방 행정5종
4길산스틸(주)계룡공장306-81-28318충청남도 계룡시 두마면 제1산단로 25-46042-551-9994그외 기타 금속가공업4종
5계룡시(계룡하수처리장)308-83-03559충청남도 계룡시 두마면 대둔로 1422 (계룡하수종말처리장)042-841-4441계룡시하수처리시설5종
6오케이퓨쳐(주)124-81-16243충청남도 계룡시 두마면 제1산단로 25-57070-8630-5900토목공사 및 유사용 기계장비 제조업4종
7(주)아워홈 계룡영업소308-85-09085충청남도 계룡시 두마면 제1산단로 26-21042-719-9200두부 및 유사식품 제조업4종
8진모터스314-05-52800충청남도 계룡시 두마면 제1산단로 7-11042-542-4972자동차 종합 수리업4종
9대전우편집중국308-83-04427충청남도 계룡시 두마면 사계로 184 (대전우편집중국)042-602-1520정부기관 일반 보조 행정5종
사업장명사업자등록번호도로명소재지전화번호대표업종
12한국가스공사 대전충청지역본부305-82-08484충청남도 계룡시 엄사면 계룡대로 470042-229-3518가스 제조 및 배관공급업5종
13(주)티에스씨452-86-00224충청남도 계룡시 두마면 입암길 42-19042-825-5587계면활성제 제조업5종
14(주)메덱스550-87-01034충청남도 계룡시 두마면 제1산단로 52042-841-9220산업용 세탁업4종
15한국지엠 계룡대 서비스308-03-99596충청남도 계룡시 엄사면 연화동길 10042-841-1580자동차 종합 수리업5종
16명랑시대외식청년창업협동조합601-87-00459충청남도 계룡시 두마면 입암길 42-45042-482-0399빵류 제조업5종
17계룡시청308-83-03559충청남도 계룡시 장안로 46 계룡시청 (금암동)부동산업5종
18계룡웰빙클럽370-18-01499충청남도 계룡시 엄사면 번영11길 18-16욕탕업5종
19홈플러스(주) 계룡점220-81-60348충청남도 계룡시 계룡대로 304 (금암동)042-550-8000대형 종합 소매업5종
20신유승모터스179-03-02528충청남도 계룡시 두마면 왕대리 243<NA>자동차 종합 수리업5종
21계룡시 공공시설사업소(계룡문화예술의 전당)308-83-03559충청남도 계룡시 엄사면 유동리 295-1042-840-8816부동산업5종