Overview

Dataset statistics

Number of variables6
Number of observations110
Missing cells9
Missing cells (%)1.4%
Duplicate rows1
Duplicate rows (%)0.9%
Total size in memory5.4 KiB
Average record size in memory50.2 B

Variable types

Text4
Categorical1
DateTime1

Dataset

Description1. 충청남도 홍성군에서 제공하는 대기배출시설 사업장 현황으로, 업체명, 도로명주소, 전화번호, 업종, 종별, 데이터기준일자로 구성되어 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=339&beforeMenuCd=DOM_000000201001001000&publicdatapk=15083590

Alerts

데이터기준일자 has constant value ""Constant
Dataset has 1 (0.9%) duplicate rowsDuplicates
전화번호 has 9 (8.2%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:31:20.182321
Analysis finished2024-01-09 20:31:20.815411
Duration0.63 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct108
Distinct (%)98.2%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-01-10T05:31:20.953133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length14
Mean length8.2363636
Min length4

Characters and Unicode

Total characters906
Distinct characters183
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)96.4%

Sample

1st row한국지엠홍성서비스센터(합)
2nd row한국자동차공업㈜
3rd row금마농협미곡종합처리장
4th row조운정미영농조합법인
5th row갈산토기
ValueCountFrequency (%)
주식회사 6
 
4.3%
원강금속㈜ 2
 
1.4%
공업사 2
 
1.4%
홍동농협 2
 
1.4%
농업회사법인 2
 
1.4%
㈜거흥산업 2
 
1.4%
건강한사람들㈜ 2
 
1.4%
홍성공장 2
 
1.4%
한국가스공사 2
 
1.4%
홍성 2
 
1.4%
Other values (115) 117
83.0%
2024-01-10T05:31:21.279823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
64
 
7.1%
38
 
4.2%
33
 
3.6%
31
 
3.4%
21
 
2.3%
21
 
2.3%
21
 
2.3%
20
 
2.2%
20
 
2.2%
16
 
1.8%
Other values (173) 621
68.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 798
88.1%
Other Symbol 64
 
7.1%
Space Separator 31
 
3.4%
Close Punctuation 4
 
0.4%
Open Punctuation 4
 
0.4%
Decimal Number 4
 
0.4%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
4.8%
33
 
4.1%
21
 
2.6%
21
 
2.6%
21
 
2.6%
20
 
2.5%
20
 
2.5%
16
 
2.0%
16
 
2.0%
15
 
1.9%
Other values (166) 577
72.3%
Decimal Number
ValueCountFrequency (%)
2 2
50.0%
1 2
50.0%
Other Symbol
ValueCountFrequency (%)
64
100.0%
Space Separator
ValueCountFrequency (%)
31
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 862
95.1%
Common 44
 
4.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
64
 
7.4%
38
 
4.4%
33
 
3.8%
21
 
2.4%
21
 
2.4%
21
 
2.4%
20
 
2.3%
20
 
2.3%
16
 
1.9%
16
 
1.9%
Other values (167) 592
68.7%
Common
ValueCountFrequency (%)
31
70.5%
) 4
 
9.1%
( 4
 
9.1%
2 2
 
4.5%
1 2
 
4.5%
, 1
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 798
88.1%
None 64
 
7.1%
ASCII 44
 
4.9%

Most frequent character per block

None
ValueCountFrequency (%)
64
100.0%
Hangul
ValueCountFrequency (%)
38
 
4.8%
33
 
4.1%
21
 
2.6%
21
 
2.6%
21
 
2.6%
20
 
2.5%
20
 
2.5%
16
 
2.0%
16
 
2.0%
15
 
1.9%
Other values (166) 577
72.3%
ASCII
ValueCountFrequency (%)
31
70.5%
) 4
 
9.1%
( 4
 
9.1%
2 2
 
4.5%
1 2
 
4.5%
, 1
 
2.3%
Distinct105
Distinct (%)95.5%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-01-10T05:31:21.568653image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length30
Median length28
Mean length23.409091
Min length18

Characters and Unicode

Total characters2575
Distinct characters67
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique100 ?
Unique (%)90.9%

Sample

1st row충청남도 홍성군 홍성읍 충서로1216번길 6-34
2nd row충청남도 홍성군 홍성읍 충서로1216번길 6-6
3rd row충청남도 홍성군 금마면 광금북로 425
4th row충청남도 홍성군 결성면 홍남서로 860
5th row충청남도 홍성군 갈산면 갈산서길475번길 128
ValueCountFrequency (%)
충청남도 110
20.0%
홍성군 110
20.0%
갈산면 23
 
4.2%
결성면 14
 
2.5%
금마면 14
 
2.5%
홍성읍 13
 
2.4%
충서로 10
 
1.8%
광천읍 10
 
1.8%
내포로 9
 
1.6%
은하면 9
 
1.6%
Other values (164) 228
41.5%
2024-01-10T05:31:21.986709image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
449
17.4%
153
 
5.9%
140
 
5.4%
126
 
4.9%
125
 
4.9%
1 120
 
4.7%
110
 
4.3%
110
 
4.3%
110
 
4.3%
97
 
3.8%
Other values (57) 1035
40.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1555
60.4%
Decimal Number 530
 
20.6%
Space Separator 449
 
17.4%
Dash Punctuation 41
 
1.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
153
 
9.8%
140
 
9.0%
126
 
8.1%
125
 
8.0%
110
 
7.1%
110
 
7.1%
110
 
7.1%
97
 
6.2%
80
 
5.1%
60
 
3.9%
Other values (45) 444
28.6%
Decimal Number
ValueCountFrequency (%)
1 120
22.6%
2 59
11.1%
3 56
10.6%
4 54
10.2%
6 49
9.2%
5 43
 
8.1%
7 41
 
7.7%
8 40
 
7.5%
0 39
 
7.4%
9 29
 
5.5%
Space Separator
ValueCountFrequency (%)
449
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1555
60.4%
Common 1020
39.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
153
 
9.8%
140
 
9.0%
126
 
8.1%
125
 
8.0%
110
 
7.1%
110
 
7.1%
110
 
7.1%
97
 
6.2%
80
 
5.1%
60
 
3.9%
Other values (45) 444
28.6%
Common
ValueCountFrequency (%)
449
44.0%
1 120
 
11.8%
2 59
 
5.8%
3 56
 
5.5%
4 54
 
5.3%
6 49
 
4.8%
5 43
 
4.2%
7 41
 
4.0%
- 41
 
4.0%
8 40
 
3.9%
Other values (2) 68
 
6.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1555
60.4%
ASCII 1020
39.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
449
44.0%
1 120
 
11.8%
2 59
 
5.8%
3 56
 
5.5%
4 54
 
5.3%
6 49
 
4.8%
5 43
 
4.2%
7 41
 
4.0%
- 41
 
4.0%
8 40
 
3.9%
Other values (2) 68
 
6.7%
Hangul
ValueCountFrequency (%)
153
 
9.8%
140
 
9.0%
126
 
8.1%
125
 
8.0%
110
 
7.1%
110
 
7.1%
110
 
7.1%
97
 
6.2%
80
 
5.1%
60
 
3.9%
Other values (45) 444
28.6%

전화번호
Text

MISSING 

Distinct95
Distinct (%)94.1%
Missing9
Missing (%)8.2%
Memory size1012.0 B
2024-01-10T05:31:22.221192image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length26
Median length12
Mean length12.237624
Min length12

Characters and Unicode

Total characters1236
Distinct characters14
Distinct categories5 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique89 ?
Unique (%)88.1%

Sample

1st row041-634-8255
2nd row041-634-8077
3rd row041-634-9347
4th row041-642-5767
5th row041-633-1711
ValueCountFrequency (%)
041-642-0878 2
 
2.0%
041-632-3820 2
 
2.0%
041-630-8000 2
 
2.0%
041-642-0101 2
 
2.0%
041-642-7611 2
 
2.0%
041-633-1977 2
 
2.0%
041-640-0001 1
 
1.0%
041-642-9110 1
 
1.0%
041-631-0355 1
 
1.0%
041-631-5031 1
 
1.0%
Other values (86) 86
84.3%
2024-01-10T05:31:22.574256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 204
16.5%
0 187
15.1%
4 173
14.0%
1 163
13.2%
6 139
11.2%
3 137
11.1%
2 68
 
5.5%
7 44
 
3.6%
9 41
 
3.3%
5 38
 
3.1%
Other values (4) 42
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1025
82.9%
Dash Punctuation 204
 
16.5%
Math Symbol 5
 
0.4%
Other Punctuation 1
 
0.1%
Space Separator 1
 
0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 187
18.2%
4 173
16.9%
1 163
15.9%
6 139
13.6%
3 137
13.4%
2 68
 
6.6%
7 44
 
4.3%
9 41
 
4.0%
5 38
 
3.7%
8 35
 
3.4%
Dash Punctuation
ValueCountFrequency (%)
- 204
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Punctuation
ValueCountFrequency (%)
, 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 1236
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 204
16.5%
0 187
15.1%
4 173
14.0%
1 163
13.2%
6 139
11.2%
3 137
11.1%
2 68
 
5.5%
7 44
 
3.6%
9 41
 
3.3%
5 38
 
3.1%
Other values (4) 42
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1236
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 204
16.5%
0 187
15.1%
4 173
14.0%
1 163
13.2%
6 139
11.2%
3 137
11.1%
2 68
 
5.5%
7 44
 
3.6%
9 41
 
3.3%
5 38
 
3.1%
Other values (4) 42
 
3.4%

업종
Text

Distinct64
Distinct (%)58.2%
Missing0
Missing (%)0.0%
Memory size1012.0 B
2024-01-10T05:31:22.758994image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length64
Median length27
Mean length11.045455
Min length2

Characters and Unicode

Total characters1215
Distinct characters172
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique46 ?
Unique (%)41.8%

Sample

1st row자동차종합수리업
2nd row자동차종합수리업
3rd row곡물도정업
4th row곡물도정업
5th row가정용및장식용도자기제조업
ValueCountFrequency (%)
자동차종합수리업 15
 
13.6%
곡물도정업 11
 
10.0%
레미콘제조업 6
 
5.5%
그외기타자동차부품제조업 3
 
2.7%
그외기타분류안된비금속광물제품제조업 3
 
2.7%
그외기타금속가공업 2
 
1.8%
기타비료및질소화합물제조업 2
 
1.8%
건설폐기물처리업 2
 
1.8%
축산분뇨처리업 2
 
1.8%
연료용가스제조및배관공급업 2
 
1.8%
Other values (54) 62
56.4%
2024-01-10T05:31:23.113783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
115
 
9.5%
89
 
7.3%
85
 
7.0%
37
 
3.0%
34
 
2.8%
34
 
2.8%
29
 
2.4%
28
 
2.3%
25
 
2.1%
25
 
2.1%
Other values (162) 714
58.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1179
97.0%
Decimal Number 16
 
1.3%
Math Symbol 13
 
1.1%
Close Punctuation 3
 
0.2%
Open Punctuation 3
 
0.2%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
115
 
9.8%
89
 
7.5%
85
 
7.2%
37
 
3.1%
34
 
2.9%
34
 
2.9%
29
 
2.5%
28
 
2.4%
25
 
2.1%
25
 
2.1%
Other values (152) 678
57.5%
Decimal Number
ValueCountFrequency (%)
9 5
31.2%
3 3
18.8%
1 3
18.8%
2 2
 
12.5%
0 2
 
12.5%
4 1
 
6.2%
Math Symbol
ValueCountFrequency (%)
+ 13
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1179
97.0%
Common 36
 
3.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
115
 
9.8%
89
 
7.5%
85
 
7.2%
37
 
3.1%
34
 
2.9%
34
 
2.9%
29
 
2.5%
28
 
2.4%
25
 
2.1%
25
 
2.1%
Other values (152) 678
57.5%
Common
ValueCountFrequency (%)
+ 13
36.1%
9 5
 
13.9%
3 3
 
8.3%
) 3
 
8.3%
( 3
 
8.3%
1 3
 
8.3%
2 2
 
5.6%
0 2
 
5.6%
/ 1
 
2.8%
4 1
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1179
97.0%
ASCII 36
 
3.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
115
 
9.8%
89
 
7.5%
85
 
7.2%
37
 
3.1%
34
 
2.9%
34
 
2.9%
29
 
2.5%
28
 
2.4%
25
 
2.1%
25
 
2.1%
Other values (152) 678
57.5%
ASCII
ValueCountFrequency (%)
+ 13
36.1%
9 5
 
13.9%
3 3
 
8.3%
) 3
 
8.3%
( 3
 
8.3%
1 3
 
8.3%
2 2
 
5.6%
0 2
 
5.6%
/ 1
 
2.8%
4 1
 
2.8%

종별
Categorical

Distinct3
Distinct (%)2.7%
Missing0
Missing (%)0.0%
Memory size1012.0 B
5
52 
4
51 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row4
2nd row4
3rd row5
4th row5
5th row5

Common Values

ValueCountFrequency (%)
5 52
47.3%
4 51
46.4%
3 7
 
6.4%

Length

2024-01-10T05:31:23.237306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:31:23.327001image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
5 52
47.3%
4 51
46.4%
3 7
 
6.4%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.9%
Missing0
Missing (%)0.0%
Memory size1012.0 B
Minimum2023-07-27 00:00:00
Maximum2023-07-27 00:00:00
2024-01-10T05:31:23.405338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:31:23.529739image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Correlations

2024-01-10T05:31:23.615843image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
전화번호업종종별
전화번호1.0000.9970.552
업종0.9971.0000.861
종별0.5520.8611.000

Missing values

2024-01-10T05:31:20.697975image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-01-10T05:31:20.780423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업체명도로명주소전화번호업종종별데이터기준일자
0한국지엠홍성서비스센터(합)충청남도 홍성군 홍성읍 충서로1216번길 6-34041-634-8255자동차종합수리업42023-07-27
1한국자동차공업㈜충청남도 홍성군 홍성읍 충서로1216번길 6-6041-634-8077자동차종합수리업42023-07-27
2금마농협미곡종합처리장충청남도 홍성군 금마면 광금북로 425041-634-9347곡물도정업52023-07-27
3조운정미영농조합법인충청남도 홍성군 결성면 홍남서로 860041-642-5767곡물도정업52023-07-27
4갈산토기충청남도 홍성군 갈산면 갈산서길475번길 128041-633-1711가정용및장식용도자기제조업52023-07-27
5갈산1급 자동차 공업사충청남도 홍성군 갈산면 와룡로 440041-634-0111자동차종합수리업52023-07-27
6성촌토기충청남도 홍성군 갈산면 갈산서길475번길 146041-634-9999가정용및장식용도자기제조업52023-07-27
7세일전장 주식회사충청남도 홍성군 은하면 구성남로 386041-630-2700그외기타자동차부품제조업52023-07-27
8천수만미곡종합처리장충청남도 홍성군 서부면 서부로 487041-633-4954곡물도정업42023-07-27
9㈜대한철강충청남도 홍성군 구항면 충서로966번길 41-15041-633-6000플라스틱필름+시트및판제조업+구조용금속판및금속공작물제조업52023-07-27
업체명도로명주소전화번호업종종별데이터기준일자
100㈜삼일엘리베이터충청남도 홍성군 홍북읍 첨단산단3길 32031-366-3850승강기제조업52023-07-27
101대전지방법원, 대전가정법원 홍성지원충청남도 홍성군 홍성읍 법원로 38041-640-3284법원52023-07-27
102농업회사법인 주식회사 대산충청남도 홍성군 금마면 금마로 107-33041-634-9637농산물창고업/채소류+서류및향신작물류도매업42023-07-27
103롯데쇼핑㈜ 롯데마트 홍성점충청남도 홍성군 홍성읍 조양로247번길 9041-339-2580대형마트52023-07-27
104농산개발㈜충청남도 홍성군 갈산면 수덕사로317번길 137-42041-633-1977건설용석재채굴및쇄석생산업42023-07-27
105농업회사법인내포농산㈜충청남도 홍성군 금마면 금마로 411041-633-7167곡물도정업42023-07-27
106한국가스공사 중부안전건설단충청남도 홍성군 홍북읍 첨단산단5길 105041-634-9625연료용가스제조및배관공급업52023-07-27
107㈜벽산 홍성공장충청남도 홍성군 갈산면 산단로388번길 100041-632-5060플라스틱발포성형제품제조업42023-07-27
108농업회사법인(주)성우충청남도 홍성군 결성면 홍남서로886번길 230041-523-4900축산분뇨처리업42023-07-27
109더블유아이케이중부 주식회사충청남도 홍성군 은하면 천광로 856-29041-642-0101건설폐기물처리업52023-07-27

Duplicate rows

Most frequently occurring

업체명도로명주소전화번호업종종별데이터기준일자# duplicates
0더블유아이케이중부 주식회사충청남도 홍성군 은하면 천광로 856-29041-642-0101건설폐기물처리업52023-07-272