Overview

Dataset statistics

Number of variables3
Number of observations22
Missing cells1
Missing cells (%)1.5%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory660.0 B
Average record size in memory30.0 B

Variable types

Text3

Dataset

Description2023년 6월 13일 현재 대전광역시 환경전문공사업 현황으로 사업장명, 주소, 전화번호 등을 안내드리오니 업무에 참고해주시기 바랍니다.
URLhttps://www.data.go.kr/data/15063362/fileData.do

Alerts

전화번호 has 1 (4.5%) missing valuesMissing
사업장명 has unique valuesUnique
주소 has unique valuesUnique

Reproduction

Analysis started2023-12-12 10:36:41.931798
Analysis finished2023-12-12 10:36:42.268108
Duration0.34 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

사업장명
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-12T19:36:42.426707image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length9
Mean length7.3181818
Min length4

Characters and Unicode

Total characters161
Distinct characters76
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row(주)성지환경건설
2nd row한국수자원공사
3rd row(주)한국테크
4th row(주)엠아이텍
5th row(주)하나환경
ValueCountFrequency (%)
주)성지환경건설 1
 
4.3%
열린환경기술(주 1
 
4.3%
주식회사 1
 
4.3%
㈜금영이엔지 1
 
4.3%
㈜태창이앤테크 1
 
4.3%
㈜성광이엔에프 1
 
4.3%
㈜유니에코 1
 
4.3%
㈜하이젠 1
 
4.3%
주)새암이엔지 1
 
4.3%
주)스탠더드시험연구소 1
 
4.3%
Other values (13) 13
56.5%
2023-12-12T19:36:42.855692image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
12
 
7.5%
( 11
 
6.8%
) 11
 
6.8%
11
 
6.8%
7
 
4.3%
7
 
4.3%
5
 
3.1%
4
 
2.5%
4
 
2.5%
3
 
1.9%
Other values (66) 86
53.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 131
81.4%
Open Punctuation 11
 
6.8%
Close Punctuation 11
 
6.8%
Other Symbol 7
 
4.3%
Space Separator 1
 
0.6%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
 
9.2%
11
 
8.4%
7
 
5.3%
5
 
3.8%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (62) 76
58.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Other Symbol
ValueCountFrequency (%)
7
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 138
85.7%
Common 23
 
14.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
12
 
8.7%
11
 
8.0%
7
 
5.1%
7
 
5.1%
5
 
3.6%
4
 
2.9%
4
 
2.9%
3
 
2.2%
3
 
2.2%
3
 
2.2%
Other values (63) 79
57.2%
Common
ValueCountFrequency (%)
( 11
47.8%
) 11
47.8%
1
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 131
81.4%
ASCII 23
 
14.3%
None 7
 
4.3%

Most frequent character per block

Hangul
ValueCountFrequency (%)
12
 
9.2%
11
 
8.4%
7
 
5.3%
5
 
3.8%
4
 
3.1%
4
 
3.1%
3
 
2.3%
3
 
2.3%
3
 
2.3%
3
 
2.3%
Other values (62) 76
58.0%
ASCII
ValueCountFrequency (%)
( 11
47.8%
) 11
47.8%
1
 
4.3%
None
ValueCountFrequency (%)
7
100.0%

주소
Text

UNIQUE 

Distinct22
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size308.0 B
2023-12-12T19:36:43.169338image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length52
Median length29.5
Mean length27.545455
Min length14

Characters and Unicode

Total characters606
Distinct characters91
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique22 ?
Unique (%)100.0%

Sample

1st row대전 유성구 노은로 151
2nd row대전 대덕구 연축동 산6-2 / 신탄진로 220
3rd row대전 유성구 테크노8로 53-10, 3층(용산동)
4th row대전 서구 구봉산북로7번길 57, 2층 202호(관저동)
5th row대전 서구 갈마로 169(괴정동)
ValueCountFrequency (%)
대전 23
 
20.0%
유성구 11
 
9.6%
대덕구 6
 
5.2%
서구 4
 
3.5%
테크노2로 4
 
3.5%
대화로 2
 
1.7%
테크노8로 2
 
1.7%
덕암로222번길 1
 
0.9%
7(덕암동 1
 
0.9%
323-11(탑립동 1
 
0.9%
Other values (60) 60
52.2%
2023-12-12T19:36:43.646993image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
94
 
15.5%
39
 
6.4%
1 27
 
4.5%
25
 
4.1%
24
 
4.0%
2 23
 
3.8%
23
 
3.8%
22
 
3.6%
( 21
 
3.5%
) 21
 
3.5%
Other values (81) 287
47.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 318
52.5%
Decimal Number 130
21.5%
Space Separator 94
 
15.5%
Open Punctuation 21
 
3.5%
Close Punctuation 21
 
3.5%
Other Punctuation 14
 
2.3%
Dash Punctuation 7
 
1.2%
Connector Punctuation 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
39
 
12.3%
25
 
7.9%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.1%
13
 
4.1%
9
 
2.8%
8
 
2.5%
7
 
2.2%
Other values (64) 135
42.5%
Decimal Number
ValueCountFrequency (%)
1 27
20.8%
2 23
17.7%
3 17
13.1%
6 12
9.2%
5 12
9.2%
0 11
8.5%
4 9
 
6.9%
8 9
 
6.9%
7 7
 
5.4%
9 3
 
2.3%
Other Punctuation
ValueCountFrequency (%)
, 13
92.9%
/ 1
 
7.1%
Space Separator
ValueCountFrequency (%)
94
100.0%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 7
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 318
52.5%
Common 288
47.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
39
 
12.3%
25
 
7.9%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.1%
13
 
4.1%
9
 
2.8%
8
 
2.5%
7
 
2.2%
Other values (64) 135
42.5%
Common
ValueCountFrequency (%)
94
32.6%
1 27
 
9.4%
2 23
 
8.0%
( 21
 
7.3%
) 21
 
7.3%
3 17
 
5.9%
, 13
 
4.5%
6 12
 
4.2%
5 12
 
4.2%
0 11
 
3.8%
Other values (7) 37
 
12.8%

Most occurring blocks

ValueCountFrequency (%)
Hangul 318
52.5%
ASCII 288
47.5%

Most frequent character per block

ASCII
ValueCountFrequency (%)
94
32.6%
1 27
 
9.4%
2 23
 
8.0%
( 21
 
7.3%
) 21
 
7.3%
3 17
 
5.9%
, 13
 
4.5%
6 12
 
4.2%
5 12
 
4.2%
0 11
 
3.8%
Other values (7) 37
 
12.8%
Hangul
ValueCountFrequency (%)
39
 
12.3%
25
 
7.9%
24
 
7.5%
23
 
7.2%
22
 
6.9%
13
 
4.1%
13
 
4.1%
9
 
2.8%
8
 
2.5%
7
 
2.2%
Other values (64) 135
42.5%

전화번호
Text

MISSING 

Distinct21
Distinct (%)100.0%
Missing1
Missing (%)4.5%
Memory size308.0 B
2023-12-12T19:36:43.893428image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.095238
Min length12

Characters and Unicode

Total characters254
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique21 ?
Unique (%)100.0%

Sample

1st row042-476-4391
2nd row042-629-3331
3rd row042-634-1400
4th row042-931-3106
5th row042-526-2273
ValueCountFrequency (%)
042-476-4391 1
 
4.8%
042-632-2305 1
 
4.8%
042-824-5538 1
 
4.8%
042-630-4506 1
 
4.8%
042-824-2031 1
 
4.8%
042-380-8000 1
 
4.8%
042-670-4161 1
 
4.8%
042-867-6453 1
 
4.8%
042-525-0989 1
 
4.8%
042-933-2226 1
 
4.8%
Other values (11) 11
52.4%
2023-12-12T19:36:44.380869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 43
16.9%
- 42
16.5%
2 36
14.2%
4 34
13.4%
3 26
10.2%
6 18
7.1%
5 15
 
5.9%
7 12
 
4.7%
8 11
 
4.3%
1 9
 
3.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 212
83.5%
Dash Punctuation 42
 
16.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 43
20.3%
2 36
17.0%
4 34
16.0%
3 26
12.3%
6 18
8.5%
5 15
 
7.1%
7 12
 
5.7%
8 11
 
5.2%
1 9
 
4.2%
9 8
 
3.8%
Dash Punctuation
ValueCountFrequency (%)
- 42
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 254
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 43
16.9%
- 42
16.5%
2 36
14.2%
4 34
13.4%
3 26
10.2%
6 18
7.1%
5 15
 
5.9%
7 12
 
4.7%
8 11
 
4.3%
1 9
 
3.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 254
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 43
16.9%
- 42
16.5%
2 36
14.2%
4 34
13.4%
3 26
10.2%
6 18
7.1%
5 15
 
5.9%
7 12
 
4.7%
8 11
 
4.3%
1 9
 
3.5%

Correlations

2023-12-12T19:36:44.515827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
사업장명주소전화번호
사업장명1.0001.0001.000
주소1.0001.0001.000
전화번호1.0001.0001.000

Missing values

2023-12-12T19:36:42.142585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T19:36:42.231088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

사업장명주소전화번호
0(주)성지환경건설대전 유성구 노은로 151042-476-4391
1한국수자원공사대전 대덕구 연축동 산6-2 / 신탄진로 220042-629-3331
2(주)한국테크대전 유성구 테크노8로 53-10, 3층(용산동)042-634-1400
3(주)엠아이텍대전 서구 구봉산북로7번길 57, 2층 202호(관저동)042-931-3106
4(주)하나환경대전 서구 갈마로 169(괴정동)042-526-2273
5계룡건설산업(주)대전 서구 문정로48번길 48(탄방동)070-4470-7433
6㈜엔바이온대전 유성구 테크노2로 275(탑립동)042-863-8675
7이화엔지니어링대전 동구 한밭대로 1322,3층 (용전동)042-621-0348
8강력에이엔씨대전 서구 도산로 450, 5층 501-2호(용문동, (구)한국방송광고공사 대전 지사 사옥)042-672-2834
9(주)부강테크대전 유성구 유성대로1184번길 25(신성동)070-5050-5555
사업장명주소전화번호
12열린환경기술(주)대전 대덕구 덕암로222번길 7(덕암동)042-933-2226
13(주)정진하이테크대전 유성구 테크노2로 323-11(탑립동)042-525-0989
14(주)스탠더드시험연구소대전 유성구 가정로 168, 신관4층 (가정동)042-867-6453
15(주)새암이엔지대전 대덕구 대화로 160, 대전산업용재유통상가 17동306호(대화동)042-670-4161
16㈜하이젠대전 유성구 테크노2로 167-12(용산동)042-380-8000
17㈜유니에코대전 유성구 대학로 28, 4층 516호(봉명동, 홍인오피스텔)042-824-2031
18㈜성광이엔에프대전 유성구 테크노8로 53(용산동)042-630-4506
19㈜태창이앤테크대전 대덕구 대화로 106번길 66, 1131호(펜타플렉스)<NA>
20㈜금영이엔지대전 유성구 엑스포로 385, 본관동 1층, 3층(문지동)042-824-5538
21주식회사 칸필터대전 유성구 테크노2로 309-7(탑립동)042-349-0036