Overview

Dataset statistics

Number of variables5
Number of observations285
Missing cells6
Missing cells (%)0.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory11.3 KiB
Average record size in memory40.5 B

Variable types

Categorical1
Text3
DateTime1

Dataset

Description충청북도 내 층략업체에 대한 데이터로 업종, 업체명, 주소, 전화번호, 데이터 기준일자에 대한 데이터를 제공합니다.
URLhttps://www.data.go.kr/data/3075673/fileData.do

Alerts

데이터기준일자 has constant value ""Constant
전화번호 has 6 (2.1%) missing valuesMissing

Reproduction

Analysis started2023-12-12 19:40:07.849983
Analysis finished2023-12-12 19:40:08.401947
Duration0.55 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

업종
Categorical

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
일반측량
214 
공공측량
54 
지적측량
 
17

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row지적측량
2nd row지적측량
3rd row지적측량
4th row지적측량
5th row지적측량

Common Values

ValueCountFrequency (%)
일반측량 214
75.1%
공공측량 54
 
18.9%
지적측량 17
 
6.0%

Length

2023-12-13T04:40:08.478778image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T04:40:08.609712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
일반측량 214
75.1%
공공측량 54
 
18.9%
지적측량 17
 
6.0%
Distinct272
Distinct (%)95.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023-12-13T04:40:08.848124image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length15
Mean length8.6666667
Min length2

Characters and Unicode

Total characters2470
Distinct characters185
Distinct categories6 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique259 ?
Unique (%)90.9%

Sample

1st row(주)충청
2nd row주식회사 청강
3rd row(주)대한지적기술단
4th row대성공간정보(주)
5th row태화기술단 주식회사
ValueCountFrequency (%)
주식회사 72
 
20.0%
주)삼일이앤씨 2
 
0.6%
두산랜드매니지먼트 2
 
0.6%
중앙측량설계공사 2
 
0.6%
대한측량설계공사 2
 
0.6%
현대측량설계공사 2
 
0.6%
두리측량설계 2
 
0.6%
주)부광기술공사 2
 
0.6%
주)대명이엔지 2
 
0.6%
주)진우엔지니어링 2
 
0.6%
Other values (266) 270
75.0%
2023-12-13T04:40:09.362854image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
184
 
7.4%
147
 
6.0%
138
 
5.6%
138
 
5.6%
119
 
4.8%
111
 
4.5%
79
 
3.2%
78
 
3.2%
73
 
3.0%
73
 
3.0%
Other values (175) 1330
53.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2234
90.4%
Space Separator 79
 
3.2%
Open Punctuation 73
 
3.0%
Close Punctuation 73
 
3.0%
Uppercase Letter 8
 
0.3%
Other Symbol 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
8.2%
147
 
6.6%
138
 
6.2%
138
 
6.2%
119
 
5.3%
111
 
5.0%
78
 
3.5%
73
 
3.3%
73
 
3.3%
68
 
3.0%
Other values (165) 1105
49.5%
Uppercase Letter
ValueCountFrequency (%)
N 2
25.0%
G 2
25.0%
H 1
12.5%
J 1
12.5%
E 1
12.5%
I 1
12.5%
Space Separator
ValueCountFrequency (%)
79
100.0%
Open Punctuation
ValueCountFrequency (%)
( 73
100.0%
Close Punctuation
ValueCountFrequency (%)
) 73
100.0%
Other Symbol
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2237
90.6%
Common 225
 
9.1%
Latin 8
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
8.2%
147
 
6.6%
138
 
6.2%
138
 
6.2%
119
 
5.3%
111
 
5.0%
78
 
3.5%
73
 
3.3%
73
 
3.3%
68
 
3.0%
Other values (166) 1108
49.5%
Latin
ValueCountFrequency (%)
N 2
25.0%
G 2
25.0%
H 1
12.5%
J 1
12.5%
E 1
12.5%
I 1
12.5%
Common
ValueCountFrequency (%)
79
35.1%
( 73
32.4%
) 73
32.4%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2234
90.4%
ASCII 233
 
9.4%
None 3
 
0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
184
 
8.2%
147
 
6.6%
138
 
6.2%
138
 
6.2%
119
 
5.3%
111
 
5.0%
78
 
3.5%
73
 
3.3%
73
 
3.3%
68
 
3.0%
Other values (165) 1105
49.5%
ASCII
ValueCountFrequency (%)
79
33.9%
( 73
31.3%
) 73
31.3%
N 2
 
0.9%
G 2
 
0.9%
H 1
 
0.4%
J 1
 
0.4%
E 1
 
0.4%
I 1
 
0.4%
None
ValueCountFrequency (%)
3
100.0%

주소
Text

Distinct277
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
2023-12-13T04:40:09.725104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length53
Median length41
Mean length28.308772
Min length17

Characters and Unicode

Total characters8068
Distinct characters210
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique269 ?
Unique (%)94.4%

Sample

1st row충청북도 청주시 상당구 대성로106번길 18(문화동)
2nd row충청북도 청주시 상당구 대성로106번길 18, 2층(문화동)
3rd row충청북도 청주시 상당구 중흥로187번길 4, 5층(용암동)
4th row충청북도 청주시 서원구 남이면 청남로 1231-6
5th row충청북도 청주시 서원구 신성화로 49, 505호(성화동)
ValueCountFrequency (%)
충청북도 285
 
16.9%
청주시 94
 
5.6%
충주시 45
 
2.7%
2층 42
 
2.5%
음성군 42
 
2.5%
40
 
2.4%
음성읍 32
 
1.9%
흥덕구 30
 
1.8%
청원구 28
 
1.7%
제천시 27
 
1.6%
Other values (501) 1025
60.7%
2023-12-13T04:40:10.308910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1790
22.2%
430
 
5.3%
332
 
4.1%
1 299
 
3.7%
290
 
3.6%
285
 
3.5%
235
 
2.9%
2 234
 
2.9%
188
 
2.3%
175
 
2.2%
Other values (200) 3810
47.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4507
55.9%
Space Separator 1790
 
22.2%
Decimal Number 1233
 
15.3%
Other Punctuation 164
 
2.0%
Close Punctuation 155
 
1.9%
Open Punctuation 155
 
1.9%
Dash Punctuation 56
 
0.7%
Uppercase Letter 8
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
430
 
9.5%
332
 
7.4%
290
 
6.4%
285
 
6.3%
235
 
5.2%
188
 
4.2%
175
 
3.9%
148
 
3.3%
124
 
2.8%
123
 
2.7%
Other values (181) 2177
48.3%
Decimal Number
ValueCountFrequency (%)
1 299
24.2%
2 234
19.0%
3 136
11.0%
0 129
10.5%
4 104
 
8.4%
5 79
 
6.4%
8 72
 
5.8%
6 70
 
5.7%
7 58
 
4.7%
9 52
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
K 3
37.5%
T 3
37.5%
F 1
 
12.5%
S 1
 
12.5%
Space Separator
ValueCountFrequency (%)
1790
100.0%
Other Punctuation
ValueCountFrequency (%)
, 164
100.0%
Close Punctuation
ValueCountFrequency (%)
) 155
100.0%
Open Punctuation
ValueCountFrequency (%)
( 155
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4507
55.9%
Common 3553
44.0%
Latin 8
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
430
 
9.5%
332
 
7.4%
290
 
6.4%
285
 
6.3%
235
 
5.2%
188
 
4.2%
175
 
3.9%
148
 
3.3%
124
 
2.8%
123
 
2.7%
Other values (181) 2177
48.3%
Common
ValueCountFrequency (%)
1790
50.4%
1 299
 
8.4%
2 234
 
6.6%
, 164
 
4.6%
) 155
 
4.4%
( 155
 
4.4%
3 136
 
3.8%
0 129
 
3.6%
4 104
 
2.9%
5 79
 
2.2%
Other values (5) 308
 
8.7%
Latin
ValueCountFrequency (%)
K 3
37.5%
T 3
37.5%
F 1
 
12.5%
S 1
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4507
55.9%
ASCII 3561
44.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1790
50.3%
1 299
 
8.4%
2 234
 
6.6%
, 164
 
4.6%
) 155
 
4.4%
( 155
 
4.4%
3 136
 
3.8%
0 129
 
3.6%
4 104
 
2.9%
5 79
 
2.2%
Other values (9) 316
 
8.9%
Hangul
ValueCountFrequency (%)
430
 
9.5%
332
 
7.4%
290
 
6.4%
285
 
6.3%
235
 
5.2%
188
 
4.2%
175
 
3.9%
148
 
3.3%
124
 
2.8%
123
 
2.7%
Other values (181) 2177
48.3%

전화번호
Text

MISSING 

Distinct271
Distinct (%)97.1%
Missing6
Missing (%)2.1%
Memory size2.4 KiB
2023-12-13T04:40:10.614274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length13
Median length12
Mean length12.032258
Min length12

Characters and Unicode

Total characters3357
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique263 ?
Unique (%)94.3%

Sample

1st row043-253-8800
2nd row043-260-2020
3rd row043-216-4224
4th row043-715-8899
5th row043-873-0811
ValueCountFrequency (%)
070-4216-0976 2
 
0.7%
043-210-5345 2
 
0.7%
043-652-0038 2
 
0.7%
043-235-6435 2
 
0.7%
043-423-0003 2
 
0.7%
043-216-4224 2
 
0.7%
043-236-7801 2
 
0.7%
043-872-4353 2
 
0.7%
043-652-5500 1
 
0.4%
043-920-8881 1
 
0.4%
Other values (261) 261
93.5%
2023-12-13T04:40:11.137554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 558
16.6%
0 508
15.1%
3 503
15.0%
4 457
13.6%
2 285
8.5%
8 246
7.3%
5 207
 
6.2%
7 188
 
5.6%
1 173
 
5.2%
6 142
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2799
83.4%
Dash Punctuation 558
 
16.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 508
18.1%
3 503
18.0%
4 457
16.3%
2 285
10.2%
8 246
8.8%
5 207
7.4%
7 188
 
6.7%
1 173
 
6.2%
6 142
 
5.1%
9 90
 
3.2%
Dash Punctuation
ValueCountFrequency (%)
- 558
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3357
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 558
16.6%
0 508
15.1%
3 503
15.0%
4 457
13.6%
2 285
8.5%
8 246
7.3%
5 207
 
6.2%
7 188
 
5.6%
1 173
 
5.2%
6 142
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3357
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 558
16.6%
0 508
15.1%
3 503
15.0%
4 457
13.6%
2 285
8.5%
8 246
7.3%
5 207
 
6.2%
7 188
 
5.6%
1 173
 
5.2%
6 142
 
4.2%

데이터기준일자
Date

CONSTANT 

Distinct1
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
Minimum2023-07-01 00:00:00
Maximum2023-07-01 00:00:00
2023-12-13T04:40:11.292681image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T04:40:11.414370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=1)

Missing values

2023-12-13T04:40:08.228789image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T04:40:08.355199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

업종업체명주소전화번호데이터기준일자
0지적측량(주)충청충청북도 청주시 상당구 대성로106번길 18(문화동)043-253-88002023-07-01
1지적측량주식회사 청강충청북도 청주시 상당구 대성로106번길 18, 2층(문화동)043-260-20202023-07-01
2지적측량(주)대한지적기술단충청북도 청주시 상당구 중흥로187번길 4, 5층(용암동)043-216-42242023-07-01
3지적측량대성공간정보(주)충청북도 청주시 서원구 남이면 청남로 1231-6043-715-88992023-07-01
4지적측량태화기술단 주식회사충청북도 청주시 서원구 신성화로 49, 505호(성화동)043-873-08112023-07-01
5지적측량한맥엔지니어링㈜충청북도 청주시 서원구 신성화로46번길 6-8, 301호(성화동)043-257-65002023-07-01
6지적측량건화산업개발 주식회사충청북도 청주시 청원구 율량로 42, 4층(주중동)070-4044-18992023-07-01
7지적측량청우토지정보 주식회사충청북도 청주시 청원구 중앙로 134, 3층(우암동)043-901-98892023-07-01
8지적측량새한항업(주)충청북도 청주시 흥덕구 직지대로 530, 1동 614호(송정동, 청주테크노에스타워)043-222-67922023-07-01
9지적측량(주)현대지적측량충청북도 청주시 흥덕구 1순환로 388, 2층 (신봉동)043-268-00042023-07-01
업종업체명주소전화번호데이터기준일자
275일반측량라오측량설계공사충청북도 단양군 단양읍 별곡1로 28 , 2층043-421-15722023-07-01
276일반측량주식회사 덕림충청북도 단양군 단양읍 별곡1로 29 , 2층043-421-33632023-07-01
277일반측량가장측량공사충청북도 단양군 단양읍 별곡4길 14, 1층043-422-27002023-07-01
278일반측량세명측량설계사무소충청북도 단양군 단양읍 삼봉로 325, 2층043-423-90812023-07-01
279일반측량주식회사 유건기술단충청북도 단양군 단양읍 삼봉로 325-1, 2층043-421-55802023-07-01
280일반측량(주)대명이엔지충청북도 단양군 단양읍 중앙1로 23043-423-00032023-07-01
281일반측량(주)한길충청북도 단양군 단양읍 중앙1로 23, KT단양빌딩043-421-55082023-07-01
282일반측량단양측량설계공사충청북도 단양군 단양읍 중앙1로 42 , 2층043-421-60012023-07-01
283일반측량대한측량설계공사충청북도 단양군 단양읍 중앙2로 1043-423-42242023-07-01
284일반측량대교토목측량설계공사충청북도 단양군 단양읍 중앙2로 2 , 2층(보림빌딩)043-423-56652023-07-01