Overview

Dataset statistics

Number of variables4
Number of observations82
Missing cells27
Missing cells (%)8.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.8 KiB
Average record size in memory34.6 B

Variable types

Numeric1
Text3

Dataset

Description안양시 동안구 관내 옥외광고업 (관내 옥외광고업소명, 관내 옥외 광고업 도로명, 관내 옥외광고전화번호)등록업소 현황 데이터 정보입니다.
Author경기도 안양시
URLhttps://www.data.go.kr/data/15055488/fileData.do

Alerts

영업장전화번호 has 27 (32.9%) missing valuesMissing
순번 has unique valuesUnique
업소명 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:58:11.739558
Analysis finished2023-12-12 09:58:12.354697
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct82
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean41.5
Minimum1
Maximum82
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size870.0 B
2023-12-12T18:58:12.462506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.05
Q121.25
median41.5
Q361.75
95-th percentile77.95
Maximum82
Range81
Interquartile range (IQR)40.5

Descriptive statistics

Standard deviation23.815261
Coefficient of variation (CV)0.57386172
Kurtosis-1.2
Mean41.5
Median Absolute Deviation (MAD)20.5
Skewness0
Sum3403
Variance567.16667
MonotonicityStrictly increasing
2023-12-12T18:58:12.644957image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.2%
63 1
 
1.2%
61 1
 
1.2%
60 1
 
1.2%
59 1
 
1.2%
58 1
 
1.2%
57 1
 
1.2%
56 1
 
1.2%
55 1
 
1.2%
54 1
 
1.2%
Other values (72) 72
87.8%
ValueCountFrequency (%)
1 1
1.2%
2 1
1.2%
3 1
1.2%
4 1
1.2%
5 1
1.2%
6 1
1.2%
7 1
1.2%
8 1
1.2%
9 1
1.2%
10 1
1.2%
ValueCountFrequency (%)
82 1
1.2%
81 1
1.2%
80 1
1.2%
79 1
1.2%
78 1
1.2%
77 1
1.2%
76 1
1.2%
75 1
1.2%
74 1
1.2%
73 1
1.2%

업소명
Text

UNIQUE 

Distinct82
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size788.0 B
2023-12-12T18:58:12.952190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length11
Median length9
Mean length5.8170732
Min length2

Characters and Unicode

Total characters477
Distinct characters149
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique82 ?
Unique (%)100.0%

Sample

1st row(주)엘림더드림팩토리
2nd row이룸디자인.스카이
3rd row태디(주)
4th row광고기획 앤비컴
5th row(주)안양광역신문사
ValueCountFrequency (%)
주식회사 5
 
5.6%
주)엘림더드림팩토리 1
 
1.1%
영광기획 1
 
1.1%
광고뱅크 1
 
1.1%
주)마블사인 1
 
1.1%
제일광고기획 1
 
1.1%
수디자인 1
 
1.1%
골드기업 1
 
1.1%
신성전광판 1
 
1.1%
하늘기획 1
 
1.1%
Other values (76) 76
84.4%
2023-12-12T18:58:13.415363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
27
 
5.7%
26
 
5.5%
24
 
5.0%
( 21
 
4.4%
) 21
 
4.4%
18
 
3.8%
18
 
3.8%
17
 
3.6%
16
 
3.4%
13
 
2.7%
Other values (139) 276
57.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 422
88.5%
Open Punctuation 21
 
4.4%
Close Punctuation 21
 
4.4%
Space Separator 8
 
1.7%
Uppercase Letter 3
 
0.6%
Other Punctuation 1
 
0.2%
Decimal Number 1
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
27
 
6.4%
26
 
6.2%
24
 
5.7%
18
 
4.3%
18
 
4.3%
17
 
4.0%
16
 
3.8%
13
 
3.1%
13
 
3.1%
13
 
3.1%
Other values (131) 237
56.2%
Uppercase Letter
ValueCountFrequency (%)
D 1
33.3%
E 1
33.3%
L 1
33.3%
Open Punctuation
ValueCountFrequency (%)
( 21
100.0%
Close Punctuation
ValueCountFrequency (%)
) 21
100.0%
Space Separator
ValueCountFrequency (%)
8
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 422
88.5%
Common 52
 
10.9%
Latin 3
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
27
 
6.4%
26
 
6.2%
24
 
5.7%
18
 
4.3%
18
 
4.3%
17
 
4.0%
16
 
3.8%
13
 
3.1%
13
 
3.1%
13
 
3.1%
Other values (131) 237
56.2%
Common
ValueCountFrequency (%)
( 21
40.4%
) 21
40.4%
8
 
15.4%
. 1
 
1.9%
3 1
 
1.9%
Latin
ValueCountFrequency (%)
D 1
33.3%
E 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 422
88.5%
ASCII 55
 
11.5%

Most frequent character per block

Hangul
ValueCountFrequency (%)
27
 
6.4%
26
 
6.2%
24
 
5.7%
18
 
4.3%
18
 
4.3%
17
 
4.0%
16
 
3.8%
13
 
3.1%
13
 
3.1%
13
 
3.1%
Other values (131) 237
56.2%
ASCII
ValueCountFrequency (%)
( 21
38.2%
) 21
38.2%
8
 
14.5%
. 1
 
1.8%
D 1
 
1.8%
E 1
 
1.8%
L 1
 
1.8%
3 1
 
1.8%
Distinct81
Distinct (%)98.8%
Missing0
Missing (%)0.0%
Memory size788.0 B
2023-12-12T18:58:13.752222image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length60
Median length46
Mean length34.97561
Min length23

Characters and Unicode

Total characters2868
Distinct characters136
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique80 ?
Unique (%)97.6%

Sample

1st row경기도 안양시 동안구 흥안대로427번길 57-2, 아이에스비즈타워 3층 309호 (평촌동)
2nd row경기도 안양시 동안구 엘에스로116번길 118, 안양2차 SK V1 center 지2층 비205호 (호계동)
3rd row경기도 안양시 동안구 시민대로 187, 안양건설타워 616호 (비산동)
4th row경기도 안양시 동안구 경수대로 601 (호계동)
5th row경기도 안양시 동안구 관악대로 399, 5층 (관양동, J&J빌딩)
ValueCountFrequency (%)
경기도 82
 
13.5%
동안구 82
 
13.5%
안양시 82
 
13.5%
호계동 33
 
5.4%
관양동 31
 
5.1%
비산동 12
 
2.0%
흥안대로 9
 
1.5%
관악대로 8
 
1.3%
평촌동 6
 
1.0%
경수대로 6
 
1.0%
Other values (189) 258
42.4%
2023-12-12T18:58:14.219979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
527
 
18.4%
184
 
6.4%
172
 
6.0%
121
 
4.2%
1 108
 
3.8%
92
 
3.2%
91
 
3.2%
83
 
2.9%
83
 
2.9%
83
 
2.9%
Other values (126) 1324
46.2%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1610
56.1%
Space Separator 527
 
18.4%
Decimal Number 456
 
15.9%
Close Punctuation 82
 
2.9%
Open Punctuation 82
 
2.9%
Other Punctuation 63
 
2.2%
Uppercase Letter 24
 
0.8%
Lowercase Letter 18
 
0.6%
Dash Punctuation 4
 
0.1%
Math Symbol 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
11.4%
172
 
10.7%
121
 
7.5%
92
 
5.7%
91
 
5.7%
83
 
5.2%
83
 
5.2%
83
 
5.2%
82
 
5.1%
73
 
4.5%
Other values (94) 546
33.9%
Decimal Number
ValueCountFrequency (%)
1 108
23.7%
2 70
15.4%
0 52
11.4%
3 45
9.9%
4 39
 
8.6%
5 36
 
7.9%
6 31
 
6.8%
8 26
 
5.7%
9 25
 
5.5%
7 24
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
B 5
20.8%
S 4
16.7%
K 4
16.7%
V 4
16.7%
J 2
 
8.3%
D 1
 
4.2%
F 1
 
4.2%
A 1
 
4.2%
I 1
 
4.2%
T 1
 
4.2%
Lowercase Letter
ValueCountFrequency (%)
e 6
33.3%
c 3
16.7%
n 3
16.7%
t 3
16.7%
r 3
16.7%
Other Punctuation
ValueCountFrequency (%)
, 62
98.4%
& 1
 
1.6%
Space Separator
ValueCountFrequency (%)
527
100.0%
Close Punctuation
ValueCountFrequency (%)
) 82
100.0%
Open Punctuation
ValueCountFrequency (%)
( 82
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%
Math Symbol
ValueCountFrequency (%)
~ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1610
56.1%
Common 1216
42.4%
Latin 42
 
1.5%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
11.4%
172
 
10.7%
121
 
7.5%
92
 
5.7%
91
 
5.7%
83
 
5.2%
83
 
5.2%
83
 
5.2%
82
 
5.1%
73
 
4.5%
Other values (94) 546
33.9%
Common
ValueCountFrequency (%)
527
43.3%
1 108
 
8.9%
) 82
 
6.7%
( 82
 
6.7%
2 70
 
5.8%
, 62
 
5.1%
0 52
 
4.3%
3 45
 
3.7%
4 39
 
3.2%
5 36
 
3.0%
Other values (7) 113
 
9.3%
Latin
ValueCountFrequency (%)
e 6
14.3%
B 5
11.9%
S 4
9.5%
K 4
9.5%
V 4
9.5%
c 3
7.1%
n 3
7.1%
t 3
7.1%
r 3
7.1%
J 2
 
4.8%
Other values (5) 5
11.9%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1610
56.1%
ASCII 1258
43.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
527
41.9%
1 108
 
8.6%
) 82
 
6.5%
( 82
 
6.5%
2 70
 
5.6%
, 62
 
4.9%
0 52
 
4.1%
3 45
 
3.6%
4 39
 
3.1%
5 36
 
2.9%
Other values (22) 155
 
12.3%
Hangul
ValueCountFrequency (%)
184
 
11.4%
172
 
10.7%
121
 
7.5%
92
 
5.7%
91
 
5.7%
83
 
5.2%
83
 
5.2%
83
 
5.2%
82
 
5.1%
73
 
4.5%
Other values (94) 546
33.9%

영업장전화번호
Text

MISSING 

Distinct55
Distinct (%)100.0%
Missing27
Missing (%)32.9%
Memory size788.0 B
2023-12-12T18:58:14.456905image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length12
Mean length12
Min length12

Characters and Unicode

Total characters660
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique55 ?
Unique (%)100.0%

Sample

1st row031-453-2547
2nd row031-386-8010
3rd row031-474-0024
4th row031-454-2149
5th row02-2263-0560
ValueCountFrequency (%)
031-453-2547 1
 
1.8%
031-476-3501 1
 
1.8%
031-424-2345 1
 
1.8%
031-444-3600 1
 
1.8%
031-423-5674 1
 
1.8%
031-458-7772 1
 
1.8%
032-209-7400 1
 
1.8%
031-443-9403 1
 
1.8%
031-476-0077 1
 
1.8%
031-455-4294 1
 
1.8%
Other values (45) 45
81.8%
2023-12-12T18:58:14.817506image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 110
16.7%
0 97
14.7%
3 87
13.2%
4 86
13.0%
1 75
11.4%
5 48
7.3%
7 40
 
6.1%
8 34
 
5.2%
2 32
 
4.8%
6 29
 
4.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 550
83.3%
Dash Punctuation 110
 
16.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 97
17.6%
3 87
15.8%
4 86
15.6%
1 75
13.6%
5 48
8.7%
7 40
7.3%
8 34
 
6.2%
2 32
 
5.8%
6 29
 
5.3%
9 22
 
4.0%
Dash Punctuation
ValueCountFrequency (%)
- 110
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 660
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 110
16.7%
0 97
14.7%
3 87
13.2%
4 86
13.0%
1 75
11.4%
5 48
7.3%
7 40
 
6.1%
8 34
 
5.2%
2 32
 
4.8%
6 29
 
4.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 660
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 110
16.7%
0 97
14.7%
3 87
13.2%
4 86
13.0%
1 75
11.4%
5 48
7.3%
7 40
 
6.1%
8 34
 
5.2%
2 32
 
4.8%
6 29
 
4.4%

Interactions

2023-12-12T18:58:12.060367image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:58:14.921162image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번업소명영업장도로명주소영업장전화번호
순번1.0001.0000.9351.000
업소명1.0001.0001.0001.000
영업장도로명주소0.9351.0001.0001.000
영업장전화번호1.0001.0001.0001.000

Missing values

2023-12-12T18:58:12.211539image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:58:12.311892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번업소명영업장도로명주소영업장전화번호
01(주)엘림더드림팩토리경기도 안양시 동안구 흥안대로427번길 57-2, 아이에스비즈타워 3층 309호 (평촌동)<NA>
12이룸디자인.스카이경기도 안양시 동안구 엘에스로116번길 118, 안양2차 SK V1 center 지2층 비205호 (호계동)<NA>
23태디(주)경기도 안양시 동안구 시민대로 187, 안양건설타워 616호 (비산동)<NA>
34광고기획 앤비컴경기도 안양시 동안구 경수대로 601 (호계동)031-453-2547
45(주)안양광역신문사경기도 안양시 동안구 관악대로 399, 5층 (관양동, J&J빌딩)031-386-8010
56(주)애드텍경기도 안양시 동안구 엘에스로 122, 호계 데시앙플렉스 315, 316호 (호계동)<NA>
67(주)제일광고기획경기도 안양시 동안구 시민대로393번길 15 (관양동)031-474-0024
78소망광고기획경기도 안양시 동안구 경수대로519번길 47, 102동 303호 (호계동)031-454-2149
89주식회사 티움코리아경기도 안양시 동안구 엘에스로 142, 호계 금정역 SK V1 center 2층 211호 (호계동)<NA>
910(주) 도시환경개발경기도 안양시 동안구 관악대로 253-1, 2층 (비산동)02-2263-0560
순번업소명영업장도로명주소영업장전화번호
7273관양종합공사경기도 안양시 동안구 관악대로360번길 34, 지층 1호 (관양동)031-386-0062
7374(주)애드트로닉경기도 안양시 동안구 흥안대로 415 (평촌동)031-478-5400
7475대림광고경기도 안양시 동안구 관악대로 167 (비산동)031-382-8959
7576드림광고경기도 안양시 동안구 귀인로 93 (호계동)031-452-2225
7677오주디자인경기도 안양시 동안구 관양로 111, 6층 (관양동)031-476-5992
7778(주)용문에이앤씨경기도 안양시 동안구 흥안대로 415, 102동 103호 (평촌동)031-478-5311
7879우정종합광고경기도 안양시 동안구 흥안대로133번길 34 (호계동, 현대상가1층)031-457-6569
7980케이에스아이(주)경기도 안양시 동안구 전파로 88, 701호 (호계동)031-478-8500
8081솔디자인경기도 안양시 동안구 경수대로 685 (호계동)031-455-0804
8182제일현수막경기도 안양시 동안구 흥안대로 457 (평촌동)031-421-5533