Overview

Dataset statistics

Number of variables3
Number of observations314
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory7.5 KiB
Average record size in memory24.4 B

Variable types

Text2
DateTime1

Dataset

Description제공신청에 의한 데이터로 이러닝사업자 신고확인서 전국 소재지 발급현황의 회사명,소재지, 확인서_발급일을 제공합니다.
Author산업통상자원부
URLhttps://www.data.go.kr/data/15099822/fileData.do

Reproduction

Analysis started2023-12-12 16:17:22.722922
Analysis finished2023-12-12 16:17:23.188460
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct271
Distinct (%)86.3%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-13T01:17:23.404688image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length13
Mean length8.6847134
Min length2

Characters and Unicode

Total characters2727
Distinct characters308
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique236 ?
Unique (%)75.2%

Sample

1st row주식회사 스킬서포트
2nd row㈜한국디지털페이먼츠
3rd row㈜브랜드콘텐츠
4th row주식회사 윈즈데이
5th row씨아이씨소프트 주식회사
ValueCountFrequency (%)
주식회사 172
33.7%
12
 
2.3%
이음컨텐츠 4
 
0.8%
에프앤이노에듀 4
 
0.8%
오베네프 3
 
0.6%
㈜브랜드콘텐츠 3
 
0.6%
㈜인더스트리미디어 3
 
0.6%
㈜스톰미디어 3
 
0.6%
협동조합 3
 
0.6%
한국이러닝개발원 2
 
0.4%
Other values (272) 302
59.1%
2023-12-13T01:17:23.801520image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
197
 
7.2%
184
 
6.7%
182
 
6.7%
178
 
6.5%
177
 
6.5%
99
 
3.6%
95
 
3.5%
87
 
3.2%
52
 
1.9%
41
 
1.5%
Other values (298) 1435
52.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 2404
88.2%
Space Separator 197
 
7.2%
Other Symbol 99
 
3.6%
Uppercase Letter 11
 
0.4%
Lowercase Letter 5
 
0.2%
Open Punctuation 4
 
0.1%
Close Punctuation 4
 
0.1%
Other Punctuation 3
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
184
 
7.7%
182
 
7.6%
178
 
7.4%
177
 
7.4%
95
 
4.0%
87
 
3.6%
52
 
2.2%
41
 
1.7%
38
 
1.6%
37
 
1.5%
Other values (276) 1333
55.4%
Uppercase Letter
ValueCountFrequency (%)
M 2
18.2%
S 1
9.1%
C 1
9.1%
N 1
9.1%
E 1
9.1%
P 1
9.1%
T 1
9.1%
V 1
9.1%
O 1
9.1%
D 1
9.1%
Lowercase Letter
ValueCountFrequency (%)
t 1
20.0%
i 1
20.0%
a 1
20.0%
d 1
20.0%
e 1
20.0%
Other Punctuation
ValueCountFrequency (%)
. 1
33.3%
& 1
33.3%
: 1
33.3%
Space Separator
ValueCountFrequency (%)
197
100.0%
Other Symbol
ValueCountFrequency (%)
99
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 2503
91.8%
Common 208
 
7.6%
Latin 16
 
0.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
184
 
7.4%
182
 
7.3%
178
 
7.1%
177
 
7.1%
99
 
4.0%
95
 
3.8%
87
 
3.5%
52
 
2.1%
41
 
1.6%
38
 
1.5%
Other values (277) 1370
54.7%
Latin
ValueCountFrequency (%)
M 2
 
12.5%
t 1
 
6.2%
S 1
 
6.2%
C 1
 
6.2%
N 1
 
6.2%
E 1
 
6.2%
P 1
 
6.2%
i 1
 
6.2%
T 1
 
6.2%
V 1
 
6.2%
Other values (5) 5
31.2%
Common
ValueCountFrequency (%)
197
94.7%
( 4
 
1.9%
) 4
 
1.9%
. 1
 
0.5%
& 1
 
0.5%
: 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 2404
88.2%
ASCII 224
 
8.2%
None 99
 
3.6%

Most frequent character per block

ASCII
ValueCountFrequency (%)
197
87.9%
( 4
 
1.8%
) 4
 
1.8%
M 2
 
0.9%
t 1
 
0.4%
. 1
 
0.4%
S 1
 
0.4%
C 1
 
0.4%
N 1
 
0.4%
E 1
 
0.4%
Other values (11) 11
 
4.9%
Hangul
ValueCountFrequency (%)
184
 
7.7%
182
 
7.6%
178
 
7.4%
177
 
7.4%
95
 
4.0%
87
 
3.6%
52
 
2.2%
41
 
1.7%
38
 
1.6%
37
 
1.5%
Other values (276) 1333
55.4%
None
ValueCountFrequency (%)
99
100.0%
Distinct276
Distinct (%)87.9%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
2023-12-13T01:17:24.163952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length74
Median length53
Mean length39.025478
Min length19

Characters and Unicode

Total characters12254
Distinct characters384
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique245 ?
Unique (%)78.0%

Sample

1st row서울특별시 강남구 삼성로95길 15, 7츨(삼성동, 천해빌딩)
2nd row서울특별시 강남구 테헤란로38길 10, 8층(역삼동, IS빌딩)
3rd row서울특별시 강서구 공항대로61길 29, B동 2층 207호(등촌동, SBA국제유통센터)
4th row서울특별시 서초구 서리풀2길 30, 3층(서초동, 성도빌딩)
5th row경상남도 진주시 정촌면 뿌리신단로 90
ValueCountFrequency (%)
서울특별시 212
 
10.4%
경기도 43
 
2.1%
금천구 33
 
1.6%
마포구 28
 
1.4%
서초구 25
 
1.2%
강남구 25
 
1.2%
영등포구 22
 
1.1%
가산디지털1로 20
 
1.0%
구로구 18
 
0.9%
4층 13
 
0.6%
Other values (1019) 1594
78.4%
2023-12-13T01:17:24.710248image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1719
 
14.0%
, 574
 
4.7%
1 448
 
3.7%
399
 
3.3%
355
 
2.9%
339
 
2.8%
327
 
2.7%
( 313
 
2.6%
) 312
 
2.5%
311
 
2.5%
Other values (374) 7157
58.4%

Most occurring categories

ValueCountFrequency (%)
Other Letter 7053
57.6%
Decimal Number 2104
 
17.2%
Space Separator 1719
 
14.0%
Other Punctuation 574
 
4.7%
Open Punctuation 313
 
2.6%
Close Punctuation 312
 
2.5%
Uppercase Letter 113
 
0.9%
Dash Punctuation 41
 
0.3%
Lowercase Letter 19
 
0.2%
Math Symbol 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
399
 
5.7%
355
 
5.0%
339
 
4.8%
327
 
4.6%
311
 
4.4%
222
 
3.1%
212
 
3.0%
212
 
3.0%
209
 
3.0%
193
 
2.7%
Other values (326) 4274
60.6%
Uppercase Letter
ValueCountFrequency (%)
B 17
15.0%
C 12
10.6%
S 11
9.7%
M 11
9.7%
A 10
8.8%
D 9
8.0%
I 8
 
7.1%
T 5
 
4.4%
K 4
 
3.5%
Y 4
 
3.5%
Other values (11) 22
19.5%
Decimal Number
ValueCountFrequency (%)
1 448
21.3%
0 295
14.0%
2 293
13.9%
3 225
10.7%
4 186
8.8%
5 166
 
7.9%
6 155
 
7.4%
8 134
 
6.4%
7 112
 
5.3%
9 90
 
4.3%
Lowercase Letter
ValueCountFrequency (%)
e 5
26.3%
t 3
15.8%
n 2
 
10.5%
c 2
 
10.5%
r 2
 
10.5%
a 1
 
5.3%
u 1
 
5.3%
s 1
 
5.3%
d 1
 
5.3%
i 1
 
5.3%
Space Separator
ValueCountFrequency (%)
1719
100.0%
Other Punctuation
ValueCountFrequency (%)
, 574
100.0%
Open Punctuation
ValueCountFrequency (%)
( 313
100.0%
Close Punctuation
ValueCountFrequency (%)
) 312
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%
Math Symbol
ValueCountFrequency (%)
~ 5
100.0%
Other Symbol
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 7054
57.6%
Common 5068
41.4%
Latin 132
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
399
 
5.7%
355
 
5.0%
339
 
4.8%
327
 
4.6%
311
 
4.4%
222
 
3.1%
212
 
3.0%
212
 
3.0%
209
 
3.0%
193
 
2.7%
Other values (327) 4275
60.6%
Latin
ValueCountFrequency (%)
B 17
12.9%
C 12
 
9.1%
S 11
 
8.3%
M 11
 
8.3%
A 10
 
7.6%
D 9
 
6.8%
I 8
 
6.1%
T 5
 
3.8%
e 5
 
3.8%
K 4
 
3.0%
Other values (21) 40
30.3%
Common
ValueCountFrequency (%)
1719
33.9%
, 574
 
11.3%
1 448
 
8.8%
( 313
 
6.2%
) 312
 
6.2%
0 295
 
5.8%
2 293
 
5.8%
3 225
 
4.4%
4 186
 
3.7%
5 166
 
3.3%
Other values (6) 537
 
10.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 7053
57.6%
ASCII 5200
42.4%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1719
33.1%
, 574
 
11.0%
1 448
 
8.6%
( 313
 
6.0%
) 312
 
6.0%
0 295
 
5.7%
2 293
 
5.6%
3 225
 
4.3%
4 186
 
3.6%
5 166
 
3.2%
Other values (37) 669
 
12.9%
Hangul
ValueCountFrequency (%)
399
 
5.7%
355
 
5.0%
339
 
4.8%
327
 
4.6%
311
 
4.4%
222
 
3.1%
212
 
3.0%
212
 
3.0%
209
 
3.0%
193
 
2.7%
Other values (326) 4274
60.6%
None
ValueCountFrequency (%)
1
100.0%
Distinct157
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size2.6 KiB
Minimum2021-01-05 00:00:00
Maximum2022-05-20 00:00:00
2023-12-13T01:17:24.877378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T01:17:25.040012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Missing values

2023-12-13T01:17:23.069974image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T01:17:23.147299image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

회사명소재지확인서_발급일
0주식회사 스킬서포트서울특별시 강남구 삼성로95길 15, 7츨(삼성동, 천해빌딩)2021-01-05
1㈜한국디지털페이먼츠서울특별시 강남구 테헤란로38길 10, 8층(역삼동, IS빌딩)2021-01-06
2㈜브랜드콘텐츠서울특별시 강서구 공항대로61길 29, B동 2층 207호(등촌동, SBA국제유통센터)2021-01-07
3주식회사 윈즈데이서울특별시 서초구 서리풀2길 30, 3층(서초동, 성도빌딩)2021-01-07
4씨아이씨소프트 주식회사경상남도 진주시 정촌면 뿌리신단로 902021-01-12
5㈜블루킹스아카데미부산광역시 남구 수영로 269, 603호(대연동, 기아자동차부산지역본부)2021-01-12
6주식회사 에듀마루서울특별시 노원구 공릉로 232, 208호(공릉동, 서울과학기술대학교제1창업보육센터)2021-01-15
7㈜스톰미디어서울특별시 서초구 효령로 336, 6층(서초동, 윤일빌딩)2021-01-19
8㈜한국인적자원관리원서울특별시 용산구 한강대로46길 19, 4층(한강로2가, 호진빌딩)2021-01-20
9㈜입소서울특별시 구로구 디지털로32길 30, 207호(구로동, 코오롱디지털타워빌란트 1차)2021-01-25
회사명소재지확인서_발급일
304㈜ 러닝팩토리서울특별시 서초구 사임당로 64, 401호(서초동, 교대벤처타워)2022-05-05
305㈜ 세움에듀경상남도 진주시 범골로54번길 30-9, 드림IT밸리 B동 715호,710호,709호,708호(충무공동)2022-05-10
306브이알펄스대전광역시 유성구 테크노3로 65, 6층 638호(관평동, 한신에스메카)2022-05-13
307미림미디어랩 주식회사서울특별시 마포구 매봉산로 37, 1006호(상암동, DMC 산학협력연구센터)2022-05-13
308주식회사 미소능력개발센터전라북도 전주시 완산구 중인1길 189, 2층(중인동)2022-05-17
309주식회사 다빈치커뮤니케이션서울특별시 금천구 디지털로9길 68, 1610호(가산동, 대륭포스트타워 5차)2022-05-17
310백발십필름서울특별시 강남구 언주로151길 13, 지하1층(신사동, 계천빌딩)2022-05-18
311더에이아이랩 주식회사서울특별시 강남구 선릉로108길 9, 5층(삼성동, 신원빌딩)2022-05-18
312㈜그린비미디어서울특별시 마포구 동교로 128, B동 3층(서교동, 진영빌딩)2022-05-19
313모먼트코퍼레이션서울특별시 서대문구 연세로2나길 61, 101호(창천동, 캠퍼스타운 에스큐브)2022-05-20