Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells371
Missing cells (%)0.5%
Duplicate rows735
Duplicate rows (%)7.3%
Total size in memory625.0 KiB
Average record size in memory64.0 B

Variable types

Categorical4
Text2
DateTime1

Dataset

Description역대전국장애인체육대회 메달 상세 내역(대회구분, 구분별, 세부종목별, 경기일자, 소속, 메달구분) 데이터화 제공
Author대한장애인체육회
URLhttps://www.data.go.kr/data/15072755/fileData.do

Alerts

Dataset has 735 (7.3%) duplicate rowsDuplicates
소속명 has 371 (3.7%) missing valuesMissing

Reproduction

Analysis started2023-12-12 07:05:29.105836
Analysis finished2023-12-12 07:05:30.212275
Duration1.11 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

대회구분
Categorical

Distinct18
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
43회 전국장애인체육대회
1215 
42회 전국장애인체육대회
1202 
38회 전국장애인체육대회
1094 
37회 전국장애인체육대회
1046 
39회 전국장애인체육대회
1043 
Other values (13)
4400 

Length

Max length15
Median length13
Mean length13.277
Min length13

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row36회 전국장애인체육대회
2nd row38회 전국장애인체육대회
3rd row36회 전국장애인체육대회
4th row42회 전국장애인체육대회
5th row39회 전국장애인체육대회

Common Values

ValueCountFrequency (%)
43회 전국장애인체육대회 1215
12.2%
42회 전국장애인체육대회 1202
12.0%
38회 전국장애인체육대회 1094
10.9%
37회 전국장애인체육대회 1046
10.5%
39회 전국장애인체육대회 1043
10.4%
41회 전국장애인체육대회 1002
10.0%
36회 전국장애인체육대회 982
9.8%
17회 전국장애학생체육대회 398
 
4.0%
13회 전국장애학생체육대회 391
 
3.9%
11회 전국장애학생체육대회 345
 
3.5%
Other values (8) 1282
12.8%

Length

2023-12-12T16:05:30.308659image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전국장애인체육대회 7584
37.9%
전국장애학생체육대회 2062
 
10.3%
43회 1215
 
6.1%
42회 1202
 
6.0%
38회 1094
 
5.5%
37회 1046
 
5.2%
39회 1043
 
5.2%
41회 1002
 
5.0%
36회 982
 
4.9%
17회 474
 
2.4%
Other values (9) 2296
 
11.5%

종목
Categorical

Distinct46
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
수영
1183 
역도
1129 
육상트랙
1054 
탁구
809 
육상필드
779 
Other values (41)
5046 

Length

Max length8
Median length2
Mean length2.8721
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row축구
2nd row탁구
3rd row축구
4th row파크골프
5th row휠체어럭비

Common Values

ValueCountFrequency (%)
수영 1183
 
11.8%
역도 1129
 
11.3%
육상트랙 1054
 
10.5%
탁구 809
 
8.1%
육상필드 779
 
7.8%
축구 435
 
4.3%
볼링 433
 
4.3%
댄스스포츠 387
 
3.9%
사이클 308
 
3.1%
배드민턴 303
 
3.0%
Other values (36) 3180
31.8%

Length

2023-12-12T16:05:30.441293image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
수영 1183
 
11.8%
역도 1129
 
11.3%
육상트랙 1054
 
10.5%
탁구 809
 
8.1%
육상필드 779
 
7.8%
축구 435
 
4.3%
볼링 433
 
4.3%
댄스스포츠 387
 
3.9%
사이클 308
 
3.1%
배드민턴 303
 
3.0%
Other values (36) 3180
31.8%

종별
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
선수부
8121 
동호인부
1879 

Length

Max length4
Median length3
Mean length3.1879
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동호인부
2nd row선수부
3rd row선수부
4th row선수부
5th row선수부

Common Values

ValueCountFrequency (%)
선수부 8121
81.2%
동호인부 1879
 
18.8%

Length

2023-12-12T16:05:30.564640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:05:30.656859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
선수부 8121
81.2%
동호인부 1879
 
18.8%
Distinct2256
Distinct (%)22.6%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2023-12-12T16:05:30.952924image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length43
Median length36
Mean length21.2612
Min length11

Characters and Unicode

Total characters212612
Distinct characters253
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique593 ?
Unique (%)5.9%

Sample

1st row남자 11인제 축구 IDD(동호인부)
2nd row여자 단체전(3명) DF(선수부)
3rd row남자 11인제 축구 DB(선수부)
4th row여자 개인전 PGST1(선수부)
5th row혼성 휠체어럭비 Quad(선수부)
ValueCountFrequency (%)
남자 5096
 
13.2%
여자 3061
 
8.0%
혼성 1822
 
4.7%
class 938
 
2.4%
단체전 790
 
2.1%
100m 682
 
1.8%
open(선수부 629
 
1.6%
개인전 551
 
1.4%
복식 543
 
1.4%
db(선수부 515
 
1.3%
Other values (831) 23875
62.0%
2023-12-12T16:05:31.441242image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
28522
 
13.4%
( 11601
 
5.5%
) 11601
 
5.5%
8679
 
4.1%
7920
 
3.7%
0 6156
 
2.9%
6144
 
2.9%
6032
 
2.8%
5096
 
2.4%
1 4936
 
2.3%
Other values (243) 115925
54.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 92355
43.4%
Space Separator 28522
 
13.4%
Uppercase Letter 28237
 
13.3%
Decimal Number 23581
 
11.1%
Open Punctuation 11601
 
5.5%
Close Punctuation 11601
 
5.5%
Lowercase Letter 10879
 
5.1%
Other Punctuation 3430
 
1.6%
Dash Punctuation 1567
 
0.7%
Math Symbol 834
 
0.4%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8679
 
9.4%
7920
 
8.6%
6144
 
6.7%
6032
 
6.5%
5096
 
5.5%
4202
 
4.5%
3061
 
3.3%
2222
 
2.4%
1989
 
2.2%
1862
 
2.0%
Other values (178) 45148
48.9%
Uppercase Letter
ValueCountFrequency (%)
P 3370
11.9%
S 3120
11.0%
B 2824
10.0%
N 2490
8.8%
O 2352
8.3%
E 2216
7.8%
T 2164
7.7%
D 2048
7.3%
C 1677
 
5.9%
I 1134
 
4.0%
Other values (13) 4842
17.1%
Lowercase Letter
ValueCountFrequency (%)
m 3147
28.9%
k 1635
15.0%
g 1340
12.3%
s 898
 
8.3%
a 746
 
6.9%
e 501
 
4.6%
n 481
 
4.4%
l 439
 
4.0%
i 270
 
2.5%
d 241
 
2.2%
Other values (9) 1181
 
10.9%
Decimal Number
ValueCountFrequency (%)
0 6156
26.1%
1 4936
20.9%
2 2542
10.8%
3 2532
10.7%
4 2191
 
9.3%
5 2146
 
9.1%
6 990
 
4.2%
7 864
 
3.7%
8 838
 
3.6%
9 386
 
1.6%
Letter Number
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Other Punctuation
ValueCountFrequency (%)
, 1859
54.2%
/ 1359
39.6%
. 212
 
6.2%
Math Symbol
ValueCountFrequency (%)
~ 575
68.9%
+ 259
31.1%
Space Separator
ValueCountFrequency (%)
28522
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11601
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11601
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1567
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 92355
43.4%
Common 81136
38.2%
Latin 39121
18.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
8679
 
9.4%
7920
 
8.6%
6144
 
6.7%
6032
 
6.5%
5096
 
5.5%
4202
 
4.5%
3061
 
3.3%
2222
 
2.4%
1989
 
2.2%
1862
 
2.0%
Other values (178) 45148
48.9%
Latin
ValueCountFrequency (%)
P 3370
 
8.6%
m 3147
 
8.0%
S 3120
 
8.0%
B 2824
 
7.2%
N 2490
 
6.4%
O 2352
 
6.0%
E 2216
 
5.7%
T 2164
 
5.5%
D 2048
 
5.2%
C 1677
 
4.3%
Other values (36) 13713
35.1%
Common
ValueCountFrequency (%)
28522
35.2%
( 11601
14.3%
) 11601
14.3%
0 6156
 
7.6%
1 4936
 
6.1%
2 2542
 
3.1%
3 2532
 
3.1%
4 2191
 
2.7%
5 2146
 
2.6%
, 1859
 
2.3%
Other values (9) 7050
 
8.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 120252
56.6%
Hangul 92355
43.4%
Number Forms 5
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
28522
23.7%
( 11601
 
9.6%
) 11601
 
9.6%
0 6156
 
5.1%
1 4936
 
4.1%
P 3370
 
2.8%
m 3147
 
2.6%
S 3120
 
2.6%
B 2824
 
2.3%
2 2542
 
2.1%
Other values (51) 42433
35.3%
Hangul
ValueCountFrequency (%)
8679
 
9.4%
7920
 
8.6%
6144
 
6.7%
6032
 
6.5%
5096
 
5.5%
4202
 
4.5%
3061
 
3.3%
2222
 
2.4%
1989
 
2.2%
1862
 
2.0%
Other values (178) 45148
48.9%
Number Forms
ValueCountFrequency (%)
2
40.0%
1
20.0%
1
20.0%
1
20.0%
Distinct110
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
Minimum2016-10-21 00:00:00
Maximum2023-11-08 00:00:00
2023-12-12T16:05:31.615671image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T16:05:31.801535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

소속명
Text

MISSING 

Distinct1233
Distinct (%)12.8%
Missing371
Missing (%)3.7%
Memory size156.2 KiB
2023-12-12T16:05:32.052619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length22
Median length18
Mean length8.3656662
Min length2

Characters and Unicode

Total characters80553
Distinct characters452
Distinct categories9 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique378 ?
Unique (%)3.9%

Sample

1st row전남FC
2nd row속초시장애인체육회
3rd row충북농아인체육연맹
4th row부천장애인골프협회
5th row경남본드
ValueCountFrequency (%)
일반(개인 1176
 
11.3%
소속팀없음 500
 
4.8%
울산광역시장애인역도연맹 148
 
1.4%
충북장애인역도연맹 95
 
0.9%
부산장애인역도연맹 90
 
0.9%
부산육상 84
 
0.8%
경북장애인육상연맹 83
 
0.8%
수영사랑 72
 
0.7%
경기도 72
 
0.7%
충청남도장애인육상연맹 69
 
0.7%
Other values (1277) 7992
77.0%
2023-12-12T16:05:32.439436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
5819
 
7.2%
4231
 
5.3%
4171
 
5.2%
2370
 
2.9%
2368
 
2.9%
2289
 
2.8%
1914
 
2.4%
1729
 
2.1%
) 1411
 
1.8%
( 1410
 
1.8%
Other values (442) 52841
65.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 75937
94.3%
Close Punctuation 1411
 
1.8%
Open Punctuation 1410
 
1.8%
Space Separator 756
 
0.9%
Uppercase Letter 727
 
0.9%
Lowercase Letter 193
 
0.2%
Other Punctuation 93
 
0.1%
Dash Punctuation 14
 
< 0.1%
Decimal Number 12
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
5819
 
7.7%
4231
 
5.6%
4171
 
5.5%
2370
 
3.1%
2368
 
3.1%
2289
 
3.0%
1914
 
2.5%
1729
 
2.3%
1336
 
1.8%
1308
 
1.7%
Other values (397) 48402
63.7%
Uppercase Letter
ValueCountFrequency (%)
C 256
35.2%
F 194
26.7%
B 90
 
12.4%
K 43
 
5.9%
M 24
 
3.3%
R 23
 
3.2%
N 22
 
3.0%
S 18
 
2.5%
L 14
 
1.9%
D 13
 
1.8%
Other values (9) 30
 
4.1%
Lowercase Letter
ValueCountFrequency (%)
i 32
16.6%
n 28
14.5%
w 26
13.5%
g 26
13.5%
o 25
13.0%
e 12
 
6.2%
c 9
 
4.7%
r 7
 
3.6%
m 6
 
3.1%
d 5
 
2.6%
Other values (5) 17
8.8%
Decimal Number
ValueCountFrequency (%)
7 9
75.0%
1 1
 
8.3%
5 1
 
8.3%
2 1
 
8.3%
Other Punctuation
ValueCountFrequency (%)
. 83
89.2%
& 8
 
8.6%
, 2
 
2.2%
Close Punctuation
ValueCountFrequency (%)
) 1411
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1410
100.0%
Space Separator
ValueCountFrequency (%)
756
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 75937
94.3%
Common 3696
 
4.6%
Latin 920
 
1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
5819
 
7.7%
4231
 
5.6%
4171
 
5.5%
2370
 
3.1%
2368
 
3.1%
2289
 
3.0%
1914
 
2.5%
1729
 
2.3%
1336
 
1.8%
1308
 
1.7%
Other values (397) 48402
63.7%
Latin
ValueCountFrequency (%)
C 256
27.8%
F 194
21.1%
B 90
 
9.8%
K 43
 
4.7%
i 32
 
3.5%
n 28
 
3.0%
w 26
 
2.8%
g 26
 
2.8%
o 25
 
2.7%
M 24
 
2.6%
Other values (24) 176
19.1%
Common
ValueCountFrequency (%)
) 1411
38.2%
( 1410
38.1%
756
20.5%
. 83
 
2.2%
- 14
 
0.4%
7 9
 
0.2%
& 8
 
0.2%
, 2
 
0.1%
1 1
 
< 0.1%
5 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 75937
94.3%
ASCII 4616
 
5.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
5819
 
7.7%
4231
 
5.6%
4171
 
5.5%
2370
 
3.1%
2368
 
3.1%
2289
 
3.0%
1914
 
2.5%
1729
 
2.3%
1336
 
1.8%
1308
 
1.7%
Other values (397) 48402
63.7%
ASCII
ValueCountFrequency (%)
) 1411
30.6%
( 1410
30.5%
756
16.4%
C 256
 
5.5%
F 194
 
4.2%
B 90
 
1.9%
. 83
 
1.8%
K 43
 
0.9%
i 32
 
0.7%
n 28
 
0.6%
Other values (35) 313
 
6.8%

메달구분
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
동메달
3422 
금메달
3368 
은메달
3210 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row동메달
2nd row은메달
3rd row은메달
4th row동메달
5th row은메달

Common Values

ValueCountFrequency (%)
동메달 3422
34.2%
금메달 3368
33.7%
은메달 3210
32.1%

Length

2023-12-12T16:05:32.573918image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T16:05:32.657097image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
동메달 3422
34.2%
금메달 3368
33.7%
은메달 3210
32.1%

Correlations

2023-12-12T16:05:32.717473image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대회구분종목종별메달구분
대회구분1.0000.7740.3100.110
종목0.7741.0000.6190.184
종별0.3100.6191.0000.000
메달구분0.1100.1840.0001.000
2023-12-12T16:05:32.802425image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
종별메달구분대회구분종목
종별1.0000.0000.2440.498
메달구분0.0001.0000.0500.090
대회구분0.2440.0501.0000.310
종목0.4980.0900.3101.000
2023-12-12T16:05:32.887592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
대회구분종목종별메달구분
대회구분1.0000.3100.2440.050
종목0.3101.0000.4980.090
종별0.2440.4981.0000.000
메달구분0.0500.0900.0001.000

Missing values

2023-12-12T16:05:29.951727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T16:05:30.138104image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

대회구분종목종별세부종목경기일자소속명메달구분
362836회 전국장애인체육대회축구동호인부남자 11인제 축구 IDD(동호인부)2016-10-25전남FC동메달
1012938회 전국장애인체육대회탁구선수부여자 단체전(3명) DF(선수부)2018-10-29속초시장애인체육회은메달
381436회 전국장애인체육대회축구선수부남자 11인제 축구 DB(선수부)2016-10-25충북농아인체육연맹은메달
2936242회 전국장애인체육대회파크골프선수부여자 개인전 PGST1(선수부)2022-10-22부천장애인골프협회동메달
1633439회 전국장애인체육대회휠체어럭비선수부혼성 휠체어럭비 Quad(선수부)2018-10-29경남본드은메달
2777041회 전국장애인체육대회사이클선수부남자 개인도로 84km (Tandem) B(선수부)2021-10-24전라남도장애인사이클연맹은메달
782537회 전국장애인체육대회사이클선수부남자 개인도로 75km 이내 C1,C2(선수부)2017-09-18<NA>은메달
3104442회 전국장애인체육대회역도선수부여자 -79kg급 벤치프레스종합 OPEN(선수부)2022-10-22제주중앙여자고등학교동메달
2831742회 전국장애인체육대회축구선수부남자 11인제 축구 DB(선수부)2022-10-24용인유나이티드농아인축구클럽금메달
4217443회 전국장애인체육대회육상트랙선수부여자 100mB T11(선수부)2023-11-05일반(개인)은메달
대회구분종목종별세부종목경기일자소속명메달구분
1166438회 전국장애인체육대회육상필드선수부여자 원반던지기 F37(선수부)2018-10-25제주특별자치도장애인육상연맹은메달
1296138회 전국장애인체육대회댄스스포츠동호인부혼성 스탠더드 단체전 Class B(동호인부)2018-10-26DCAK클럽금메달
658437회 전국장애인체육대회수영선수부남자 개인혼영 200m SM9(선수부)2017-09-18cd swimming club금메달
269236회 전국장애인체육대회역도동호인부여자 -66kg급 파워리프트종합 OPEN(지적,동호인부)2016-10-23충북장애인역도연맹은메달
1786014회 전국장애인동계체육대회휠체어컬링선수부혼성 휠체어컬링 WC-E (선수부)2017-02-10인천광역시체육회동메달
474837회 전국장애인체육대회테니스선수부여자 단체전 OPEN(선수부)2017-09-17스포츠 토토동메달
4253743회 전국장애인체육대회사이클선수부여자 트랙 개인추발 3km (Tandem) B(선수부)2023-11-03제주특별자치도장애인사이클연맹은메달
1145638회 전국장애인체육대회사이클선수부남자 개인도로독주 20km 이내 H2(선수부)2018-10-26일반(개인)은메달
253536회 전국장애인체육대회론볼동호인부혼성 복식 B5(동호인부)2016-10-23송파론볼클럽은메달
2975642회 전국장애인체육대회수영선수부남자 자유형 50m S9(선수부)2022-10-21충청남도장애인수영연맹은메달

Duplicate rows

Most frequently occurring

대회구분종목종별세부종목경기일자소속명메달구분# duplicates
60442회 전국장애인체육대회축구동호인부남자 11인제 축구 IDD(동호인부)2022-10-24울산돌고래축구회(지적)은메달8
70643회 전국장애인체육대회축구선수부남자 11인제 축구 DB(선수부)2023-11-08기드온엔젤스은메달8
20036회 전국장애인체육대회축구선수부남자 11인제 축구 DB(선수부)2016-10-25용인시농아인축구클럽금메달7
61142회 전국장애인체육대회축구선수부남자 11인제 축구 DB(선수부)2022-10-24충남데프FC동메달7
71643회 전국장애인체육대회축구선수부남자 7인제 축구 FT1,FT2,FT3(선수부)2023-11-08사천제니우스동메달7
2912회 전국장애학생체육대회축구선수부혼성 11인제 축구 OPEN(초/중/고)2018-05-18성광FC은메달6
4813회 전국장애학생체육대회축구선수부남자 11인제 축구 OPEN(초/중/고)2019-05-17성광FC금메달6
4913회 전국장애학생체육대회축구선수부남자 11인제 축구 OPEN(초/중/고)2019-05-17울산지적축구학생부은메달6
10717회 전국장애학생체육대회농구선수부혼성 지적(발달)농구 IDD(중)2023-05-17아산시장애인복지관(지적학생농구팀)금메달6
14636회 전국장애인체육대회농구동호인부남자 지적농구 IDD(동호인부)2016-10-25구미혜당학교동메달6