Overview

Dataset statistics

Number of variables16
Number of observations10000
Missing cells69995
Missing cells (%)43.7%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.3 MiB
Average record size in memory136.0 B

Variable types

Text9
Categorical7

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-21706/F/1/datasetView.do

Alerts

Unnamed: 2 has constant value ""Constant
Unnamed: 3 has constant value ""Constant
Unnamed: 5 has constant value ""Constant
Unnamed: 6 has constant value ""Constant
Unnamed: 8 has constant value ""Constant
Unnamed: 11 has constant value ""Constant
Unnamed: 12 has constant value ""Constant
Unnamed: 14 is highly overall correlated with Unnamed: 4 and 2 other fieldsHigh correlation
Unnamed: 9 is highly overall correlated with Unnamed: 4 and 1 other fieldsHigh correlation
Unnamed: 15 is highly overall correlated with Unnamed: 4 and 5 other fieldsHigh correlation
Unnamed: 10 is highly overall correlated with Unnamed: 4 and 1 other fieldsHigh correlation
Unnamed: 4 is highly overall correlated with Unnamed: 7 and 5 other fieldsHigh correlation
Unnamed: 13 is highly overall correlated with Unnamed: 4 and 1 other fieldsHigh correlation
Unnamed: 7 is highly overall correlated with Unnamed: 4 and 2 other fieldsHigh correlation
Unnamed: 4 is highly imbalanced (50.1%)Imbalance
Unnamed: 15 is highly imbalanced (99.8%)Imbalance
Unnamed: 2 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 3 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 5 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 6 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 8 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 11 has 9999 (> 99.9%) missing valuesMissing
Unnamed: 12 has 9999 (> 99.9%) missing valuesMissing

Reproduction

Analysis started2024-04-17 10:38:09.704453
Analysis finished2024-04-17 10:38:11.341683
Duration1.64 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct9999
Distinct (%)100.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-17T19:38:11.636889image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length6
Median length5
Mean length4.4447445
Min length1

Characters and Unicode

Total characters44443
Distinct characters16
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9999 ?
Unique (%)100.0%

Sample

1st row13422
2nd row9239
3rd row18976
4th row18732
5th row19529
ValueCountFrequency (%)
13422 1
 
< 0.1%
8278 1
 
< 0.1%
15200 1
 
< 0.1%
13400 1
 
< 0.1%
15630 1
 
< 0.1%
15713 1
 
< 0.1%
17837 1
 
< 0.1%
18508 1
 
< 0.1%
9321 1
 
< 0.1%
15791 1
 
< 0.1%
Other values (9989) 9989
99.9%
2024-04-17T19:38:12.091505image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 8916
20.1%
4 4066
9.1%
3 4041
9.1%
7 4029
9.1%
6 4013
9.0%
5 3993
9.0%
2 3989
9.0%
8 3983
9.0%
9 3944
8.9%
0 3463
 
7.8%
Other values (6) 6
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 44437
> 99.9%
Other Letter 4
 
< 0.1%
Uppercase Letter 2
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 8916
20.1%
4 4066
9.2%
3 4041
9.1%
7 4029
9.1%
6 4013
9.0%
5 3993
9.0%
2 3989
9.0%
8 3983
9.0%
9 3944
8.9%
0 3463
 
7.8%
Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Uppercase Letter
ValueCountFrequency (%)
I 1
50.0%
D 1
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 44437
> 99.9%
Hangul 4
 
< 0.1%
Latin 2
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
1 8916
20.1%
4 4066
9.2%
3 4041
9.1%
7 4029
9.1%
6 4013
9.0%
5 3993
9.0%
2 3989
9.0%
8 3983
9.0%
9 3944
8.9%
0 3463
 
7.8%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Latin
ValueCountFrequency (%)
I 1
50.0%
D 1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 44439
> 99.9%
Hangul 4
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 8916
20.1%
4 4066
9.1%
3 4041
9.1%
7 4029
9.1%
6 4013
9.0%
5 3993
9.0%
2 3989
9.0%
8 3983
9.0%
9 3944
8.9%
0 3463
 
7.8%
Other values (2) 2
 
< 0.1%
Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Distinct1001
Distinct (%)10.0%
Missing1
Missing (%)< 0.1%
Memory size156.2 KiB
2024-04-17T19:38:12.463315image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length7
Median length6
Mean length6.3667367
Min length6

Characters and Unicode

Total characters63661
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st row18-2005
2nd row12-067
3rd row24-212
4th row24-2034
5th row25-2037
ValueCountFrequency (%)
13-032 17
 
0.2%
23-868 16
 
0.2%
11-2039 16
 
0.2%
11-145 15
 
0.2%
14-002 15
 
0.2%
21-1132 15
 
0.2%
19-459 15
 
0.2%
14-2040 15
 
0.2%
16-240 15
 
0.2%
23-031 15
 
0.2%
Other values (991) 9845
98.5%
2024-04-17T19:38:12.924319image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 11595
18.2%
1 10412
16.4%
- 9998
15.7%
2 9368
14.7%
3 5086
8.0%
4 3702
 
5.8%
5 3214
 
5.0%
6 2657
 
4.2%
7 2648
 
4.2%
9 2496
 
3.9%
Other values (7) 2485
 
3.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 53657
84.3%
Dash Punctuation 9998
 
15.7%
Other Letter 6
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 11595
21.6%
1 10412
19.4%
2 9368
17.5%
3 5086
9.5%
4 3702
 
6.9%
5 3214
 
6.0%
6 2657
 
5.0%
7 2648
 
4.9%
9 2496
 
4.7%
8 2479
 
4.6%
Other Letter
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
Dash Punctuation
ValueCountFrequency (%)
- 9998
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 63655
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 11595
18.2%
1 10412
16.4%
- 9998
15.7%
2 9368
14.7%
3 5086
8.0%
4 3702
 
5.8%
5 3214
 
5.0%
6 2657
 
4.2%
7 2648
 
4.2%
9 2496
 
3.9%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 63655
> 99.9%
Hangul 6
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 11595
18.2%
1 10412
16.4%
- 9998
15.7%
2 9368
14.7%
3 5086
8.0%
4 3702
 
5.8%
5 3214
 
5.0%
6 2657
 
4.2%
7 2648
 
4.2%
9 2496
 
3.9%
Hangul
ValueCountFrequency (%)
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%
1
16.7%

Unnamed: 2
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:13.040751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters4
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row조사일자
ValueCountFrequency (%)
조사일자 1
100.0%
2024-04-17T19:38:13.277308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 3
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:13.383189image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters4
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row조사요일
ValueCountFrequency (%)
조사요일 1
100.0%
2024-04-17T19:38:13.622712image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 4
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
여자
5400 
남자
4598 
<NA>
 
1
남여구분
 
1

Length

Max length4
Median length2
Mean length2.0004
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row남자
2nd row여자
3rd row남자
4th row여자
5th row남자

Common Values

ValueCountFrequency (%)
여자 5400
54.0%
남자 4598
46.0%
<NA> 1
 
< 0.1%
남여구분 1
 
< 0.1%

Length

2024-04-17T19:38:13.741980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:38:13.847386image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
여자 5400
54.0%
남자 4598
46.0%
na 1
 
< 0.1%
남여구분 1
 
< 0.1%

Unnamed: 5
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:13.946289image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters5
Distinct characters5
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row조사시간대
ValueCountFrequency (%)
조사시간대 1
100.0%
2024-04-17T19:38:14.201462image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 5
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 5
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Unnamed: 6
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:14.329691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length9
Median length9
Mean length9
Min length9

Characters and Unicode

Total characters9
Distinct characters9
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row조사시간대_텍스트
ValueCountFrequency (%)
조사시간대_텍스트 1
100.0%
2024-04-17T19:38:14.580632image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
_ 1
11.1%
1
11.1%
1
11.1%
1
11.1%

Most occurring categories

ValueCountFrequency (%)
Other Letter 8
88.9%
Connector Punctuation 1
 
11.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 8
88.9%
Common 1
 
11.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Common
ValueCountFrequency (%)
_ 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 8
88.9%
ASCII 1
 
11.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
ASCII
ValueCountFrequency (%)
_ 1
100.0%

Unnamed: 7
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
50-59세
2320 
20-29세
1897 
40-49세
1862 
30-39세
1817 
60세이상
1570 
Other values (3)
534 

Length

Max length6
Median length6
Mean length5.8425
Min length3

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row30-39세
2nd row40-49세
3rd row40-49세
4th row40-49세
5th row50-59세

Common Values

ValueCountFrequency (%)
50-59세 2320
23.2%
20-29세 1897
19.0%
40-49세 1862
18.6%
30-39세 1817
18.2%
60세이상 1570
15.7%
15-19세 532
 
5.3%
<NA> 1
 
< 0.1%
연령대 1
 
< 0.1%

Length

2024-04-17T19:38:14.914479image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:38:15.020928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
50-59세 2320
23.2%
20-29세 1897
19.0%
40-49세 1862
18.6%
30-39세 1817
18.2%
60세이상 1570
15.7%
15-19세 532
 
5.3%
na 1
 
< 0.1%
연령대 1
 
< 0.1%

Unnamed: 8
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:15.129075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters3
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row거주지
ValueCountFrequency (%)
거주지 1
100.0%
2024-04-17T19:38:15.334477image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Unnamed: 9
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
출근
1716 
개인용무/집안일(병원,은행,관공서,종교활동,봉사활동
1563 
업무관련
1425 
여가/오락/친교/모임/식사(회식,음료포함)
1354 
물건을사려고
953 
Other values (9)
2989 

Length

Max length28
Median length19
Mean length12.8215
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row역/정류장/교통수단이용
2nd row귀가/퇴근/하교
3rd row출근
4th row귀가/퇴근/하교
5th row업무관련

Common Values

ValueCountFrequency (%)
출근 1716
17.2%
개인용무/집안일(병원,은행,관공서,종교활동,봉사활동 1563
15.6%
업무관련 1425
14.2%
여가/오락/친교/모임/식사(회식,음료포함) 1354
13.5%
물건을사려고 953
9.5%
그냥걸으려고(운동,산책,기분전환등) 810
8.1%
역/정류장/교통수단이용 569
 
5.7%
귀가/퇴근/하교 446
 
4.5%
학업관련(학원/도서관) 411
 
4.1%
누군가를데리러(또는데려다주러) 312
 
3.1%
Other values (4) 441
 
4.4%

Length

2024-04-17T19:38:15.450449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
출근 1716
17.2%
개인용무/집안일(병원,은행,관공서,종교활동,봉사활동 1563
15.6%
업무관련 1425
14.2%
여가/오락/친교/모임/식사(회식,음료포함 1354
13.5%
물건을사려고 953
9.5%
그냥걸으려고(운동,산책,기분전환등 810
8.1%
역/정류장/교통수단이용 569
 
5.7%
귀가/퇴근/하교 446
 
4.5%
학업관련(학원/도서관 411
 
4.1%
누군가를데리러(또는데려다주러 312
 
3.1%
Other values (4) 441
 
4.4%

Unnamed: 10
Categorical

HIGH CORRELATION 

Distinct8
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
매일
3417 
주3~5회
3041 
주1~2회
1672 
월1~2회
1010 
오늘처음
440 
Other values (3)
420 

Length

Max length7
Median length5
Mean length4.0143
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row월1~2회
2nd row매일
3rd row매일
4th row주3~5회
5th row주3~5회

Common Values

ValueCountFrequency (%)
매일 3417
34.2%
주3~5회 3041
30.4%
주1~2회 1672
16.7%
월1~2회 1010
 
10.1%
오늘처음 440
 
4.4%
6개월1~3회 418
 
4.2%
<NA> 1
 
< 0.1%
방문횟수 1
 
< 0.1%

Length

2024-04-17T19:38:15.555227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:38:15.656570image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
매일 3417
34.2%
주3~5회 3041
30.4%
주1~2회 1672
16.7%
월1~2회 1010
 
10.1%
오늘처음 440
 
4.4%
6개월1~3회 418
 
4.2%
na 1
 
< 0.1%
방문횟수 1
 
< 0.1%

Unnamed: 11
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:15.762656image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters4
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row동행자명
ValueCountFrequency (%)
동행자명 1
100.0%
2024-04-17T19:38:15.986586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 12
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing9999
Missing (%)> 99.9%
Memory size156.2 KiB
2024-04-17T19:38:16.109940image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters4
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row교통수단
ValueCountFrequency (%)
교통수단 1
100.0%
2024-04-17T19:38:16.345901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 13
Categorical

HIGH CORRELATION 

Distinct7
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
보통
4870 
약간만족
1957 
약간불만족
1535 
매우만족
1147 
매우불만족
489 
Other values (2)
 
2

Length

Max length5
Median length4
Mean length3.2284
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row보통
2nd row보통
3rd row보통
4th row보통
5th row약간불만족

Common Values

ValueCountFrequency (%)
보통 4870
48.7%
약간만족 1957
19.6%
약간불만족 1535
 
15.3%
매우만족 1147
 
11.5%
매우불만족 489
 
4.9%
<NA> 1
 
< 0.1%
보행환경 1
 
< 0.1%

Length

2024-04-17T19:38:16.461811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:38:16.579550image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
보통 4870
48.7%
약간만족 1957
19.6%
약간불만족 1535
 
15.3%
매우만족 1147
 
11.5%
매우불만족 489
 
4.9%
na 1
 
< 0.1%
보행환경 1
 
< 0.1%

Unnamed: 14
Categorical

HIGH CORRELATION 

Distinct14
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
(전업)주부
2022 
사무/기술직
1444 
(대)학생
1393 
판매/서비스직
1132 
자영업
1052 
Other values (9)
2957 

Length

Max length7
Median length6
Mean length5.5784
Min length2

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st row자영업
2nd row전문/자유직
3rd row판매/서비스직
4th row판매/서비스직
5th row사무/기술직

Common Values

ValueCountFrequency (%)
(전업)주부 2022
20.2%
사무/기술직 1444
14.4%
(대)학생 1393
13.9%
판매/서비스직 1132
11.3%
자영업 1052
10.5%
전문/자유직 860
8.6%
무직/기타 782
 
7.8%
일용/작업직 621
 
6.2%
경영/관리직 412
 
4.1%
생산/운수직 277
 
2.8%
Other values (4) 5
 
0.1%

Length

2024-04-17T19:38:16.705363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
전업)주부 2022
20.2%
사무/기술직 1444
14.4%
대)학생 1393
13.9%
판매/서비스직 1132
11.3%
자영업 1052
10.5%
전문/자유직 860
8.6%
무직/기타 782
 
7.8%
일용/작업직 621
 
6.2%
경영/관리직 412
 
4.1%
생산/운수직 277
 
2.8%
Other values (4) 5
 
< 0.1%

Unnamed: 15
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
2013
9998 
<NA>
 
1
년도
 
1

Length

Max length4
Median length4
Mean length3.9998
Min length2

Unique

Unique2 ?
Unique (%)< 0.1%

Sample

1st row2013
2nd row2013
3rd row2013
4th row2013
5th row2013

Common Values

ValueCountFrequency (%)
2013 9998
> 99.9%
<NA> 1
 
< 0.1%
년도 1
 
< 0.1%

Length

2024-04-17T19:38:16.822856image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-17T19:38:16.918664image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2013 9998
> 99.9%
na 1
 
< 0.1%
년도 1
 
< 0.1%

Correlations

2024-04-17T19:38:16.989810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 4Unnamed: 7Unnamed: 9Unnamed: 10Unnamed: 13Unnamed: 14Unnamed: 15
Unnamed: 41.0000.7690.8480.7690.9410.8851.000
Unnamed: 70.7691.0000.7580.8130.6390.8231.000
Unnamed: 90.8480.7581.0000.7340.7080.8471.000
Unnamed: 100.7690.8130.7341.0000.6410.6941.000
Unnamed: 130.9410.6390.7080.6411.0000.7081.000
Unnamed: 140.8850.8230.8470.6940.7081.0001.000
Unnamed: 151.0001.0001.0001.0001.0001.0001.000
2024-04-17T19:38:17.093184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 14Unnamed: 9Unnamed: 15Unnamed: 10Unnamed: 4Unnamed: 13Unnamed: 7
Unnamed: 141.0000.3880.9990.4130.7900.4490.570
Unnamed: 90.3881.0000.9990.4550.7290.4490.482
Unnamed: 150.9990.9991.0001.0001.0001.0001.000
Unnamed: 100.4130.4551.0001.0000.7070.4510.411
Unnamed: 40.7900.7291.0000.7071.0000.7070.708
Unnamed: 130.4490.4491.0000.4510.7071.0000.449
Unnamed: 70.5700.4821.0000.4110.7080.4491.000
2024-04-17T19:38:17.196870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Unnamed: 4Unnamed: 7Unnamed: 9Unnamed: 10Unnamed: 13Unnamed: 14Unnamed: 15
Unnamed: 41.0000.7080.7290.7070.7070.7901.000
Unnamed: 70.7081.0000.4820.4110.4490.5701.000
Unnamed: 90.7290.4821.0000.4550.4490.3880.999
Unnamed: 100.7070.4110.4551.0000.4510.4131.000
Unnamed: 130.7070.4490.4490.4511.0000.4491.000
Unnamed: 140.7900.5700.3880.4130.4491.0000.999
Unnamed: 151.0001.0000.9991.0001.0000.9991.000

Missing values

2024-04-17T19:38:10.748489image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-04-17T19:38:10.952704image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-04-17T19:38:11.165977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

유동인구_속성조사_2013Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15
134241342218-2005<NA><NA>남자<NA><NA>30-39세<NA>역/정류장/교통수단이용월1~2회<NA><NA>보통자영업2013
9241923912-067<NA><NA>여자<NA><NA>40-49세<NA>귀가/퇴근/하교매일<NA><NA>보통전문/자유직2013
189781897624-212<NA><NA>남자<NA><NA>40-49세<NA>출근매일<NA><NA>보통판매/서비스직2013
187341873224-2034<NA><NA>여자<NA><NA>40-49세<NA>귀가/퇴근/하교주3~5회<NA><NA>보통판매/서비스직2013
195311952925-2037<NA><NA>남자<NA><NA>50-59세<NA>업무관련주3~5회<NA><NA>약간불만족사무/기술직2013
7442744010-048<NA><NA>남자<NA><NA>20-29세<NA>여가/오락/친교/모임/식사(회식,음료포함)주1~2회<NA><NA>약간만족(대)학생2013
1509150702-063<NA><NA>여자<NA><NA>60세이상<NA>그냥걸으려고(운동,산책,기분전환등)주1~2회<NA><NA>약간불만족(전업)주부2013
3936393405-057<NA><NA>남자<NA><NA>30-39세<NA>업무관련6개월1~3회<NA><NA>보통자영업2013
9316931412-094<NA><NA>여자<NA><NA>50-59세<NA>누군가를데리러(또는데려다주러)주1~2회<NA><NA>보통(전업)주부2013
5355535307-027<NA><NA>여자<NA><NA>60세이상<NA>그냥걸으려고(운동,산책,기분전환등)매일<NA><NA>약간만족(전업)주부2013
유동인구_속성조사_2013Unnamed: 1Unnamed: 2Unnamed: 3Unnamed: 4Unnamed: 5Unnamed: 6Unnamed: 7Unnamed: 8Unnamed: 9Unnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15
5230522806-3097<NA><NA>여자<NA><NA>60세이상<NA>개인용무/집안일(병원,은행,관공서,종교활동,봉사활동주3~5회<NA><NA>약간불만족무직/기타2013
150971509521-012<NA><NA>남자<NA><NA>20-29세<NA>개인용무/집안일(병원,은행,관공서,종교활동,봉사활동주3~5회<NA><NA>보통무직/기타2013
41140901-2040<NA><NA>여자<NA><NA>50-59세<NA>업무관련주3~5회<NA><NA>약간불만족전문/자유직2013
177631776123-431<NA><NA>남자<NA><NA>50-59세<NA>출근주3~5회<NA><NA>매우만족일용/작업직2013
147251472320-030<NA><NA>남자<NA><NA>50-59세<NA>출근매일<NA><NA>매우불만족사무/기술직2013
8203820111-173<NA><NA>남자<NA><NA>40-49세<NA>출근매일<NA><NA>보통사무/기술직2013
3974397205-073<NA><NA>여자<NA><NA>40-49세<NA>여가/오락/친교/모임/식사(회식,음료포함)오늘처음<NA><NA>보통(전업)주부2013
143581435619-1306<NA><NA>여자<NA><NA>30-39세<NA>업무관련주3~5회<NA><NA>보통전문/자유직2013
7437743510-048<NA><NA>남자<NA><NA>50-59세<NA>개인용무/집안일(병원,은행,관공서,종교활동,봉사활동주1~2회<NA><NA>보통사무/기술직2013
7049704709-067<NA><NA>여자<NA><NA>20-29세<NA>역/정류장/교통수단이용주3~5회<NA><NA>보통판매/서비스직2013