Overview

Dataset statistics

Number of variables: 7
Number of observations: 109
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.4 KiB
Average record size in memory: 60.2 B

Variable types

Numeric: 3
Categorical: 3
Text: 1

Dataset

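Description: 일련번호 (serial number), FAQ구분 (FAQ type), FAQ구분명 (FAQ type name), 대분류코드 (major category code), 대분류명 (major category name), 질문 (question), 수정일시 (last-modified timestamp)
Author: 120다산콜재단
URL: https://data.seoul.go.kr/dataList/OA-1127/S/1/datasetView.do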

Alerts

FAQ구분명 is highly overall correlated with 대분류코드 and 2 other fields (High correlation)
FAQ구분 is highly overall correlated with 대분류코드 and 2 other fields (High correlation)
대분류코드 is highly overall correlated with FAQ구분 and 2 other fields (High correlation)
대분류명 is highly overall correlated with 대분류코드 and 2 other fields (High correlation)
일련번호 has unique values (Unique)
질문 has unique values (Unique)

Reproduction

Analysis started: 2023-12-11 07:18:52.408613
Analysis finished: 2023-12-11 07:18:54.319271
Duration: 1.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

일련번호
Real number (ℝ)

UNIQUE 

Distinct: 109
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 289596.61
Minimum: 289412
Maximum: 289825
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 289412
5th percentile: 289435.4
Q1: 289502
Median: 289568
Q3: 289695
95th percentile: 289802.6
Maximum: 289825
Range: 413
Interquartile range (IQR): 193

Descriptive statistics

Standard deviation: 117.84972
Coefficient of variation (CV): 0.0004069444
Kurtosis: -1.1592923
Mean: 289596.61
Median Absolute Deviation (MAD): 97
Skewness: 0.29540769
Sum: 31566030
Variance: 13888.556
Monotonicity: Not monotonic
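
As a quick sanity check, several of the derived figures in this section can be recomputed from the reported summary values. The sketch below is plain Python using only numbers copied from the report; the variable names are illustrative, and small differences are expected because the report rounds its output.

```python
import math

# Figures copied from the 일련번호 section of the report.
n = 109
mean = 289596.61
std = 117.84972
minimum, maximum = 289412, 289825
q1, q3 = 289502, 289695

value_range = maximum - minimum   # report lists Range as 413
iqr = q3 - q1                     # report lists IQR as 193
cv = std / mean                   # report lists CV as 0.0004069444
variance = std ** 2               # report lists Variance as 13888.556
total = mean * n                  # report lists Sum as 31566030

print(value_range, iqr)
print(f"{cv:.10f} {variance:.3f} {total:.0f}")
```

The same relationships (range, IQR, CV, variance, sum) should hold for the other numeric variables in the report, up to rounding.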
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
289427 | 1 | 0.9%
289458 | 1 | 0.9%
289519 | 1 | 0.9%
289517 | 1 | 0.9%
289516 | 1 | 0.9%
289515 | 1 | 0.9%
289510 | 1 | 0.9%
289498 | 1 | 0.9%
289495 | 1 | 0.9%
289491 | 1 | 0.9%
Other values (99) | 99 | 90.8%

Value | Count | Frequency (%)
289412 | 1 | 0.9%
289415 | 1 | 0.9%
289427 | 1 | 0.9%
289431 | 1 | 0.9%
289434 | 1 | 0.9%
289435 | 1 | 0.9%
289436 | 1 | 0.9%
289437 | 1 | 0.9%
289443 | 1 | 0.9%
289448 | 1 | 0.9%

Value | Count | Frequency (%)
289825 | 1 | 0.9%
289808 | 1 | 0.9%
289806 | 1 | 0.9%
289805 | 1 | 0.9%
289804 | 1 | 0.9%
289803 | 1 | 0.9%
289802 | 1 | 0.9%
289801 | 1 | 0.9%
289800 | 1 | 0.9%
289799 | 1 | 0.9%

FAQ구분
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B
J: 44
S: 44
F: 21

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: F
2nd row: F
3rd row: F
4th row: F
5th row: F

Common Values

Value | Count | Frequency (%)
J | 44 | 40.4%
S | 44 | 40.4%
F | 21 | 19.3%
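
The Frequency (%) column appears to be each count divided by the 109 observations. A minimal sketch, assuming that definition and rebuilding the column from the counts listed above (the `values` list is a hypothetical stand-in for the real FAQ구분 column):

```python
from collections import Counter

# Category counts as listed in the Common Values table (109 rows in total).
values = ["J"] * 44 + ["S"] * 44 + ["F"] * 21
counts = Counter(values)
total = sum(counts.values())

# Share of each category in percent, rounded to one decimal place.
shares = {value: round(100 * count / total, 1) for value, count in counts.items()}

for value, count in counts.most_common():
    print(f"{value} | {count} | {shares[value]}%")
```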

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
j | 44 | 40.4%
s | 44 | 40.4%
f | 21 | 19.3%

FAQ구분명
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B
자치구 업무메뉴얼: 44
서울시 업무매뉴얼: 44
FAQ: 21

Length

Max length: 9
Median length: 9
Mean length: 7.8440367
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: FAQ
2nd row: FAQ
3rd row: FAQ
4th row: FAQ
5th row: FAQ

Common Values

Value | Count | Frequency (%)
자치구 업무메뉴얼 | 44 | 40.4%
서울시 업무매뉴얼 | 44 | 40.4%
FAQ | 21 | 19.3%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
자치구 | 44 | 22.3%
업무메뉴얼 | 44 | 22.3%
서울시 | 44 | 22.3%
업무매뉴얼 | 44 | 22.3%
faq | 21 | 10.7%

대분류코드
Real number (ℝ)

HIGH CORRELATION 

Distinct: 21
Distinct (%): 19.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 22214188
Minimum: 22213012
Maximum: 22214339
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 22213012
5th percentile: 22214024
Q1: 22214080
Median: 22214161
Q3: 22214330
95th percentile: 22214339
Maximum: 22214339
Range: 1327
Interquartile range (IQR): 250

Descriptive statistics

Standard deviation: 171.67006
Coefficient of variation (CV): 7.7279465 × 10⁻⁶
Kurtosis: 18.889764
Mean: 22214188
Median Absolute Deviation (MAD): 137
Skewness: -2.9538234
Sum: 2.4213465 × 10⁹
Variance: 29470.608
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=21)
Value | Count | Frequency (%)
22214080 | 14 | 12.8%
22214161 | 12 | 11.0%
22214047 | 10 | 9.2%
22214061 | 7 | 6.4%
22214024 | 7 | 6.4%
22214335 | 7 | 6.4%
22214339 | 7 | 6.4%
22214324 | 6 | 5.5%
22214327 | 6 | 5.5%
22214330 | 5 | 4.6%
Other values (11) | 28 | 25.7%

Value | Count | Frequency (%)
22213012 | 1 | 0.9%
22214000 | 2 | 1.8%
22214024 | 7 | 6.4%
22214047 | 10 | 9.2%
22214061 | 7 | 6.4%
22214080 | 14 | 12.8%
22214095 | 2 | 1.8%
22214136 | 1 | 0.9%
22214153 | 3 | 2.8%
22214161 | 12 | 11.0%

Value | Count | Frequency (%)
22214339 | 7 | 6.4%
22214338 | 4 | 3.7%
22214337 | 2 | 1.8%
22214335 | 7 | 6.4%
22214332 | 3 | 2.8%
22214331 | 1 | 0.9%
22214330 | 5 | 4.6%
22214329 | 5 | 4.6%
22214328 | 4 | 3.7%
22214327 | 6 | 5.5%

대분류명
Categorical

HIGH CORRELATION 

Distinct: 21
Distinct (%): 19.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B
사회보장과복지: 14
제조건설과개발: 12
기업과경제: 10
수송및교통: 7
지역개발: 7
Other values (16): 59

Length

Max length: 7
Median length: 6
Mean length: 5.2293578
Min length: 2

Unique

Unique: 3
Unique (%): 2.8%

Sample

1st row: 일반공공행정
2nd row: 문화와여가
3rd row: 문화와여가
4th row: 문화와여가
5th row: 문화와여가

Common Values

Value | Count | Frequency (%)
사회보장과복지 | 14 | 12.8%
제조건설과개발 | 12 | 11.0%
기업과경제 | 10 | 9.2%
수송및교통 | 7 | 6.4%
지역개발 | 7 | 6.4%
문화와여가 | 7 | 6.4%
구정일반 | 7 | 6.4%
일반공공행정 | 6 | 5.5%
산업중소기업 | 6 | 5.5%
교육 | 5 | 4.6%
Other values (11) | 28 | 25.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
사회보장과복지 | 14 | 12.8%
제조건설과개발 | 12 | 11.0%
기업과경제 | 10 | 9.2%
수송및교통 | 7 | 6.4%
지역개발 | 7 | 6.4%
문화와여가 | 7 | 6.4%
구정일반 | 7 | 6.4%
일반공공행정 | 6 | 5.5%
산업중소기업 | 6 | 5.5%
사회복지 | 5 | 4.6%
Other values (11) | 28 | 25.7%

질문
Text

UNIQUE 

Distinct: 109
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1004.0 B

Length

Max length: 202
Median length: 45
Mean length: 26.587156
Min length: 3

Characters and Unicode

Total characters: 2898
Distinct characters: 387
Distinct categories: 13
Distinct scripts: 3
Distinct blocks: 5
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
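
To illustrate what these character properties look like in practice, the sketch below uses Python's standard `unicodedata` module on one of the sample 질문 values from this report. It is not how the profiling tool computes its figures, only an independent illustration; `Lo`, `Lu`, and `Zs` are the Unicode codes for Other Letter, Uppercase Letter, and Space Separator.

```python
import unicodedata
from collections import Counter

# One of the sample 질문 values shown in this report.
text = "궁동체육관 FAQ"

# Map every character to its Unicode general category code (Lo, Lu, Zs, ...).
categories = Counter(unicodedata.category(ch) for ch in text)

print(dict(categories))
```

Summing such counts over all 109 values would give a per-category breakdown of the kind reported under "Most occurring categories".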

Unique

Unique: 109
Unique (%): 100.0%

Sample

1st row: [시ㆍ구정외 타기관 관련 상담] 고용노동부 [일자리 안정자금]
2nd row: 서대문문화체육회관 FAQ
3rd row: 서대문구립인조잔디구장 FAQ
4th row: 궁동체육관 FAQ
5th row: 홍제배드민턴장 FAQ
Value | Count | Frequency (%)
(not preserved) | 12 | 2.2%
(not preserved) | 10 | 1.9%
한예종 | 8 | 1.5%
관련 | 8 | 1.5%
추진 | 7 | 1.3%
업무 | 6 | 1.1%
관리 | 6 | 1.1%
유치 | 6 | 1.1%
총괄 | 5 | 0.9%
테스트db_교육용 | 4 | 0.7%
Other values (399) | 467 | 86.6%

Most occurring characters

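Value | Count | Frequency (%)
(space) | 591 | 20.4%
(not preserved) | 56 | 1.9%
(not preserved) | 54 | 1.9%
(not preserved) | 48 | 1.7%
(not preserved) | 47 | 1.6%
(not preserved) | 41 | 1.4%
) | 40 | 1.4%
( | 40 | 1.4%
(not preserved) | 36 | 1.2%
(not preserved) | 33 | 1.1%
Other values (377) | 1912 | 66.0%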

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1956 | 67.5%
Space Separator | 591 | 20.4%
Decimal Number | 99 | 3.4%
Uppercase Letter | 69 | 2.4%
Close Punctuation | 56 | 1.9%
Open Punctuation | 56 | 1.9%
Other Punctuation | 32 | 1.1%
Lowercase Letter | 19 | 0.7%
Connector Punctuation | 11 | 0.4%
Dash Punctuation | 4 | 0.1%
Other values (3) | 5 | 0.2%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 56 | 2.9%
(not preserved) | 54 | 2.8%
(not preserved) | 48 | 2.5%
(not preserved) | 47 | 2.4%
(not preserved) | 41 | 2.1%
(not preserved) | 36 | 1.8%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 31 | 1.6%
(not preserved) | 27 | 1.4%
Other values (320) | 1551 | 79.3%

Uppercase Letter
Value | Count | Frequency (%)
B | 9 | 13.0%
D | 8 | 11.6%
S | 8 | 11.6%
F | 6 | 8.7%
A | 6 | 8.7%
O | 5 | 7.2%
Q | 4 | 5.8%
W | 4 | 5.8%
C | 4 | 5.8%
E | 3 | 4.3%
Other values (8) | 12 | 17.4%

Lowercase Letter
Value | Count | Frequency (%)
a | 4 | 21.1%
e | 2 | 10.5%
r | 2 | 10.5%
o | 2 | 10.5%
t | 1 | 5.3%
p | 1 | 5.3%
y | 1 | 5.3%
u | 1 | 5.3%
l | 1 | 5.3%
n | 1 | 5.3%
Other values (3) | 3 | 15.8%

Decimal Number
Value | Count | Frequency (%)
1 | 29 | 29.3%
2 | 24 | 24.2%
0 | 14 | 14.1%
3 | 12 | 12.1%
8 | 6 | 6.1%
4 | 4 | 4.0%
6 | 3 | 3.0%
5 | 3 | 3.0%
9 | 2 | 2.0%
7 | 2 | 2.0%

Other Punctuation
Value | Count | Frequency (%)
. | 13 | 40.6%
? | 6 | 18.8%
/ | 5 | 15.6%
: | 5 | 15.6%
(not preserved) | 3 | 9.4%

Close Punctuation
Value | Count | Frequency (%)
) | 40 | 71.4%
] | 16 | 28.6%

Open Punctuation
Value | Count | Frequency (%)
( | 40 | 71.4%
[ | 16 | 28.6%

Dash Punctuation
Value | Count | Frequency (%)
(not preserved) | 3 | 75.0%
- | 1 | 25.0%

Space Separator
Value | Count | Frequency (%)
(space) | 591 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 11 | 100.0%

Math Symbol
Value | Count | Frequency (%)
~ | 3 | 100.0%

Initial Punctuation
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1956 | 67.5%
Common | 854 | 29.5%
Latin | 88 | 3.0%

Most frequent character per script

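Hangul
Value | Count | Frequency (%)
(not preserved) | 56 | 2.9%
(not preserved) | 54 | 2.8%
(not preserved) | 48 | 2.5%
(not preserved) | 47 | 2.4%
(not preserved) | 41 | 2.1%
(not preserved) | 36 | 1.8%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 31 | 1.6%
(not preserved) | 27 | 1.4%
Other values (320) | 1551 | 79.3%

Latin
Value | Count | Frequency (%)
B | 9 | 10.2%
D | 8 | 9.1%
S | 8 | 9.1%
F | 6 | 6.8%
A | 6 | 6.8%
O | 5 | 5.7%
Q | 4 | 4.5%
W | 4 | 4.5%
a | 4 | 4.5%
C | 4 | 4.5%
Other values (21) | 30 | 34.1%

Common
Value | Count | Frequency (%)
(space) | 591 | 69.2%
) | 40 | 4.7%
( | 40 | 4.7%
1 | 29 | 3.4%
2 | 24 | 2.8%
] | 16 | 1.9%
[ | 16 | 1.9%
0 | 14 | 1.6%
. | 13 | 1.5%
3 | 12 | 1.4%
Other values (16) | 59 | 6.9%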

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1955 | 67.5%
ASCII | 934 | 32.2%
None | 6 | 0.2%
Punctuation | 2 | 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

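ASCII
Value | Count | Frequency (%)
(space) | 591 | 63.3%
) | 40 | 4.3%
( | 40 | 4.3%
1 | 29 | 3.1%
2 | 24 | 2.6%
] | 16 | 1.7%
[ | 16 | 1.7%
0 | 14 | 1.5%
. | 13 | 1.4%
3 | 12 | 1.3%
Other values (43) | 139 | 14.9%

Hangul
Value | Count | Frequency (%)
(not preserved) | 56 | 2.9%
(not preserved) | 54 | 2.8%
(not preserved) | 48 | 2.5%
(not preserved) | 47 | 2.4%
(not preserved) | 41 | 2.1%
(not preserved) | 36 | 1.8%
(not preserved) | 33 | 1.7%
(not preserved) | 32 | 1.6%
(not preserved) | 31 | 1.6%
(not preserved) | 27 | 1.4%
Other values (319) | 1550 | 79.3%

None
Value | Count | Frequency (%)
(not preserved) | 3 | 50.0%
(not preserved) | 3 | 50.0%

Compat Jamo
Value | Count | Frequency (%)
(not preserved) | 1 | 100.0%

Punctuation
Value | Count | Frequency (%)
(not preserved) | 1 | 50.0%
(not preserved) | 1 | 50.0%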

수정일시
Real number (ℝ)

Distinct: 95
Distinct (%): 87.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0191995 × 10¹³
Minimum: 2.0180107 × 10¹³
Maximum: 2.0200506 × 10¹³
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.1 KiB

Quantile statistics

Minimum: 2.0180107 × 10¹³
5th percentile: 2.0180459 × 10¹³
Q1: 2.0190425 × 10¹³
Median: 2.0190812 × 10¹³
Q3: 2.0200213 × 10¹³
95th percentile: 2.0200414 × 10¹³
Maximum: 2.0200506 × 10¹³
Range: 2.0398998 × 10¹⁰
Interquartile range (IQR): 9.7879809 × 10⁹

Descriptive statistics

Standard deviation: 7.0351052 × 10⁹
Coefficient of variation (CV): 0.0003484106
Kurtosis: -0.98554061
Mean: 2.0191995 × 10¹³
Median Absolute Deviation (MAD): 9.3109916 × 10⁹
Skewness: -0.26740566
Sum: 2.2009275 × 10¹⁵
Variance: 4.9492705 × 10¹⁹
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
20190425161723 | 10 | 9.2%
20190805092932 | 3 | 2.8%
20190816162025 | 2 | 1.8%
20190805092900 | 2 | 1.8%
20190917181020 | 2 | 1.8%
20190522102043 | 1 | 0.9%
20200414143921 | 1 | 0.9%
20190910173023 | 1 | 0.9%
20200414143350 | 1 | 0.9%
20200414142827 | 1 | 0.9%
Other values (85) | 85 | 78.0%

Value | Count | Frequency (%)
20180107143914 | 1 | 0.9%
20180128093739 | 1 | 0.9%
20180213145025 | 1 | 0.9%
20180222162224 | 1 | 0.9%
20180416135603 | 1 | 0.9%
20180420125311 | 1 | 0.9%
20180518094914 | 1 | 0.9%
20180518095015 | 1 | 0.9%
20180518095546 | 1 | 0.9%
20180618153148 | 1 | 0.9%

Value | Count | Frequency (%)
20200506142228 | 1 | 0.9%
20200506102624 | 1 | 0.9%
20200414143921 | 1 | 0.9%
20200414143350 | 1 | 0.9%
20200414143121 | 1 | 0.9%
20200414142827 | 1 | 0.9%
20200414112704 | 1 | 0.9%
20200408141117 | 1 | 0.9%
20200408095734 | 1 | 0.9%
20200325155934 | 1 | 0.9%
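
The 수정일시 values look like timestamps packed into integers as YYYYMMDDHHMMSS (for example 20190425161723). That reading is an inference from the listed values, not something the report states, and the helper name `parse_packed_timestamp` is hypothetical. Under that assumption, a sketch of decoding them so that differences become real time spans rather than digit arithmetic:

```python
from datetime import datetime

def parse_packed_timestamp(value: int) -> datetime:
    """Parse an integer such as 20190425161723 as YYYYMMDDHHMMSS."""
    return datetime.strptime(str(value), "%Y%m%d%H%M%S")

# The most frequent value and the two extremes listed in the report.
most_common = parse_packed_timestamp(20190425161723)
earliest = parse_packed_timestamp(20180107143914)
latest = parse_packed_timestamp(20200506142228)

print(most_common.isoformat())
print(latest - earliest)
```

If this reading is right, figures such as the range or standard deviation reported for this variable describe the packed integers, not elapsed time.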

Interactions

[Figure: 9 interaction plots, not preserved in this text version]

Correlations

Variable | 일련번호 | FAQ구분 | FAQ구분명 | 대분류코드 | 대분류명 | 수정일시
일련번호 | 1.000 | 0.471 | 0.471 | 0.444 | 0.721 | 0.029
FAQ구분 | 0.471 | 1.000 | 1.000 | 0.938 | 0.956 | 0.219
FAQ구분명 | 0.471 | 1.000 | 1.000 | 0.938 | 0.956 | 0.219
대분류코드 | 0.444 | 0.938 | 0.938 | 1.000 | 1.000 | 0.214
대분류명 | 0.721 | 0.956 | 0.956 | 1.000 | 1.000 | 0.584
수정일시 | 0.029 | 0.219 | 0.219 | 0.214 | 0.584 | 1.000

Variable | FAQ구분명 | 대분류명 | FAQ구분
FAQ구분명 | 1.000 | 0.710 | 1.000
대분류명 | 0.710 | 1.000 | 0.710
FAQ구분 | 1.000 | 0.710 | 1.000

Variable | 일련번호 | 대분류코드 | 수정일시 | FAQ구분 | FAQ구분명 | 대분류명
일련번호 | 1.000 | -0.157 | 0.137 | 0.303 | 0.303 | 0.332
대분류코드 | -0.157 | 1.000 | 0.242 | 0.690 | 0.690 | 0.915
수정일시 | 0.137 | 0.242 | 1.000 | 0.203 | 0.203 | 0.335
FAQ구분 | 0.303 | 0.690 | 0.203 | 1.000 | 1.000 | 0.710
FAQ구분명 | 0.303 | 0.690 | 0.203 | 1.000 | 1.000 | 0.710
대분류명 | 0.332 | 0.915 | 0.335 | 0.710 | 0.710 | 1.000
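
The 1.000 entries between FAQ구분 and FAQ구분명 are what one would expect when one column is simply a relabelling of the other. The report text does not say which association measure it used; the sketch below assumes Cramér's V, a common choice for two categorical variables, and implements the plain (uncorrected) form with the standard library. The `cramers_v` function and the rebuilt columns are illustrative only.

```python
from collections import Counter
from math import sqrt

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long categorical sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return sqrt(chi2 / (n * k)) if k > 0 else 0.0

# Columns rebuilt from the reported counts, assuming each code maps to one name.
codes = ["J"] * 44 + ["S"] * 44 + ["F"] * 21
names = ["자치구 업무메뉴얼"] * 44 + ["서울시 업무매뉴얼"] * 44 + ["FAQ"] * 21

v = cramers_v(codes, names)
print(round(v, 3))
```

A one-to-one mapping between two columns gives the maximum value of the statistic, which would be consistent with the High correlation alerts for the code and name pairs.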

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out visual patterns in data completion.

Sample

Row | 일련번호 | FAQ구분 | FAQ구분명 | 대분류코드 | 대분류명 | 질문 | 수정일시
0 | 289427 | F | FAQ | 22214327 | 일반공공행정 | [시ㆍ구정외 타기관 관련 상담] 고용노동부 [일자리 안정자금] | 20190522102043
1 | 289434 | F | FAQ | 22214061 | 문화와여가 | 서대문문화체육회관 FAQ | 20190816162025
2 | 289435 | F | FAQ | 22214061 | 문화와여가 | 서대문구립인조잔디구장 FAQ | 20190816162025
3 | 289436 | F | FAQ | 22214061 | 문화와여가 | 궁동체육관 FAQ | 20190816162014
4 | 289437 | F | FAQ | 22214061 | 문화와여가 | 홍제배드민턴장 FAQ | 20190816162034
5 | 289472 | F | FAQ | 22214153 | 재난과안전 | 양천생활안전체험교육관 | 20180213145025
6 | 289473 | F | FAQ | 22214061 | 문화와여가 | 금천 나도스타 노래부르기 대회 (어린이날 행사) | 20190502160033
7 | 289492 | F | FAQ | 22214061 | 문화와여가 | 브라보 서초문화버스 (셔틀버스) 운행 | 20181128120932
8 | 289612 | F | FAQ | 22214339 | 지역개발 | 마곡산업단지 내 문화시설 건립 | 20180910131534
9 | 289648 | F | FAQ | 22214328 | 농림해양수산 | 서울반려동물교육센터 | 20190408175105

Row | 일련번호 | FAQ구분 | FAQ구분명 | 대분류코드 | 대분류명 | 질문 | 수정일시
99 | 289675 | S | 서울시 업무매뉴얼 | 22214335 | 수송및교통 | [사업종료] 서울시 상습불법주차 발생장소 (중점단속 구간 193개소) | 20200224113053
100 | 289676 | S | 서울시 업무매뉴얼 | 22214332 | 문화체육관광 | 관광약자를 위한 접근성 개선 지원 사업 | 20180719152605
101 | 289683 | S | 서울시 업무매뉴얼 | 22214324 | 산업중소기업 | 제로페이 (서울페이 / 소상공인 결제 서비스) | 20200506102624
102 | 289691 | S | 서울시 업무매뉴얼 | 22214324 | 산업중소기업 | 폭염에 따른 에너지빈곤층 냉방물품 지원사업 | 20200318153525
103 | 289708 | S | 서울시 업무매뉴얼 | 22214330 | 사회복지 | 서울형 갭이어(Gap-year) 지원사업_청년인생설계학교 | 20200224080201
104 | 289712 | S | 서울시 업무매뉴얼 | 22214339 | 지역개발 | 서울식물원(Seoul Botanic Park) | 20200506142228
105 | 289714 | S | 서울시 업무매뉴얼 | 22214330 | 사회복지 | 서울시 중증 뇌병변장애인 일회용품 구입비 지원 | 20200312153044
106 | 289756 | S | 서울시 업무매뉴얼 | 22214332 | 문화체육관광 | 시월 정동축제 | 20200129102726
107 | 289789 | S | 서울시 업무매뉴얼 | 22214337 | 보건 | 서울시민 건강 한마당 | 20200302132817
108 | 289825 | S | 서울시 업무매뉴얼 | 22214338 | 환경보호 | [수도사업소]공공문자 알림서비스 | 20191217185410