Amino acid dipeptide frequency for Issyk-Kul virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
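As an illustration of what such a frequency means, the sketch below counts overlapping adjacent residue pairs in a set of protein sequences and expresses each count in per mille of all pairs. It is only a minimal sketch: the function name `dipeptide_permille` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes in the table, and pooling pairs across all proteins without spanning protein boundaries is an assumption, not a description of how the database computes its values.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs: "MKVL" -> "MK", "KV", "VL"
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the Issyk-Kul virus proteome.
toy = ["MKVLA", "MKV"]
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because every pair is counted exactly once, the per mille values of one proteome sum to 1000.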

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
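The arithmetic behind the expected value, and the over/under labelling, can be sketched as follows. This is a hedged illustration: the helper names `expected_permille` and `classify` are hypothetical, and a plain comparison against the uniform expectation is an assumption (the page does not say whether a tolerance is applied). The three sample values are taken from the table.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, expected):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"    # shown in red in the table
    if freq_permille < expected:
        return "under"   # shown in blue in the table
    return "as expected"


expected = expected_permille(20)       # 1000 / 400 = 2.5 per mille = 0.25%
expected_with_xaa = expected_permille(21)  # 1000 / 441, about 2.27 per mille

# Values in per mille, copied from the table for this proteome.
sample = {"LeuLeu": 10.461, "AlaAla": 2.892, "TrpTyr": 0.17}
labels = {pair: classify(value, expected) for pair, value in sample.items()}
print(expected, labels)
```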

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.892 ± 1.009
AlaCys: 2.381 ± 0.263
AlaAsp: 2.296 ± 0.04
AlaGlu: 3.657 ± 0.115
AlaPhe: 1.361 ± 0.413
AlaGly: 1.956 ± 0.517
AlaHis: 1.021 ± 0.161
AlaIle: 4.337 ± 0.38
AlaLys: 3.232 ± 0.353
AlaLeu: 4.508 ± 0.828
AlaMet: 0.936 ± 0.165
AlaAsn: 2.722 ± 0.102
AlaPro: 1.276 ± 0.398
AlaGln: 1.106 ± 0.25
AlaArg: 2.466 ± 0.577
AlaSer: 3.997 ± 0.639
AlaThr: 3.232 ± 1.035
AlaVal: 4.933 ± 0.829
AlaTrp: 1.276 ± 0.196
AlaTyr: 1.191 ± 0.218
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.786 ± 0.396
CysCys: 1.361 ± 0.108
CysAsp: 0.34 ± 0.115
CysGlu: 1.191 ± 0.367
CysPhe: 1.871 ± 0.136
CysGly: 0.936 ± 0.307
CysHis: 0.85 ± 0.289
CysIle: 2.637 ± 0.267
CysLys: 2.296 ± 0.605
CysLeu: 1.871 ± 0.173
CysMet: 0.85 ± 0.289
CysAsn: 1.701 ± 0.765
CysPro: 1.871 ± 0.524
CysGln: 0.68 ± 0.229
CysArg: 1.106 ± 0.078
CysSer: 1.531 ± 0.263
CysThr: 1.531 ± 0.263
CysVal: 0.85 ± 0.289
CysTrp: 0.51 ± 0.211
CysTyr: 0.85 ± 0.106
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.786 ± 0.024
AspCys: 1.191 ± 0.218
AspAsp: 2.807 ± 0.512
AspGlu: 3.317 ± 0.496
AspPhe: 2.211 ± 0.471
AspGly: 2.977 ± 0.196
AspHis: 1.021 ± 0.236
AspIle: 4.933 ± 0.232
AspLys: 3.402 ± 0.531
AspLeu: 5.103 ± 0.614
AspMet: 2.126 ± 0.348
AspAsn: 2.381 ± 0.524
AspPro: 1.106 ± 0.078
AspGln: 1.361 ± 0.275
AspArg: 1.361 ± 0.14
AspSer: 4.763 ± 0.887
AspThr: 1.956 ± 0.039
AspVal: 3.232 ± 0.718
AspTrp: 1.021 ± 0.236
AspTyr: 2.211 ± 0.138
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.252 ± 0.008
GluCys: 0.765 ± 0.184
GluAsp: 4.167 ± 0.97
GluGlu: 4.848 ± 1.199
GluPhe: 3.742 ± 0.233
GluGly: 4.848 ± 0.404
GluHis: 2.551 ± 0.207
GluIle: 3.147 ± 0.575
GluLys: 4.763 ± 0.55
GluLeu: 7.229 ± 1.443
GluMet: 2.211 ± 0.512
GluAsn: 4.508 ± 0.901
GluPro: 1.616 ± 0.265
GluGln: 1.701 ± 0.211
GluArg: 2.807 ± 0.203
GluSer: 4.763 ± 0.362
GluThr: 3.402 ± 0.289
GluVal: 6.209 ± 0.381
GluTrp: 0.425 ± 0.133
GluTyr: 1.021 ± 0.071
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.126 ± 0.44
PheCys: 1.361 ± 0.5
PheAsp: 1.531 ± 0.277
PheGlu: 2.892 ± 0.514
PhePhe: 2.211 ± 0.247
PheGly: 2.551 ± 0.431
PheHis: 1.446 ± 0.322
PheIle: 1.956 ± 0.417
PheLys: 4.593 ± 0.117
PheLeu: 5.613 ± 0.114
PheMet: 1.021 ± 0.344
PheAsn: 2.126 ± 0.094
PhePro: 1.361 ± 0.275
PheGln: 1.191 ± 0.401
PheArg: 1.021 ± 0.344
PheSer: 5.868 ± 0.144
PheThr: 2.296 ± 0.04
PheVal: 1.106 ± 0.078
PheTrp: 0.51 ± 0.036
PheTyr: 1.361 ± 0.459
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.296 ± 0.341
GlyCys: 1.616 ± 0.642
GlyAsp: 2.977 ± 0.57
GlyGlu: 2.807 ± 0.245
GlyPhe: 1.276 ± 0.046
GlyGly: 1.956 ± 0.253
GlyHis: 1.531 ± 0.263
GlyIle: 3.317 ± 0.134
GlyLys: 4.423 ± 0.466
GlyLeu: 4.848 ± 0.539
GlyMet: 1.531 ± 0.277
GlyAsn: 2.381 ± 0.116
GlyPro: 2.126 ± 0.356
GlyGln: 1.446 ± 0.353
GlyArg: 2.807 ± 0.388
GlySer: 3.062 ± 0.184
GlyThr: 3.232 ± 0.862
GlyVal: 2.126 ± 0.094
GlyTrp: 0.425 ± 0.137
GlyTyr: 1.021 ± 0.364
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.361 ± 0.314
HisCys: 0.51 ± 0.036
HisAsp: 0.68 ± 0.245
HisGlu: 1.361 ± 0.14
HisPhe: 1.361 ± 0.108
HisGly: 1.361 ± 0.5
HisHis: 0.68 ± 0.054
HisIle: 1.701 ± 0.166
HisLys: 1.361 ± 0.14
HisLeu: 2.296 ± 0.04
HisMet: 0.85 ± 0.273
HisAsn: 0.85 ± 0.106
HisPro: 0.595 ± 0.176
HisGln: 0.595 ± 0.093
HisArg: 1.021 ± 0.236
HisSer: 2.381 ± 0.55
HisThr: 1.786 ± 0.142
HisVal: 2.296 ± 0.04
HisTrp: 0.17 ± 0.057
HisTyr: 1.276 ± 0.046
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.062 ± 0.569
IleCys: 1.701 ± 0.216
IleAsp: 1.871 ± 0.136
IleGlu: 3.232 ± 0.369
IlePhe: 1.701 ± 0.099
IleGly: 2.041 ± 0.322
IleHis: 1.276 ± 0.046
IleIle: 4.508 ± 0.897
IleLys: 6.634 ± 0.961
IleLeu: 6.719 ± 0.292
IleMet: 1.361 ± 0.459
IleAsn: 3.912 ± 0.645
IlePro: 1.956 ± 0.285
IleGln: 2.807 ± 0.48
IleArg: 3.232 ± 0.422
IleSer: 5.783 ± 0.529
IleThr: 4.423 ± 0.663
IleVal: 2.892 ± 0.103
IleTrp: 0.936 ± 0.494
IleTyr: 2.041 ± 0.322
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.082 ± 0.395
LysCys: 0.68 ± 0.054
LysAsp: 5.443 ± 1.132
LysGlu: 5.783 ± 0.705
LysPhe: 4.082 ± 0.645
LysGly: 4.337 ± 0.47
LysHis: 2.041 ± 0.503
LysIle: 3.997 ± 0.297
LysLys: 5.528 ± 0.702
LysLeu: 7.654 ± 0.277
LysMet: 1.786 ± 0.172
LysAsn: 2.126 ± 0.284
LysPro: 2.892 ± 0.242
LysGln: 2.722 ± 0.164
LysArg: 4.508 ± 0.252
LysSer: 6.974 ± 0.737
LysThr: 5.273 ± 1.363
LysVal: 6.294 ± 0.148
LysTrp: 0.595 ± 0.176
LysTyr: 1.701 ± 0.321
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.678 ± 0.584
LeuCys: 2.126 ± 0.284
LeuAsp: 5.783 ± 0.413
LeuGlu: 7.739 ± 0.799
LeuPhe: 4.337 ± 0.38
LeuGly: 4.082 ± 0.283
LeuHis: 2.637 ± 0.304
LeuIle: 5.783 ± 0.686
LeuLys: 7.91 ± 0.252
LeuLeu: 10.461 ± 2.904
LeuMet: 2.126 ± 0.473
LeuAsn: 5.528 ± 0.558
LeuPro: 4.423 ± 0.422
LeuGln: 2.381 ± 0.341
LeuArg: 3.657 ± 0.721
LeuSer: 9.355 ± 0.356
LeuThr: 7.91 ± 0.347
LeuVal: 4.763 ± 0.871
LeuTrp: 0.68 ± 0.313
LeuTyr: 4.167 ± 0.24
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.616 ± 0.484
MetCys: 0.85 ± 0.108
MetAsp: 1.021 ± 0.192
MetGlu: 2.041 ± 0.124
MetPhe: 1.531 ± 0.107
MetGly: 1.021 ± 0.071
MetHis: 0.425 ± 0.133
MetIle: 0.68 ± 0.229
MetLys: 1.786 ± 0.361
MetLeu: 2.551 ± 0.344
MetMet: 0.85 ± 0.106
MetAsn: 1.021 ± 0.161
MetPro: 0.936 ± 0.129
MetGln: 1.021 ± 0.344
MetArg: 1.701 ± 0.099
MetSer: 2.807 ± 0.324
MetThr: 0.68 ± 0.054
MetVal: 1.106 ± 0.146
MetTrp: 0.255 ± 0.113
MetTyr: 0.34 ± 0.115
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.381 ± 0.604
AsnCys: 1.361 ± 0.314
AsnAsp: 2.296 ± 0.341
AsnGlu: 2.807 ± 0.512
AsnPhe: 3.147 ± 0.106
AsnGly: 1.276 ± 0.251
AsnHis: 1.701 ± 0.389
AsnIle: 3.572 ± 0.215
AsnLys: 3.232 ± 0.54
AsnLeu: 7.399 ± 1.005
AsnMet: 1.191 ± 0.218
AsnAsn: 1.786 ± 0.361
AsnPro: 1.616 ± 0.567
AsnGln: 1.021 ± 0.421
AsnArg: 1.871 ± 0.267
AsnSer: 5.018 ± 0.223
AsnThr: 3.912 ± 0.634
AsnVal: 1.701 ± 0.953
AsnTrp: 1.276 ± 0.046
AsnTyr: 1.871 ± 0.432
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.616 ± 0.313
ProCys: 0.34 ± 0.115
ProAsp: 2.041 ± 0.557
ProGlu: 3.487 ± 0.225
ProPhe: 0.85 ± 0.289
ProGly: 1.871 ± 0.271
ProHis: 0.51 ± 0.036
ProIle: 2.381 ± 0.208
ProLys: 3.232 ± 0.478
ProLeu: 2.637 ± 0.208
ProMet: 1.021 ± 0.071
ProAsn: 1.531 ± 0.107
ProPro: 0.765 ± 0.079
ProGln: 0.51 ± 0.227
ProArg: 1.786 ± 0.207
ProSer: 4.678 ± 0.41
ProThr: 1.786 ± 0.585
ProVal: 2.296 ± 0.605
ProTrp: 0.425 ± 0.133
ProTyr: 0.51 ± 0.172
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.381 ± 0.291
GlnCys: 1.191 ± 0.072
GlnAsp: 0.68 ± 0.229
GlnGlu: 2.466 ± 0.209
GlnPhe: 1.106 ± 0.328
GlnGly: 1.106 ± 0.25
GlnHis: 1.021 ± 0.071
GlnIle: 1.786 ± 0.207
GlnLys: 2.296 ± 0.53
GlnLeu: 2.977 ± 0.569
GlnMet: 0.936 ± 0.275
GlnAsn: 1.956 ± 0.339
GlnPro: 1.191 ± 0.367
GlnGln: 1.956 ± 0.228
GlnArg: 2.126 ± 0.094
GlnSer: 1.786 ± 0.396
GlnThr: 2.381 ± 0.263
GlnVal: 1.361 ± 0.108
GlnTrp: 0.595 ± 0.176
GlnTyr: 0.936 ± 0.102
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.637 ± 0.208
ArgCys: 1.361 ± 0.108
ArgAsp: 3.062 ± 0.931
ArgGlu: 1.616 ± 0.525
ArgPhe: 2.466 ± 0.229
ArgGly: 2.126 ± 0.238
ArgHis: 1.531 ± 0.107
ArgIle: 3.232 ± 0.313
ArgLys: 3.827 ± 0.141
ArgLeu: 3.402 ± 0.665
ArgMet: 0.68 ± 0.054
ArgAsn: 2.892 ± 0.103
ArgPro: 1.616 ± 0.265
ArgGln: 2.041 ± 0.181
ArgArg: 2.466 ± 0.03
ArgSer: 4.593 ± 0.46
ArgThr: 2.126 ± 0.094
ArgVal: 2.296 ± 0.835
ArgTrp: 0.51 ± 0.036
ArgTyr: 0.68 ± 0.054
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.188 ± 0.099
SerCys: 3.062 ± 0.775
SerAsp: 4.678 ± 0.274
SerGlu: 6.719 ± 0.402
SerPhe: 3.997 ± 0.181
SerGly: 6.294 ± 0.169
SerHis: 1.191 ± 0.311
SerIle: 5.358 ± 0.998
SerLys: 7.739 ± 1.25
SerLeu: 7.144 ± 1.065
SerMet: 1.191 ± 0.185
SerAsn: 2.041 ± 0.503
SerPro: 2.041 ± 0.293
SerGln: 3.487 ± 0.314
SerArg: 3.232 ± 0.198
SerSer: 9.866 ± 0.241
SerThr: 7.739 ± 2.038
SerVal: 7.569 ± 0.487
SerTrp: 0.595 ± 0.24
SerTyr: 2.722 ± 0.217
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.722 ± 0.701
ThrCys: 1.191 ± 0.93
ThrAsp: 3.402 ± 0.596
ThrGlu: 4.423 ± 0.296
ThrPhe: 2.892 ± 0.789
ThrGly: 2.637 ± 0.442
ThrHis: 1.191 ± 0.367
ThrIle: 3.232 ± 0.478
ThrLys: 4.337 ± 0.294
ThrLeu: 6.549 ± 0.355
ThrMet: 1.531 ± 0.446
ThrAsn: 4.678 ± 1.228
ThrPro: 3.062 ± 1.078
ThrGln: 1.616 ± 0.42
ThrArg: 3.317 ± 0.73
ThrSer: 6.038 ± 1.273
ThrThr: 4.252 ± 1.445
ThrVal: 4.593 ± 1.314
ThrTrp: 1.361 ± 0.314
ThrTyr: 2.296 ± 0.04
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.637 ± 0.442
ValCys: 1.701 ± 0.393
ValAsp: 3.062 ± 0.355
ValGlu: 5.358 ± 0.434
ValPhe: 2.126 ± 0.095
ValGly: 2.296 ± 0.729
ValHis: 1.276 ± 0.193
ValIle: 3.487 ± 0.085
ValLys: 5.188 ± 0.279
ValLeu: 7.059 ± 0.422
ValMet: 1.021 ± 0.236
ValAsn: 2.977 ± 0.346
ValPro: 2.807 ± 0.626
ValGln: 2.892 ± 0.577
ValArg: 2.211 ± 0.157
ValSer: 4.337 ± 0.645
ValThr: 5.358 ± 0.195
ValVal: 4.337 ± 0.696
ValTrp: 0.68 ± 0.343
ValTyr: 0.85 ± 0.192
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.255 ± 0.182
TrpCys: 0.51 ± 0.211
TrpAsp: 0.936 ± 0.307
TrpGlu: 1.191 ± 0.072
TrpPhe: 0.51 ± 0.364
TrpGly: 0.765 ± 0.079
TrpHis: 0.255 ± 0.113
TrpIle: 0.68 ± 0.054
TrpLys: 0.85 ± 0.108
TrpLeu: 1.531 ± 0.211
TrpMet: 0.595 ± 0.24
TrpAsn: 0.51 ± 0.172
TrpPro: 0.34 ± 0.079
TrpGln: 0.34 ± 0.115
TrpArg: 0.85 ± 0.265
TrpSer: 0.85 ± 0.106
TrpThr: 0.68 ± 0.343
TrpVal: 0.85 ± 0.192
TrpTrp: 0.34 ± 0.079
TrpTyr: 0.17 ± 0.057
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.106 ± 0.33
TyrCys: 1.361 ± 0.314
TyrAsp: 1.191 ± 0.072
TyrGlu: 2.041 ± 0.245
TyrPhe: 1.361 ± 0.314
TyrGly: 0.765 ± 0.079
TyrHis: 0.34 ± 0.115
TyrIle: 1.361 ± 0.108
TyrLys: 1.871 ± 0.446
TyrLeu: 2.977 ± 0.569
TyrMet: 0.34 ± 0.115
TyrAsn: 2.807 ± 0.467
TyrPro: 0.68 ± 0.054
TyrGln: 1.361 ± 0.14
TyrArg: 1.616 ± 0.305
TyrSer: 3.402 ± 0.199
TyrThr: 1.361 ± 0.239
TyrVal: 0.85 ± 0.289
TyrTrp: 0.425 ± 0.137
TyrTyr: 1.021 ± 0.161
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (11759 amino acids)

Note: The error was estimated by bootstrapping (100 rounds) at the protein level
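One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. This is a hedged sketch rather than the database's actual procedure: the function names, the fixed seed, the use of the sample standard deviation, and the toy one-letter sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide in per mille, pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        resampled = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome of 5 short proteins (not the real sequences).
proteins = ["MKVLAAGLL", "MSLLKVAA", "MAALLSKV", "MKKLLAAV", "MLLAAKVS"]
point = pair_permille(proteins, "LL")
error = bootstrap_error(proteins, "LL")
print(f"LL: {point:.3f} ± {error:.3f}")
```

With only a handful of proteins, as here, each resample can differ a lot from the original set, which is why some entries in the table carry comparatively large errors.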

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski