Amino acid dipeptide frequency for Kowanyama virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
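As a rough illustration of the counting described above, the following Python sketch tallies overlapping dipeptides within each protein and compares each frequency with the uniform expectation of 1/400. The function names (`dipeptide_frequencies`, `classify`), the one-letter alphabet, and the toy sequences are assumptions made for this example; the database's exact counting and normalization rules are not stated on this page and may differ.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Pairs never span two proteins: a protein of length n contributes n - 1 pairs.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}

def classify(freqs, alphabet=STANDARD):
    """Label every possible dipeptide relative to the uniform expectation."""
    expected = 1 / len(alphabet) ** 2  # 1/400 = 0.0025, i.e. 0.25%
    labels = {}
    for a, b in product(alphabet, repeat=2):
        f = freqs.get(a + b, 0.0)
        labels[a + b] = "over" if f > expected else "under" if f < expected else "expected"
    return labels

# Toy input (hypothetical sequences, not the Kowanyama virus proteome).
toy = ["MLLKLL", "MKLLA"]
freqs = dipeptide_frequencies(toy)
labels = classify(freqs)
print(round(freqs["LL"], 3), labels["LL"], labels["WW"])
```

With only a handful of short sequences most of the 400 dipeptides never occur, so nearly everything is labelled "under"; the comparison becomes meaningful only for larger proteomes.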

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
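The unit conversion can be checked with a few lines of Python. This is only a sketch: the helper name `per_mille_to_fraction` is made up for the example, and the two values are taken from the table on this page.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value (‰) to a plain fraction."""
    return value * 1e-3

# Uniform expectation for 400 dipeptides, expressed in per mille: 2.5‰ (= 0.25%).
EXPECTED_PER_MILLE = 1000 / 400

# Two values taken from the table on this page.
for name, value in [("LeuLeu", 10.651), ("GlyGly", 0.248)]:
    fraction = per_mille_to_fraction(value)
    ratio = value / EXPECTED_PER_MILLE
    print(f"{name}: {value}‰ = {fraction:.6f} = {fraction * 100:.4f}% "
          f"({ratio:.2f}x the uniform expectation)")
```

Expressed in per mille, the uniform expectation is 2.5‰, which makes it easy to see at a glance whether a table entry lies above or below it.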

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) with estimated error, grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 2.477 ± 1.543
AlaCys: 1.239 ± 0.459
AlaAsp: 1.734 ± 0.563
AlaGlu: 2.973 ± 1.299
AlaPhe: 2.973 ± 0.744
AlaGly: 2.725 ± 0.854
AlaHis: 0.743 ± 0.186
AlaIle: 3.468 ± 0.822
AlaLys: 3.716 ± 1.281
AlaLeu: 3.716 ± 0.86
AlaMet: 1.982 ± 0.55
AlaAsn: 2.229 ± 0.801
AlaPro: 1.982 ± 0.519
AlaGln: 2.973 ± 0.269
AlaArg: 2.477 ± 0.277
AlaSer: 2.477 ± 0.594
AlaThr: 2.229 ± 0.536
AlaVal: 1.982 ± 0.519
AlaTrp: 0.495 ± 0.578
AlaTyr: 2.477 ± 1.543
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.486 ± 0.669
CysCys: 0.248 ± 0.222
CysAsp: 1.486 ± 0.993
CysGlu: 1.734 ± 0.865
CysPhe: 1.239 ± 1.111
CysGly: 2.477 ± 1.256
CysHis: 0.248 ± 0.222
CysIle: 1.486 ± 0.406
CysLys: 1.982 ± 1.103
CysLeu: 3.22 ± 0.973
CysMet: 0.991 ± 0.552
CysAsn: 2.477 ± 0.674
CysPro: 0.495 ± 0.445
CysGln: 1.486 ± 0.669
CysArg: 0.743 ± 0.334
CysSer: 1.734 ± 0.589
CysThr: 1.239 ± 0.772
CysVal: 1.486 ± 1.334
CysTrp: 0.743 ± 0.186
CysTyr: 0.495 ± 0.135
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.734 ± 0.563
AspCys: 0.743 ± 0.334
AspAsp: 3.22 ± 1.015
AspGlu: 2.477 ± 0.677
AspPhe: 4.211 ± 0.835
AspGly: 1.486 ± 0.406
AspHis: 0.495 ± 0.135
AspIle: 6.44 ± 1.761
AspLys: 2.973 ± 0.692
AspLeu: 5.945 ± 2.406
AspMet: 2.477 ± 0.295
AspAsn: 2.229 ± 1.008
AspPro: 2.725 ± 1.075
AspGln: 1.734 ± 0.408
AspArg: 2.229 ± 0.496
AspSer: 1.734 ± 0.376
AspThr: 2.973 ± 0.53
AspVal: 3.468 ± 1.127
AspTrp: 0.743 ± 0.547
AspTyr: 3.716 ± 0.86
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.468 ± 0.983
GluCys: 1.734 ± 0.885
GluAsp: 3.716 ± 0.627
GluGlu: 4.211 ± 1.393
GluPhe: 2.973 ± 1.721
GluGly: 1.982 ± 0.464
GluHis: 0.991 ± 0.271
GluIle: 5.945 ± 1.204
GluLys: 4.954 ± 1.396
GluLeu: 7.679 ± 2.435
GluMet: 2.229 ± 0.757
GluAsn: 2.229 ± 0.536
GluPro: 1.734 ± 0.492
GluGln: 2.229 ± 0.801
GluArg: 3.22 ± 1.363
GluSer: 4.211 ± 0.456
GluThr: 3.716 ± 1.378
GluVal: 1.982 ± 0.626
GluTrp: 0.743 ± 0.334
GluTyr: 2.477 ± 0.277
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.229 ± 0.386
PheCys: 1.734 ± 0.408
PheAsp: 2.477 ± 1.062
PheGlu: 2.973 ± 0.87
PhePhe: 1.734 ± 0.319
PheGly: 1.982 ± 0.972
PheHis: 0.743 ± 0.334
PheIle: 2.725 ± 0.646
PheLys: 3.22 ± 0.284
PheLeu: 6.193 ± 2.405
PheMet: 0.495 ± 0.188
PheAsn: 1.982 ± 1.23
PhePro: 1.239 ± 0.413
PheGln: 1.486 ± 0.372
PheArg: 1.982 ± 0.626
PheSer: 4.706 ± 1.864
PheThr: 3.716 ± 1.116
PheVal: 1.982 ± 0.626
PheTrp: 0.743 ± 0.334
PheTyr: 2.229 ± 1.008
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.486 ± 0.504
GlyCys: 2.477 ± 1.543
GlyAsp: 2.725 ± 0.895
GlyGlu: 3.22 ± 1.031
GlyPhe: 2.229 ± 0.704
GlyGly: 0.248 ± 0.154
GlyHis: 1.239 ± 0.287
GlyIle: 3.963 ± 1.116
GlyLys: 2.973 ± 0.536
GlyLeu: 4.954 ± 1.805
GlyMet: 0.743 ± 0.186
GlyAsn: 3.22 ± 0.458
GlyPro: 0.991 ± 0.313
GlyGln: 1.734 ± 0.408
GlyArg: 1.734 ± 0.565
GlySer: 4.211 ± 2.719
GlyThr: 3.468 ± 2.137
GlyVal: 1.734 ± 0.589
GlyTrp: 0.743 ± 0.334
GlyTyr: 1.486 ± 1.096
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.248 ± 0.154
HisCys: 0.495 ± 0.445
HisAsp: 1.486 ± 0.807
HisGlu: 0.991 ± 0.271
HisPhe: 0.991 ± 0.431
HisGly: 1.239 ± 0.427
HisHis: 0.743 ± 0.334
HisIle: 0.495 ± 0.135
HisLys: 2.725 ± 0.667
HisLeu: 1.239 ± 0.287
HisMet: 0.495 ± 0.135
HisAsn: 0.991 ± 0.271
HisPro: 0.743 ± 0.186
HisGln: 0.743 ± 0.334
HisArg: 0.495 ± 0.445
HisSer: 1.982 ± 0.55
HisThr: 0.743 ± 0.334
HisVal: 1.734 ± 0.376
HisTrp: 0.248 ± 0.154
HisTyr: 0.743 ± 0.186
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.706 ± 0.754
IleCys: 1.982 ± 1.436
IleAsp: 4.459 ± 1.093
IleGlu: 4.954 ± 1.083
IlePhe: 2.477 ± 0.902
IleGly: 5.202 ± 1.574
IleHis: 2.229 ± 0.386
IleIle: 6.44 ± 1.049
IleLys: 8.174 ± 1.637
IleLeu: 9.165 ± 2.276
IleMet: 2.725 ± 1.083
IleAsn: 5.697 ± 0.769
IlePro: 2.725 ± 1.412
IleGln: 3.22 ± 0.65
IleArg: 4.706 ± 1.111
IleSer: 6.44 ± 0.524
IleThr: 3.963 ± 0.464
IleVal: 3.468 ± 1.769
IleTrp: 1.239 ± 0.287
IleTyr: 2.973 ± 0.269
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.973 ± 1.543
LysCys: 1.486 ± 0.669
LysAsp: 5.697 ± 0.472
LysGlu: 6.688 ± 1.414
LysPhe: 2.725 ± 0.404
LysGly: 4.459 ± 0.556
LysHis: 1.982 ± 0.542
LysIle: 7.431 ± 1.675
LysLys: 5.697 ± 0.937
LysLeu: 5.202 ± 1.224
LysMet: 1.982 ± 0.541
LysAsn: 3.963 ± 0.549
LysPro: 2.477 ± 0.914
LysGln: 2.973 ± 0.692
LysArg: 3.22 ± 0.284
LysSer: 4.211 ± 0.456
LysThr: 7.184 ± 0.691
LysVal: 4.211 ± 0.505
LysTrp: 0.743 ± 0.186
LysTyr: 4.211 ± 1.256
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 4.706 ± 1.364
LeuCys: 1.239 ± 0.459
LeuAsp: 4.706 ± 1.109
LeuGlu: 5.697 ± 2.103
LeuPhe: 4.211 ± 0.906
LeuGly: 2.973 ± 1.313
LeuHis: 1.486 ± 0.606
LeuIle: 8.918 ± 0.827
LeuLys: 7.431 ± 1.783
LeuLeu: 10.651 ± 2.046
LeuMet: 2.229 ± 0.444
LeuAsn: 6.193 ± 0.734
LeuPro: 4.459 ± 0.667
LeuGln: 3.22 ± 2.235
LeuArg: 4.459 ± 0.553
LeuSer: 9.413 ± 2.678
LeuThr: 6.193 ± 0.712
LeuVal: 4.954 ± 1.203
LeuTrp: 0.248 ± 0.154
LeuTyr: 5.945 ± 1.233
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.486 ± 0.807
MetCys: 0.743 ± 0.334
MetAsp: 1.734 ± 1.076
MetGlu: 1.239 ± 0.457
MetPhe: 1.239 ± 0.427
MetGly: 0.991 ± 0.686
MetHis: 0.0 ± 0.0
MetIle: 2.229 ± 0.768
MetLys: 2.477 ± 0.677
MetLeu: 2.725 ± 1.244
MetMet: 0.743 ± 0.186
MetAsn: 0.991 ± 0.313
MetPro: 1.734 ± 0.319
MetGln: 0.248 ± 0.154
MetArg: 0.991 ± 0.525
MetSer: 3.468 ± 0.821
MetThr: 2.229 ± 1.41
MetVal: 1.734 ± 0.661
MetTrp: 0.0 ± 0.0
MetTyr: 0.991 ± 0.889
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.229 ± 0.386
AsnCys: 0.743 ± 0.186
AsnAsp: 2.229 ± 0.241
AsnGlu: 2.725 ± 0.646
AsnPhe: 2.477 ± 1.537
AsnGly: 2.725 ± 0.77
AsnHis: 1.239 ± 0.413
AsnIle: 6.193 ± 0.853
AsnLys: 3.716 ± 0.642
AsnLeu: 5.945 ± 2.005
AsnMet: 0.991 ± 0.615
AsnAsn: 3.963 ± 0.744
AsnPro: 2.725 ± 1.141
AsnGln: 1.982 ± 0.276
AsnArg: 1.982 ± 2.106
AsnSer: 3.716 ± 1.617
AsnThr: 3.716 ± 0.649
AsnVal: 1.982 ± 0.792
AsnTrp: 0.495 ± 0.135
AsnTyr: 2.477 ± 0.914
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.973 ± 0.779
ProCys: 0.0 ± 0.0
ProAsp: 1.734 ± 0.637
ProGlu: 2.725 ± 0.404
ProPhe: 1.239 ± 0.459
ProGly: 2.725 ± 1.386
ProHis: 0.248 ± 0.222
ProIle: 3.716 ± 0.87
ProLys: 2.477 ± 0.277
ProLeu: 2.725 ± 0.891
ProMet: 0.991 ± 0.271
ProAsn: 1.734 ± 0.319
ProPro: 0.495 ± 0.307
ProGln: 1.486 ± 0.372
ProArg: 0.743 ± 0.186
ProSer: 1.982 ± 0.626
ProThr: 1.239 ± 0.287
ProVal: 1.734 ± 0.492
ProTrp: 0.743 ± 0.547
ProTyr: 1.486 ± 0.606
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.743 ± 0.461
GlnCys: 0.743 ± 0.667
GlnAsp: 1.982 ± 0.55
GlnGlu: 2.229 ± 0.536
GlnPhe: 2.229 ± 0.471
GlnGly: 1.239 ± 0.644
GlnHis: 1.239 ± 0.956
GlnIle: 4.211 ± 0.631
GlnLys: 2.477 ± 0.674
GlnLeu: 2.229 ± 0.241
GlnMet: 0.495 ± 0.578
GlnAsn: 1.734 ± 0.906
GlnPro: 0.743 ± 0.334
GlnGln: 1.239 ± 0.287
GlnArg: 1.486 ± 1.138
GlnSer: 1.486 ± 0.669
GlnThr: 3.716 ± 0.668
GlnVal: 2.229 ± 1.41
GlnTrp: 0.743 ± 0.334
GlnTyr: 1.486 ± 0.372
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.982 ± 0.372
ArgCys: 1.982 ± 0.737
ArgAsp: 2.229 ± 0.471
ArgGlu: 2.725 ± 1.075
ArgPhe: 2.229 ± 0.471
ArgGly: 1.239 ± 0.682
ArgHis: 1.239 ± 0.427
ArgIle: 1.982 ± 1.185
ArgLys: 2.725 ± 0.854
ArgLeu: 5.45 ± 1.946
ArgMet: 0.495 ± 0.307
ArgAsn: 3.22 ± 0.851
ArgPro: 0.495 ± 0.544
ArgGln: 1.734 ± 1.557
ArgArg: 1.734 ± 0.756
ArgSer: 3.22 ± 1.553
ArgThr: 1.982 ± 0.978
ArgVal: 1.734 ± 0.319
ArgTrp: 0.495 ± 0.135
ArgTyr: 2.477 ± 0.277
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.725 ± 1.126
SerCys: 2.725 ± 1.225
SerAsp: 3.716 ± 1.403
SerGlu: 5.45 ± 1.184
SerPhe: 2.973 ± 0.931
SerGly: 3.716 ± 1.418
SerHis: 1.239 ± 0.682
SerIle: 7.431 ± 0.915
SerLys: 6.936 ± 0.814
SerLeu: 7.679 ± 1.495
SerMet: 2.725 ± 1.282
SerAsn: 3.468 ± 1.127
SerPro: 1.734 ± 0.565
SerGln: 2.477 ± 1.021
SerArg: 3.716 ± 0.63
SerSer: 4.954 ± 1.348
SerThr: 5.202 ± 2.028
SerVal: 3.716 ± 0.649
SerTrp: 0.0 ± 0.0
SerTyr: 2.229 ± 0.768
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.963 ± 0.943
ThrCys: 2.973 ± 1.338
ThrAsp: 2.477 ± 0.752
ThrGlu: 3.716 ± 1.672
ThrPhe: 3.963 ± 0.675
ThrGly: 4.211 ± 1.61
ThrHis: 0.991 ± 0.552
ThrIle: 5.45 ± 0.744
ThrLys: 4.954 ± 0.62
ThrLeu: 4.459 ± 2.604
ThrMet: 1.734 ± 0.756
ThrAsn: 2.229 ± 0.768
ThrPro: 2.477 ± 1.213
ThrGln: 1.239 ± 0.656
ThrArg: 2.477 ± 0.677
ThrSer: 5.202 ± 2.441
ThrThr: 3.468 ± 2.015
ThrVal: 3.716 ± 1.665
ThrTrp: 0.495 ± 0.135
ThrTyr: 2.973 ± 0.744
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.725 ± 0.729
ValCys: 1.734 ± 0.408
ValAsp: 3.22 ± 0.987
ValGlu: 2.973 ± 0.813
ValPhe: 1.982 ± 0.464
ValGly: 1.982 ± 0.564
ValHis: 0.743 ± 0.461
ValIle: 3.22 ± 0.779
ValLys: 3.963 ± 1.504
ValLeu: 5.945 ± 1.488
ValMet: 1.486 ± 0.993
ValAsn: 1.982 ± 1.014
ValPro: 1.239 ± 0.682
ValGln: 0.991 ± 0.525
ValArg: 1.239 ± 0.459
ValSer: 3.963 ± 0.549
ValThr: 3.22 ± 0.749
ValVal: 1.734 ± 0.589
ValTrp: 0.248 ± 0.582
ValTyr: 2.229 ± 0.558
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.495 ± 0.307
TrpCys: 0.495 ± 0.135
TrpAsp: 0.495 ± 0.307
TrpGlu: 0.248 ± 0.154
TrpPhe: 0.743 ± 0.186
TrpGly: 0.743 ± 0.655
TrpHis: 0.0 ± 0.0
TrpIle: 0.495 ± 0.445
TrpLys: 0.495 ± 0.445
TrpLeu: 1.239 ± 0.457
TrpMet: 0.248 ± 0.582
TrpAsn: 0.495 ± 0.307
TrpPro: 0.495 ± 0.445
TrpGln: 0.743 ± 0.334
TrpArg: 0.248 ± 0.222
TrpSer: 1.982 ± 0.972
TrpThr: 0.0 ± 0.0
TrpVal: 0.495 ± 0.135
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.229 ± 0.721
TyrCys: 1.982 ± 1.436
TyrAsp: 2.229 ± 0.558
TyrGlu: 2.477 ± 0.674
TyrPhe: 1.734 ± 0.563
TyrGly: 1.239 ± 0.644
TyrHis: 1.734 ± 0.589
TyrIle: 4.706 ± 0.859
TyrLys: 5.45 ± 0.339
TyrLeu: 3.22 ± 1.363
TyrMet: 1.486 ± 0.922
TyrAsn: 3.22 ± 0.749
TyrPro: 1.486 ± 0.972
TyrGln: 0.743 ± 0.461
TyrArg: 1.486 ± 1.095
TyrSer: 3.716 ± 0.86
TyrThr: 2.973 ± 0.813
TyrVal: 0.743 ± 0.334
TyrTrp: 0.248 ± 0.154
TyrTyr: 2.229 ± 0.558
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (4038 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level
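A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration only: the page does not describe the procedure beyond "bootstrapping (x100) at the protein level", so the choice of the standard deviation as the error measure, the function names (`dipeptide_per_mille`, `bootstrap_error`), the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (‰) over all overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples (assumed error measure)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the number of proteins fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Made-up one-letter sequences, not the Kowanyama virus proteins.
toy = ["MLLKLLA", "MKTAYIAK", "MLSLLG", "MKKLLE"]
print(dipeptide_per_mille(toy, "LL"), bootstrap_error(toy, "LL"))
```

Because whole proteins are resampled, a dipeptide that is concentrated in a single protein gets a large error, while one that is absent from every protein always yields 0.0 ± 0.0, as in the Xaa rows of the table.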

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski