Amino acid dipepetide frequency for Komandory virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.967AlaAla: 4.967 ± 1.31
1.104AlaCys: 1.104 ± 0.406
2.759AlaAsp: 2.759 ± 0.451
3.587AlaGlu: 3.587 ± 0.75
1.38AlaPhe: 1.38 ± 0.603
3.587AlaGly: 3.587 ± 1.369
0.552AlaHis: 0.552 ± 0.579
3.311AlaIle: 3.311 ± 1.091
4.139AlaLys: 4.139 ± 0.784
4.691AlaLeu: 4.691 ± 0.295
1.104AlaMet: 1.104 ± 0.987
2.483AlaAsn: 2.483 ± 1.327
0.828AlaPro: 0.828 ± 0.478
0.828AlaGln: 0.828 ± 0.541
1.932AlaArg: 1.932 ± 0.814
5.795AlaSer: 5.795 ± 2.212
3.863AlaThr: 3.863 ± 1.164
2.483AlaVal: 2.483 ± 0.859
0.552AlaTrp: 0.552 ± 0.319
1.38AlaTyr: 1.38 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.552CysAla: 0.552 ± 0.527
0.276CysCys: 0.276 ± 0.159
1.38CysAsp: 1.38 ± 0.603
1.932CysGlu: 1.932 ± 1.484
2.208CysPhe: 2.208 ± 1.399
1.656CysGly: 1.656 ± 0.505
0.552CysHis: 0.552 ± 0.203
1.656CysIle: 1.656 ± 0.609
1.932CysLys: 1.932 ± 1.14
3.311CysLeu: 3.311 ± 0.586
0.828CysMet: 0.828 ± 0.48
1.104CysAsn: 1.104 ± 0.406
0.828CysPro: 0.828 ± 0.79
1.38CysGln: 1.38 ± 0.96
0.828CysArg: 0.828 ± 0.48
3.311CysSer: 3.311 ± 1.734
0.552CysThr: 0.552 ± 0.527
0.828CysVal: 0.828 ± 0.442
0.276CysTrp: 0.276 ± 0.263
0.828CysTyr: 0.828 ± 0.79
0.0CysXaa: 0.0 ± 0.0
Asp
3.311AspAla: 3.311 ± 0.806
2.208AspCys: 2.208 ± 2.108
2.759AspAsp: 2.759 ± 0.451
2.208AspGlu: 2.208 ± 0.954
3.311AspPhe: 3.311 ± 0.588
2.483AspGly: 2.483 ± 0.879
1.656AspHis: 1.656 ± 0.437
3.311AspIle: 3.311 ± 0.176
4.139AspLys: 4.139 ± 1.518
6.623AspLeu: 6.623 ± 1.669
1.104AspMet: 1.104 ± 1.023
2.208AspAsn: 2.208 ± 0.954
3.035AspPro: 3.035 ± 0.697
1.932AspGln: 1.932 ± 0.617
3.587AspArg: 3.587 ± 1.301
4.139AspSer: 4.139 ± 1.289
1.38AspThr: 1.38 ± 0.511
4.139AspVal: 4.139 ± 0.994
1.932AspTrp: 1.932 ± 2.098
2.483AspTyr: 2.483 ± 1.327
0.0AspXaa: 0.0 ± 0.0
Glu
3.587GluAla: 3.587 ± 0.742
1.932GluCys: 1.932 ± 0.834
4.691GluAsp: 4.691 ± 0.798
4.691GluGlu: 4.691 ± 1.072
4.691GluPhe: 4.691 ± 1.349
4.139GluGly: 4.139 ± 0.854
1.104GluHis: 1.104 ± 0.4
4.967GluIle: 4.967 ± 1.117
3.863GluLys: 3.863 ± 0.708
4.967GluLeu: 4.967 ± 1.492
0.828GluMet: 0.828 ± 0.914
1.104GluAsn: 1.104 ± 0.528
0.276GluPro: 0.276 ± 0.159
2.208GluGln: 2.208 ± 0.812
3.587GluArg: 3.587 ± 2.052
5.243GluSer: 5.243 ± 1.599
3.863GluThr: 3.863 ± 0.851
4.967GluVal: 4.967 ± 1.324
1.932GluTrp: 1.932 ± 0.617
1.656GluTyr: 1.656 ± 0.661
0.0GluXaa: 0.0 ± 0.0
Phe
2.759PheAla: 2.759 ± 0.994
2.208PheCys: 2.208 ± 0.812
3.311PheAsp: 3.311 ± 0.783
1.932PheGlu: 1.932 ± 0.503
2.759PhePhe: 2.759 ± 1.283
2.759PheGly: 2.759 ± 0.59
2.483PheHis: 2.483 ± 0.474
2.208PheIle: 2.208 ± 0.373
4.691PheLys: 4.691 ± 1.293
5.519PheLeu: 5.519 ± 2.109
0.552PheMet: 0.552 ± 0.319
3.311PheAsn: 3.311 ± 1.05
1.38PhePro: 1.38 ± 0.797
1.932PheGln: 1.932 ± 0.354
3.035PheArg: 3.035 ± 0.25
4.139PheSer: 4.139 ± 1.045
3.311PheThr: 3.311 ± 1.224
1.932PheVal: 1.932 ± 0.977
0.828PheTrp: 0.828 ± 0.478
1.656PheTyr: 1.656 ± 0.609
0.0PheXaa: 0.0 ± 0.0
Gly
2.208GlyAla: 2.208 ± 0.373
1.38GlyCys: 1.38 ± 0.462
2.759GlyAsp: 2.759 ± 0.859
3.587GlyGlu: 3.587 ± 1.436
3.587GlyPhe: 3.587 ± 2.018
1.932GlyGly: 1.932 ± 0.834
0.828GlyHis: 0.828 ± 0.253
4.139GlyIle: 4.139 ± 0.702
3.863GlyLys: 3.863 ± 0.348
4.967GlyLeu: 4.967 ± 1.353
1.656GlyMet: 1.656 ± 0.431
1.932GlyAsn: 1.932 ± 0.538
2.483GlyPro: 2.483 ± 1.017
1.932GlyGln: 1.932 ± 1.321
1.38GlyArg: 1.38 ± 0.43
4.691GlySer: 4.691 ± 0.895
2.759GlyThr: 2.759 ± 1.015
3.863GlyVal: 3.863 ± 0.735
1.656GlyTrp: 1.656 ± 1.157
1.38GlyTyr: 1.38 ± 0.628
0.0GlyXaa: 0.0 ± 0.0
His
0.828HisAla: 0.828 ± 0.442
1.38HisCys: 1.38 ± 0.636
1.38HisAsp: 1.38 ± 0.96
0.828HisGlu: 0.828 ± 0.495
1.38HisPhe: 1.38 ± 0.511
2.208HisGly: 2.208 ± 0.954
1.38HisHis: 1.38 ± 0.603
1.932HisIle: 1.932 ± 0.623
1.104HisLys: 1.104 ± 0.37
3.863HisLeu: 3.863 ± 1.374
0.552HisMet: 0.552 ± 1.305
0.552HisAsn: 0.552 ± 0.203
2.208HisPro: 2.208 ± 0.969
0.552HisGln: 0.552 ± 0.655
1.38HisArg: 1.38 ± 0.797
2.208HisSer: 2.208 ± 0.501
0.552HisThr: 0.552 ± 0.319
1.656HisVal: 1.656 ± 0.661
0.828HisTrp: 0.828 ± 0.478
0.828HisTyr: 0.828 ± 0.495
0.0HisXaa: 0.0 ± 0.0
Ile
2.483IleAla: 2.483 ± 0.739
2.483IleCys: 2.483 ± 1.327
3.863IleAsp: 3.863 ± 0.787
3.587IleGlu: 3.587 ± 0.913
0.552IlePhe: 0.552 ± 0.319
3.863IleGly: 3.863 ± 1.314
2.759IleHis: 2.759 ± 0.859
6.071IleIle: 6.071 ± 1.57
5.795IleLys: 5.795 ± 0.856
4.415IleLeu: 4.415 ± 0.802
1.656IleMet: 1.656 ± 0.638
3.311IleAsn: 3.311 ± 0.176
3.587IlePro: 3.587 ± 0.765
2.208IleGln: 2.208 ± 0.741
3.863IleArg: 3.863 ± 0.752
7.174IleSer: 7.174 ± 0.378
2.483IleThr: 2.483 ± 0.474
4.139IleVal: 4.139 ± 2.251
1.656IleTrp: 1.656 ± 0.749
1.932IleTyr: 1.932 ± 0.343
0.0IleXaa: 0.0 ± 0.0
Lys
5.519LysAla: 5.519 ± 2.547
1.104LysCys: 1.104 ± 0.699
3.587LysAsp: 3.587 ± 1.19
2.759LysGlu: 2.759 ± 1.736
3.863LysPhe: 3.863 ± 1.628
1.932LysGly: 1.932 ± 0.623
1.38LysHis: 1.38 ± 0.797
4.967LysIle: 4.967 ± 0.737
6.347LysLys: 6.347 ± 1.109
7.726LysLeu: 7.726 ± 0.933
2.759LysMet: 2.759 ± 0.836
1.932LysAsn: 1.932 ± 0.614
2.208LysPro: 2.208 ± 0.854
2.483LysGln: 2.483 ± 0.962
5.519LysArg: 5.519 ± 2.211
4.967LysSer: 4.967 ± 1.17
4.691LysThr: 4.691 ± 0.466
4.691LysVal: 4.691 ± 1.268
1.656LysTrp: 1.656 ± 0.718
1.932LysTyr: 1.932 ± 0.503
0.0LysXaa: 0.0 ± 0.0
Leu
5.243LeuAla: 5.243 ± 1.194
2.483LeuCys: 2.483 ± 0.474
6.898LeuAsp: 6.898 ± 1.264
6.623LeuGlu: 6.623 ± 1.567
5.519LeuPhe: 5.519 ± 1.696
6.623LeuGly: 6.623 ± 1.86
2.759LeuHis: 2.759 ± 0.413
5.519LeuIle: 5.519 ± 0.513
7.726LeuLys: 7.726 ± 1.581
8.83LeuLeu: 8.83 ± 1.334
3.035LeuMet: 3.035 ± 1.417
4.415LeuAsn: 4.415 ± 0.87
3.311LeuPro: 3.311 ± 0.488
3.587LeuGln: 3.587 ± 1.365
4.691LeuArg: 4.691 ± 1.182
7.174LeuSer: 7.174 ± 1.508
8.002LeuThr: 8.002 ± 3.174
4.415LeuVal: 4.415 ± 1.32
1.104LeuTrp: 1.104 ± 1.039
1.38LeuTyr: 1.38 ± 0.753
0.0LeuXaa: 0.0 ± 0.0
Met
0.828MetAla: 0.828 ± 1.274
0.0MetCys: 0.0 ± 0.0
1.38MetAsp: 1.38 ± 0.764
1.38MetGlu: 1.38 ± 0.5
1.932MetPhe: 1.932 ± 0.814
1.104MetGly: 1.104 ± 0.4
1.104MetHis: 1.104 ± 0.843
1.932MetIle: 1.932 ± 0.988
1.932MetLys: 1.932 ± 1.238
1.656MetLeu: 1.656 ± 0.896
0.828MetMet: 0.828 ± 0.552
0.276MetAsn: 0.276 ± 0.263
0.828MetPro: 0.828 ± 0.716
1.104MetGln: 1.104 ± 0.528
1.38MetArg: 1.38 ± 0.636
2.208MetSer: 2.208 ± 1.022
1.656MetThr: 1.656 ± 0.609
1.38MetVal: 1.38 ± 0.43
0.0MetTrp: 0.0 ± 0.0
0.276MetTyr: 0.276 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
1.104AsnAla: 1.104 ± 1.175
1.932AsnCys: 1.932 ± 0.834
1.104AsnAsp: 1.104 ± 0.638
2.208AsnGlu: 2.208 ± 0.8
2.759AsnPhe: 2.759 ± 0.859
1.104AsnGly: 1.104 ± 0.581
1.104AsnHis: 1.104 ± 0.37
1.104AsnIle: 1.104 ± 0.37
3.035AsnLys: 3.035 ± 1.557
3.863AsnLeu: 3.863 ± 1.389
1.932AsnMet: 1.932 ± 0.529
1.38AsnAsn: 1.38 ± 0.636
1.932AsnPro: 1.932 ± 0.834
1.656AsnGln: 1.656 ± 0.375
1.932AsnArg: 1.932 ± 0.814
3.863AsnSer: 3.863 ± 1.421
1.932AsnThr: 1.932 ± 1.659
2.759AsnVal: 2.759 ± 0.859
1.38AsnTrp: 1.38 ± 0.511
0.276AsnTyr: 0.276 ± 0.159
0.0AsnXaa: 0.0 ± 0.0
Pro
2.483ProAla: 2.483 ± 1.576
0.276ProCys: 0.276 ± 0.159
1.932ProAsp: 1.932 ± 0.508
4.691ProGlu: 4.691 ± 1.475
3.587ProPhe: 3.587 ± 0.645
2.483ProGly: 2.483 ± 0.98
0.828ProHis: 0.828 ± 0.253
1.656ProIle: 1.656 ± 0.957
2.208ProLys: 2.208 ± 0.621
3.311ProLeu: 3.311 ± 0.819
0.276ProMet: 0.276 ± 0.159
1.104ProAsn: 1.104 ± 1.039
1.932ProPro: 1.932 ± 1.457
0.828ProGln: 0.828 ± 0.495
1.38ProArg: 1.38 ± 0.43
3.863ProSer: 3.863 ± 0.686
1.656ProThr: 1.656 ± 0.505
1.656ProVal: 1.656 ± 1.827
0.552ProTrp: 0.552 ± 0.319
0.828ProTyr: 0.828 ± 0.48
0.0ProXaa: 0.0 ± 0.0
Gln
1.104GlnAla: 1.104 ± 0.699
1.656GlnCys: 1.656 ± 0.638
1.38GlnAsp: 1.38 ± 0.603
3.587GlnGlu: 3.587 ± 0.645
1.104GlnPhe: 1.104 ± 0.528
1.932GlnGly: 1.932 ± 0.988
1.38GlnHis: 1.38 ± 1.12
2.759GlnIle: 2.759 ± 0.929
3.587GlnLys: 3.587 ± 1.311
1.932GlnLeu: 1.932 ± 0.508
0.276GlnMet: 0.276 ± 0.159
0.828GlnAsn: 0.828 ± 0.495
1.104GlnPro: 1.104 ± 0.699
0.828GlnGln: 0.828 ± 0.552
1.104GlnArg: 1.104 ± 0.699
3.311GlnSer: 3.311 ± 1.01
1.38GlnThr: 1.38 ± 0.417
2.759GlnVal: 2.759 ± 1.004
0.276GlnTrp: 0.276 ± 0.159
0.828GlnTyr: 0.828 ± 0.541
0.0GlnXaa: 0.0 ± 0.0
Arg
1.932ArgAla: 1.932 ± 0.614
1.932ArgCys: 1.932 ± 0.859
4.415ArgAsp: 4.415 ± 0.566
2.483ArgGlu: 2.483 ± 0.823
2.483ArgPhe: 2.483 ± 0.399
3.587ArgGly: 3.587 ± 1.514
1.104ArgHis: 1.104 ± 1.054
4.415ArgIle: 4.415 ± 1.172
2.208ArgLys: 2.208 ± 0.501
4.139ArgLeu: 4.139 ± 1.123
1.104ArgMet: 1.104 ± 0.406
2.483ArgAsn: 2.483 ± 1.327
1.656ArgPro: 1.656 ± 0.706
2.483ArgGln: 2.483 ± 0.879
2.759ArgArg: 2.759 ± 1.345
4.691ArgSer: 4.691 ± 1.169
2.483ArgThr: 2.483 ± 0.758
2.759ArgVal: 2.759 ± 0.929
0.552ArgTrp: 0.552 ± 0.203
1.656ArgTyr: 1.656 ± 0.989
0.0ArgXaa: 0.0 ± 0.0
Ser
4.967SerAla: 4.967 ± 1.446
1.104SerCys: 1.104 ± 1.054
3.863SerAsp: 3.863 ± 0.348
8.278SerGlu: 8.278 ± 1.918
2.208SerPhe: 2.208 ± 0.373
3.863SerGly: 3.863 ± 0.752
2.759SerHis: 2.759 ± 1.015
6.071SerIle: 6.071 ± 0.689
7.45SerLys: 7.45 ± 1.707
11.313SerLeu: 11.313 ± 1.244
2.208SerMet: 2.208 ± 0.373
2.759SerAsn: 2.759 ± 0.38
3.587SerPro: 3.587 ± 0.934
1.932SerGln: 1.932 ± 0.343
4.967SerArg: 4.967 ± 0.761
6.347SerSer: 6.347 ± 1.372
4.967SerThr: 4.967 ± 0.929
4.139SerVal: 4.139 ± 0.733
1.38SerTrp: 1.38 ± 0.5
1.38SerTyr: 1.38 ± 0.511
0.0SerXaa: 0.0 ± 0.0
Thr
3.311ThrAla: 3.311 ± 0.689
1.38ThrCys: 1.38 ± 0.789
3.587ThrAsp: 3.587 ± 1.316
2.759ThrGlu: 2.759 ± 0.378
3.863ThrPhe: 3.863 ± 0.708
4.691ThrGly: 4.691 ± 1.593
1.656ThrHis: 1.656 ± 0.414
3.311ThrIle: 3.311 ± 1.012
2.483ThrLys: 2.483 ± 0.399
6.071ThrLeu: 6.071 ± 2.991
1.104ThrMet: 1.104 ± 0.699
2.483ThrAsn: 2.483 ± 0.758
1.932ThrPro: 1.932 ± 0.343
1.104ThrGln: 1.104 ± 0.37
2.208ThrArg: 2.208 ± 0.984
4.139ThrSer: 4.139 ± 1.621
3.035ThrThr: 3.035 ± 1.043
4.967ThrVal: 4.967 ± 3.066
0.828ThrTrp: 0.828 ± 0.253
1.104ThrTyr: 1.104 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
3.035ValAla: 3.035 ± 0.506
1.104ValCys: 1.104 ± 0.699
4.691ValAsp: 4.691 ± 1.6
4.415ValGlu: 4.415 ± 1.426
3.311ValPhe: 3.311 ± 0.347
1.104ValGly: 1.104 ± 0.699
1.104ValHis: 1.104 ± 0.623
3.863ValIle: 3.863 ± 0.941
3.587ValLys: 3.587 ± 0.189
6.071ValLeu: 6.071 ± 0.554
0.276ValMet: 0.276 ± 0.263
2.208ValAsn: 2.208 ± 1.247
2.483ValPro: 2.483 ± 0.422
2.759ValGln: 2.759 ± 1.016
2.208ValArg: 2.208 ± 0.614
6.071ValSer: 6.071 ± 0.949
3.311ValThr: 3.311 ± 0.806
4.139ValVal: 4.139 ± 0.733
1.656ValTrp: 1.656 ± 0.638
1.38ValTyr: 1.38 ± 0.636
0.0ValXaa: 0.0 ± 0.0
Trp
1.104TrpAla: 1.104 ± 0.699
0.0TrpCys: 0.0 ± 0.0
0.828TrpAsp: 0.828 ± 0.716
1.38TrpGlu: 1.38 ± 1.332
1.104TrpPhe: 1.104 ± 0.638
1.38TrpGly: 1.38 ± 0.753
0.0TrpHis: 0.0 ± 0.0
2.759TrpIle: 2.759 ± 0.378
1.104TrpLys: 1.104 ± 0.581
2.483TrpLeu: 2.483 ± 0.474
0.828TrpMet: 0.828 ± 0.442
0.552TrpAsn: 0.552 ± 0.512
0.552TrpPro: 0.552 ± 1.148
0.276TrpGln: 0.276 ± 0.159
1.656TrpArg: 1.656 ± 0.375
0.828TrpSer: 0.828 ± 0.442
1.656TrpThr: 1.656 ± 0.661
0.552TrpVal: 0.552 ± 0.579
0.0TrpTrp: 0.0 ± 0.0
0.552TrpTyr: 0.552 ± 0.319
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.0TyrAla: 0.0 ± 0.0
0.276TyrCys: 0.276 ± 0.263
1.38TyrAsp: 1.38 ± 0.43
1.38TyrGlu: 1.38 ± 0.797
1.104TyrPhe: 1.104 ± 0.406
0.552TyrGly: 0.552 ± 0.579
1.104TyrHis: 1.104 ± 0.638
1.932TyrIle: 1.932 ± 0.814
0.828TyrLys: 0.828 ± 0.253
3.863TyrLeu: 3.863 ± 0.348
0.0TyrMet: 0.0 ± 0.0
1.932TyrAsn: 1.932 ± 0.503
1.38TyrPro: 1.38 ± 0.753
1.104TyrGln: 1.104 ± 0.931
1.932TyrArg: 1.932 ± 1.659
1.38TyrSer: 1.38 ± 0.462
2.483TyrThr: 2.483 ± 0.821
0.552TyrVal: 0.552 ± 0.319
0.552TyrTrp: 0.552 ± 0.203
0.552TyrTyr: 0.552 ± 0.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3625 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski