Amino acid dipepetide frequency for Oriboca virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.762AlaAla: 1.762 ± 1.012
1.762AlaCys: 1.762 ± 0.749
1.762AlaAsp: 1.762 ± 1.683
3.775AlaGlu: 3.775 ± 0.604
3.02AlaPhe: 3.02 ± 1.124
1.258AlaGly: 1.258 ± 0.854
0.503AlaHis: 0.503 ± 0.187
3.272AlaIle: 3.272 ± 0.274
5.789AlaLys: 5.789 ± 3.76
4.782AlaLeu: 4.782 ± 1.057
0.755AlaMet: 0.755 ± 0.266
4.027AlaAsn: 4.027 ± 3.309
0.755AlaPro: 0.755 ± 0.393
1.762AlaGln: 1.762 ± 1.704
2.014AlaArg: 2.014 ± 1.108
2.265AlaSer: 2.265 ± 0.797
2.769AlaThr: 2.769 ± 0.838
2.769AlaVal: 2.769 ± 0.511
0.252AlaTrp: 0.252 ± 0.168
1.762AlaTyr: 1.762 ± 0.569
0.0AlaXaa: 0.0 ± 0.0
Cys
1.258CysAla: 1.258 ± 1.283
0.252CysCys: 0.252 ± 0.168
0.755CysAsp: 0.755 ± 0.393
1.762CysGlu: 1.762 ± 1.325
2.265CysPhe: 2.265 ± 1.179
3.02CysGly: 3.02 ± 1.572
0.252CysHis: 0.252 ± 0.237
2.517CysIle: 2.517 ± 1.404
2.517CysLys: 2.517 ± 1.709
1.762CysLeu: 1.762 ± 0.605
0.503CysMet: 0.503 ± 0.187
1.762CysAsn: 1.762 ± 0.605
0.755CysPro: 0.755 ± 0.266
0.755CysGln: 0.755 ± 0.393
0.755CysArg: 0.755 ± 0.711
2.517CysSer: 2.517 ± 0.856
2.014CysThr: 2.014 ± 1.243
1.51CysVal: 1.51 ± 1.089
0.252CysTrp: 0.252 ± 0.168
0.755CysTyr: 0.755 ± 0.393
0.0CysXaa: 0.0 ± 0.0
Asp
2.265AspAla: 2.265 ± 0.959
1.51AspCys: 1.51 ± 0.562
2.769AspAsp: 2.769 ± 1.275
3.02AspGlu: 3.02 ± 1.625
3.524AspPhe: 3.524 ± 1.499
2.014AspGly: 2.014 ± 0.778
0.0AspHis: 0.0 ± 0.0
3.524AspIle: 3.524 ± 1.312
2.769AspLys: 2.769 ± 0.296
5.034AspLeu: 5.034 ± 1.744
1.51AspMet: 1.51 ± 1.009
3.272AspAsn: 3.272 ± 1.303
1.51AspPro: 1.51 ± 0.64
2.769AspGln: 2.769 ± 1.063
2.769AspArg: 2.769 ± 1.546
1.258AspSer: 1.258 ± 0.428
3.272AspThr: 3.272 ± 1.431
3.272AspVal: 3.272 ± 1.354
0.252AspTrp: 0.252 ± 0.168
1.51AspTyr: 1.51 ± 0.531
0.0AspXaa: 0.0 ± 0.0
Glu
2.769GluAla: 2.769 ± 0.511
1.258GluCys: 1.258 ± 0.568
3.02GluAsp: 3.02 ± 1.437
4.782GluGlu: 4.782 ± 1.182
5.034GluPhe: 5.034 ± 2.386
1.007GluGly: 1.007 ± 0.375
2.265GluHis: 2.265 ± 1.971
7.299GluIle: 7.299 ± 1.727
4.531GluLys: 4.531 ± 0.825
7.803GluLeu: 7.803 ± 1.536
3.272GluMet: 3.272 ± 1.443
3.524GluAsn: 3.524 ± 0.789
2.265GluPro: 2.265 ± 0.787
2.014GluGln: 2.014 ± 0.687
3.524GluArg: 3.524 ± 1.499
4.531GluSer: 4.531 ± 0.516
4.027GluThr: 4.027 ± 1.453
3.775GluVal: 3.775 ± 1.284
0.503GluTrp: 0.503 ± 0.187
2.769GluTyr: 2.769 ± 0.971
0.0GluXaa: 0.0 ± 0.0
Phe
2.014PheAla: 2.014 ± 0.687
2.517PheCys: 2.517 ± 1.379
3.02PheAsp: 3.02 ± 0.689
4.279PheGlu: 4.279 ± 0.933
2.769PhePhe: 2.769 ± 1.443
2.014PheGly: 2.014 ± 1.628
1.258PheHis: 1.258 ± 0.428
3.775PheIle: 3.775 ± 1.26
4.531PheLys: 4.531 ± 1.917
4.279PheLeu: 4.279 ± 3.128
0.755PheMet: 0.755 ± 0.504
2.265PheAsn: 2.265 ± 1.213
1.258PhePro: 1.258 ± 0.743
1.762PheGln: 1.762 ± 0.605
1.51PheArg: 1.51 ± 1.009
4.279PheSer: 4.279 ± 0.617
2.517PheThr: 2.517 ± 0.937
3.02PheVal: 3.02 ± 1.031
0.252PheTrp: 0.252 ± 0.168
3.02PheTyr: 3.02 ± 1.572
0.0PheXaa: 0.0 ± 0.0
Gly
1.51GlyAla: 1.51 ± 0.64
2.517GlyCys: 2.517 ± 1.709
2.265GlyAsp: 2.265 ± 1.213
5.034GlyGlu: 5.034 ± 0.693
0.755GlyPhe: 0.755 ± 0.917
1.007GlyGly: 1.007 ± 0.403
0.252GlyHis: 0.252 ± 0.237
4.531GlyIle: 4.531 ± 0.516
3.272GlyLys: 3.272 ± 1.526
2.517GlyLeu: 2.517 ± 0.993
0.503GlyMet: 0.503 ± 0.187
2.769GlyAsn: 2.769 ± 1.063
1.007GlyPro: 1.007 ± 0.375
0.755GlyGln: 0.755 ± 0.917
2.014GlyArg: 2.014 ± 1.277
2.769GlySer: 2.769 ± 2.097
2.769GlyThr: 2.769 ± 2.602
2.517GlyVal: 2.517 ± 1.342
1.007GlyTrp: 1.007 ± 0.375
1.762GlyTyr: 1.762 ± 0.605
0.0GlyXaa: 0.0 ± 0.0
His
0.503HisAla: 0.503 ± 0.474
0.503HisCys: 0.503 ± 0.187
0.252HisAsp: 0.252 ± 0.237
0.755HisGlu: 0.755 ± 0.889
1.51HisPhe: 1.51 ± 1.009
1.51HisGly: 1.51 ± 0.531
0.503HisHis: 0.503 ± 0.187
1.258HisIle: 1.258 ± 0.558
1.007HisLys: 1.007 ± 0.375
1.762HisLeu: 1.762 ± 1.162
0.503HisMet: 0.503 ± 0.187
3.02HisAsn: 3.02 ± 1.063
1.007HisPro: 1.007 ± 0.375
0.503HisGln: 0.503 ± 0.474
1.51HisArg: 1.51 ± 2.921
1.258HisSer: 1.258 ± 0.428
0.755HisThr: 0.755 ± 0.393
0.755HisVal: 0.755 ± 0.266
0.503HisTrp: 0.503 ± 0.336
1.258HisTyr: 1.258 ± 0.909
0.0HisXaa: 0.0 ± 0.0
Ile
4.531IleAla: 4.531 ± 0.708
1.007IleCys: 1.007 ± 0.621
3.272IleAsp: 3.272 ± 1.66
5.789IleGlu: 5.789 ± 1.07
3.775IlePhe: 3.775 ± 1.489
3.775IleGly: 3.775 ± 1.328
1.762IleHis: 1.762 ± 0.662
5.034IleIle: 5.034 ± 0.67
8.809IleLys: 8.809 ± 3.48
9.565IleLeu: 9.565 ± 1.327
2.517IleMet: 2.517 ± 0.924
4.782IleAsn: 4.782 ± 0.574
3.02IlePro: 3.02 ± 1.031
2.517IleGln: 2.517 ± 0.993
4.531IleArg: 4.531 ± 1.185
5.286IleSer: 5.286 ± 1.829
4.782IleThr: 4.782 ± 2.625
4.027IleVal: 4.027 ± 1.499
0.755IleTrp: 0.755 ± 0.266
2.265IleTyr: 2.265 ± 1.526
0.0IleXaa: 0.0 ± 0.0
Lys
3.02LysAla: 3.02 ± 1.031
2.769LysCys: 2.769 ± 1.633
4.782LysAsp: 4.782 ± 0.574
8.558LysGlu: 8.558 ± 0.872
3.524LysPhe: 3.524 ± 1.137
4.782LysGly: 4.782 ± 0.779
1.762LysHis: 1.762 ± 0.882
5.789LysIle: 5.789 ± 1.842
5.034LysLys: 5.034 ± 1.27
7.803LysLeu: 7.803 ± 1.017
3.272LysMet: 3.272 ± 0.593
3.775LysAsn: 3.775 ± 0.339
2.265LysPro: 2.265 ± 0.566
2.014LysGln: 2.014 ± 1.655
1.51LysArg: 1.51 ± 0.531
5.034LysSer: 5.034 ± 0.687
6.041LysThr: 6.041 ± 2.633
3.524LysVal: 3.524 ± 2.268
0.755LysTrp: 0.755 ± 0.917
2.769LysTyr: 2.769 ± 1.063
0.0LysXaa: 0.0 ± 0.0
Leu
5.789LeuAla: 5.789 ± 2.741
2.265LeuCys: 2.265 ± 1.475
6.041LeuAsp: 6.041 ± 2.419
6.796LeuGlu: 6.796 ± 0.551
4.027LeuPhe: 4.027 ± 0.469
2.517LeuGly: 2.517 ± 2.554
2.014LeuHis: 2.014 ± 1.6
8.558LeuIle: 8.558 ± 1.879
6.544LeuLys: 6.544 ± 1.537
10.823LeuLeu: 10.823 ± 1.659
1.51LeuMet: 1.51 ± 1.209
8.306LeuAsn: 8.306 ± 2.018
3.775LeuPro: 3.775 ± 2.448
3.272LeuGln: 3.272 ± 1.653
4.027LeuArg: 4.027 ± 3.26
8.558LeuSer: 8.558 ± 0.745
4.279LeuThr: 4.279 ± 2.109
3.775LeuVal: 3.775 ± 1.341
0.252LeuTrp: 0.252 ± 0.168
3.775LeuTyr: 3.775 ± 0.604
0.0LeuXaa: 0.0 ± 0.0
Met
1.007MetAla: 1.007 ± 0.8
1.007MetCys: 1.007 ± 0.621
1.007MetAsp: 1.007 ± 0.403
1.51MetGlu: 1.51 ± 0.531
1.258MetPhe: 1.258 ± 0.743
1.007MetGly: 1.007 ± 0.375
0.755MetHis: 0.755 ± 0.266
2.517MetIle: 2.517 ± 0.662
2.769MetLys: 2.769 ± 1.606
2.517MetLeu: 2.517 ± 0.384
1.258MetMet: 1.258 ± 0.841
1.762MetAsn: 1.762 ± 0.882
2.014MetPro: 2.014 ± 0.806
0.755MetGln: 0.755 ± 0.711
2.265MetArg: 2.265 ± 1.139
2.265MetSer: 2.265 ± 1.318
2.517MetThr: 2.517 ± 0.937
2.014MetVal: 2.014 ± 1.277
0.0MetTrp: 0.0 ± 0.0
0.503MetTyr: 0.503 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
2.769AsnAla: 2.769 ± 1.063
1.51AsnCys: 1.51 ± 1.089
4.279AsnAsp: 4.279 ± 1.603
3.272AsnGlu: 3.272 ± 0.336
2.517AsnPhe: 2.517 ± 1.115
2.014AsnGly: 2.014 ± 1.243
2.517AsnHis: 2.517 ± 0.635
3.775AsnIle: 3.775 ± 1.291
3.775AsnLys: 3.775 ± 1.341
5.789AsnLeu: 5.789 ± 1.134
3.524AsnMet: 3.524 ± 0.715
2.265AsnAsn: 2.265 ± 0.859
2.014AsnPro: 2.014 ± 0.778
2.517AsnGln: 2.517 ± 1.115
1.762AsnArg: 1.762 ± 1.012
3.524AsnSer: 3.524 ± 0.762
4.027AsnThr: 4.027 ± 1.556
2.014AsnVal: 2.014 ± 0.687
0.503AsnTrp: 0.503 ± 0.336
3.272AsnTyr: 3.272 ± 1.303
0.0AsnXaa: 0.0 ± 0.0
Pro
2.517ProAla: 2.517 ± 0.635
0.0ProCys: 0.0 ± 0.0
1.258ProAsp: 1.258 ± 0.739
2.769ProGlu: 2.769 ± 0.737
1.258ProPhe: 1.258 ± 0.568
1.762ProGly: 1.762 ± 0.727
0.252ProHis: 0.252 ± 0.237
3.272ProIle: 3.272 ± 0.552
3.524ProLys: 3.524 ± 0.436
2.769ProLeu: 2.769 ± 2.602
0.503ProMet: 0.503 ± 0.187
1.762ProAsn: 1.762 ± 0.662
0.503ProPro: 0.503 ± 0.336
1.007ProGln: 1.007 ± 0.621
1.258ProArg: 1.258 ± 0.428
2.517ProSer: 2.517 ± 1.115
2.014ProThr: 2.014 ± 1.108
2.517ProVal: 2.517 ± 0.399
0.755ProTrp: 0.755 ± 0.917
0.755ProTyr: 0.755 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
2.265GlnAla: 2.265 ± 1.531
0.503GlnCys: 0.503 ± 0.336
1.762GlnAsp: 1.762 ± 0.662
2.517GlnGlu: 2.517 ± 1.709
1.258GlnPhe: 1.258 ± 0.558
1.762GlnGly: 1.762 ± 0.62
0.252GlnHis: 0.252 ± 0.237
2.769GlnIle: 2.769 ± 0.984
2.265GlnLys: 2.265 ± 1.621
1.762GlnLeu: 1.762 ± 0.882
1.258GlnMet: 1.258 ± 0.558
2.265GlnAsn: 2.265 ± 1.499
0.755GlnPro: 0.755 ± 0.393
1.762GlnGln: 1.762 ± 0.569
2.265GlnArg: 2.265 ± 1.193
3.272GlnSer: 3.272 ± 1.465
3.524GlnThr: 3.524 ± 1.214
1.007GlnVal: 1.007 ± 0.403
0.503GlnTrp: 0.503 ± 0.187
1.258GlnTyr: 1.258 ± 0.739
0.0GlnXaa: 0.0 ± 0.0
Arg
1.51ArgAla: 1.51 ± 1.778
1.762ArgCys: 1.762 ± 0.605
3.272ArgAsp: 3.272 ± 0.996
1.762ArgGlu: 1.762 ± 1.177
1.51ArgPhe: 1.51 ± 0.64
1.258ArgGly: 1.258 ± 0.743
0.755ArgHis: 0.755 ± 0.504
4.782ArgIle: 4.782 ± 2.246
3.272ArgLys: 3.272 ± 0.552
3.272ArgLeu: 3.272 ± 1.366
2.265ArgMet: 2.265 ± 1.109
2.014ArgAsn: 2.014 ± 1.047
0.503ArgPro: 0.503 ± 0.998
1.762ArgGln: 1.762 ± 1.958
1.258ArgArg: 1.258 ± 0.558
3.775ArgSer: 3.775 ± 3.85
2.265ArgThr: 2.265 ± 0.959
2.769ArgVal: 2.769 ± 1.714
0.252ArgTrp: 0.252 ± 0.237
2.769ArgTyr: 2.769 ± 1.275
0.0ArgXaa: 0.0 ± 0.0
Ser
2.769SerAla: 2.769 ± 1.443
2.517SerCys: 2.517 ± 1.624
3.524SerAsp: 3.524 ± 1.516
3.775SerGlu: 3.775 ± 1.328
4.027SerPhe: 4.027 ± 0.871
2.265SerGly: 2.265 ± 1.76
2.265SerHis: 2.265 ± 1.104
5.286SerIle: 5.286 ± 1.801
6.041SerLys: 6.041 ± 0.747
9.061SerLeu: 9.061 ± 2.957
2.014SerMet: 2.014 ± 0.687
2.517SerAsn: 2.517 ± 0.937
2.517SerPro: 2.517 ± 1.115
2.014SerGln: 2.014 ± 0.499
4.782SerArg: 4.782 ± 1.057
5.789SerSer: 5.789 ± 8.856
5.034SerThr: 5.034 ± 1.216
5.789SerVal: 5.789 ± 0.808
0.503SerTrp: 0.503 ± 0.336
2.265SerTyr: 2.265 ± 1.179
0.0SerXaa: 0.0 ± 0.0
Thr
4.782ThrAla: 4.782 ± 0.677
1.258ThrCys: 1.258 ± 1.185
2.265ThrAsp: 2.265 ± 0.797
3.524ThrGlu: 3.524 ± 2.393
2.769ThrPhe: 2.769 ± 0.737
1.762ThrGly: 1.762 ± 0.749
1.762ThrHis: 1.762 ± 0.749
4.782ThrIle: 4.782 ± 1.048
3.524ThrLys: 3.524 ± 1.214
5.789ThrLeu: 5.789 ± 4.97
1.258ThrMet: 1.258 ± 0.739
2.769ThrAsn: 2.769 ± 0.971
3.524ThrPro: 3.524 ± 1.241
2.265ThrGln: 2.265 ± 0.787
2.265ThrArg: 2.265 ± 0.797
6.796ThrSer: 6.796 ± 0.816
3.02ThrThr: 3.02 ± 1.031
3.02ThrVal: 3.02 ± 0.268
1.258ThrTrp: 1.258 ± 1.845
3.272ThrTyr: 3.272 ± 1.114
0.0ThrXaa: 0.0 ± 0.0
Val
2.265ValAla: 2.265 ± 1.606
1.762ValCys: 1.762 ± 0.605
0.755ValAsp: 0.755 ± 0.393
3.02ValGlu: 3.02 ± 0.268
3.524ValPhe: 3.524 ± 0.789
3.775ValGly: 3.775 ± 1.489
0.503ValHis: 0.503 ± 0.336
4.531ValIle: 4.531 ± 1.573
4.782ValLys: 4.782 ± 2.066
4.531ValLeu: 4.531 ± 1.573
1.51ValMet: 1.51 ± 0.64
2.265ValAsn: 2.265 ± 1.789
2.014ValPro: 2.014 ± 0.778
2.769ValGln: 2.769 ± 1.118
1.258ValArg: 1.258 ± 2.847
5.537ValSer: 5.537 ± 0.925
3.272ValThr: 3.272 ± 0.647
2.265ValVal: 2.265 ± 0.537
0.252ValTrp: 0.252 ± 0.237
1.51ValTyr: 1.51 ± 1.006
0.0ValXaa: 0.0 ± 0.0
Trp
0.503TrpAla: 0.503 ± 0.97
0.0TrpCys: 0.0 ± 0.0
0.503TrpAsp: 0.503 ± 0.336
0.755TrpGlu: 0.755 ± 0.504
0.503TrpPhe: 0.503 ± 0.187
1.007TrpGly: 1.007 ± 0.867
0.0TrpHis: 0.0 ± 0.0
0.755TrpIle: 0.755 ± 0.393
0.252TrpLys: 0.252 ± 1.048
1.762TrpLeu: 1.762 ± 0.605
0.252TrpMet: 0.252 ± 1.048
0.503TrpAsn: 0.503 ± 0.336
0.0TrpPro: 0.0 ± 0.0
0.755TrpGln: 0.755 ± 0.504
0.0TrpArg: 0.0 ± 0.0
1.007TrpSer: 1.007 ± 0.672
0.0TrpThr: 0.0 ± 0.0
0.252TrpVal: 0.252 ± 0.168
0.0TrpTrp: 0.0 ± 0.0
0.503TrpTyr: 0.503 ± 0.336
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.007TyrAla: 1.007 ± 0.867
1.258TyrCys: 1.258 ± 1.185
1.258TyrAsp: 1.258 ± 0.428
2.769TyrGlu: 2.769 ± 1.118
2.265TyrPhe: 2.265 ± 0.787
2.014TyrGly: 2.014 ± 0.959
1.007TyrHis: 1.007 ± 0.375
3.775TyrIle: 3.775 ± 0.604
3.775TyrLys: 3.775 ± 0.364
3.524TyrLeu: 3.524 ± 0.258
1.51TyrMet: 1.51 ± 0.531
2.014TyrAsn: 2.014 ± 0.75
1.51TyrPro: 1.51 ± 0.714
1.258TyrGln: 1.258 ± 0.739
1.51TyrArg: 1.51 ± 0.714
2.517TyrSer: 2.517 ± 1.136
2.769TyrThr: 2.769 ± 1.275
1.51TyrVal: 1.51 ± 0.786
0.503TyrTrp: 0.503 ± 0.336
2.014TyrTyr: 2.014 ± 0.75
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3974 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski