Amino acid dipepetide frequency for Yellowstone Lake virophage 6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.136AlaAla: 0.136 ± 0.122
0.272AlaCys: 0.272 ± 0.244
1.903AlaAsp: 1.903 ± 0.522
1.224AlaGlu: 1.224 ± 0.446
1.496AlaPhe: 1.496 ± 0.532
5.846AlaGly: 5.846 ± 1.859
0.272AlaHis: 0.272 ± 0.146
3.127AlaIle: 3.127 ± 0.671
2.719AlaLys: 2.719 ± 0.813
4.079AlaLeu: 4.079 ± 0.7
0.816AlaMet: 0.816 ± 0.355
2.447AlaAsn: 2.447 ± 0.701
2.039AlaPro: 2.039 ± 0.627
0.952AlaGln: 0.952 ± 0.404
2.039AlaArg: 2.039 ± 0.511
6.526AlaSer: 6.526 ± 2.422
3.943AlaThr: 3.943 ± 1.686
3.127AlaVal: 3.127 ± 0.68
0.544AlaTrp: 0.544 ± 0.257
2.175AlaTyr: 2.175 ± 0.825
0.0AlaXaa: 0.0 ± 0.0
Cys
0.544CysAla: 0.544 ± 0.266
0.408CysCys: 0.408 ± 0.218
0.816CysAsp: 0.816 ± 0.437
0.68CysGlu: 0.68 ± 0.332
0.408CysPhe: 0.408 ± 0.261
0.816CysGly: 0.816 ± 0.457
0.544CysHis: 0.544 ± 0.25
0.544CysIle: 0.544 ± 0.365
0.816CysLys: 0.816 ± 0.524
1.36CysLeu: 1.36 ± 0.505
0.136CysMet: 0.136 ± 0.113
1.36CysAsn: 1.36 ± 0.41
0.272CysPro: 0.272 ± 0.138
0.136CysGln: 0.136 ± 0.103
0.136CysArg: 0.136 ± 0.113
0.544CysSer: 0.544 ± 0.261
0.136CysThr: 0.136 ± 0.173
0.816CysVal: 0.816 ± 0.291
0.0CysTrp: 0.0 ± 0.0
0.408CysTyr: 0.408 ± 0.255
0.0CysXaa: 0.0 ± 0.0
Asp
1.903AspAla: 1.903 ± 0.662
1.088AspCys: 1.088 ± 0.574
3.535AspAsp: 3.535 ± 1.133
4.623AspGlu: 4.623 ± 1.16
2.447AspPhe: 2.447 ± 0.921
2.039AspGly: 2.039 ± 0.508
0.0AspHis: 0.0 ± 0.0
5.031AspIle: 5.031 ± 0.846
5.982AspLys: 5.982 ± 1.587
4.215AspLeu: 4.215 ± 0.683
1.088AspMet: 1.088 ± 0.308
3.127AspAsn: 3.127 ± 0.698
1.632AspPro: 1.632 ± 0.443
0.952AspGln: 0.952 ± 0.328
0.408AspArg: 0.408 ± 0.296
2.311AspSer: 2.311 ± 0.554
4.215AspThr: 4.215 ± 0.837
3.399AspVal: 3.399 ± 0.678
0.272AspTrp: 0.272 ± 0.19
3.399AspTyr: 3.399 ± 0.979
0.0AspXaa: 0.0 ± 0.0
Glu
2.855GluAla: 2.855 ± 0.785
0.816GluCys: 0.816 ± 0.374
2.855GluAsp: 2.855 ± 0.786
3.943GluGlu: 3.943 ± 1.15
2.447GluPhe: 2.447 ± 0.798
1.903GluGly: 1.903 ± 0.5
0.68GluHis: 0.68 ± 0.36
3.671GluIle: 3.671 ± 0.801
5.71GluLys: 5.71 ± 1.325
6.39GluLeu: 6.39 ± 1.466
1.088GluMet: 1.088 ± 0.392
4.079GluAsn: 4.079 ± 1.073
2.175GluPro: 2.175 ± 0.955
1.36GluGln: 1.36 ± 0.604
1.903GluArg: 1.903 ± 0.695
2.855GluSer: 2.855 ± 0.736
2.991GluThr: 2.991 ± 0.545
2.311GluVal: 2.311 ± 0.624
0.408GluTrp: 0.408 ± 0.209
3.807GluTyr: 3.807 ± 1.295
0.0GluXaa: 0.0 ± 0.0
Phe
0.952PheAla: 0.952 ± 0.427
0.136PheCys: 0.136 ± 0.113
2.311PheAsp: 2.311 ± 0.611
1.496PheGlu: 1.496 ± 0.488
1.224PhePhe: 1.224 ± 0.513
1.768PheGly: 1.768 ± 0.64
0.68PheHis: 0.68 ± 0.394
4.079PheIle: 4.079 ± 0.943
3.263PheLys: 3.263 ± 0.948
3.127PheLeu: 3.127 ± 0.729
0.544PheMet: 0.544 ± 0.265
3.671PheAsn: 3.671 ± 0.668
0.952PhePro: 0.952 ± 0.214
1.224PheGln: 1.224 ± 0.483
1.36PheArg: 1.36 ± 0.449
2.991PheSer: 2.991 ± 0.47
3.943PheThr: 3.943 ± 1.265
2.447PheVal: 2.447 ± 0.603
0.136PheTrp: 0.136 ± 0.134
1.768PheTyr: 1.768 ± 0.51
0.0PheXaa: 0.0 ± 0.0
Gly
3.263GlyAla: 3.263 ± 1.164
0.952GlyCys: 0.952 ± 0.314
2.583GlyAsp: 2.583 ± 0.656
1.632GlyGlu: 1.632 ± 0.482
2.175GlyPhe: 2.175 ± 0.501
5.438GlyGly: 5.438 ± 1.178
0.272GlyHis: 0.272 ± 0.209
4.351GlyIle: 4.351 ± 1.154
2.991GlyLys: 2.991 ± 0.739
3.943GlyLeu: 3.943 ± 0.572
0.408GlyMet: 0.408 ± 0.343
5.574GlyAsn: 5.574 ± 1.518
1.224GlyPro: 1.224 ± 0.46
2.039GlyGln: 2.039 ± 0.69
2.175GlyArg: 2.175 ± 0.383
7.342GlySer: 7.342 ± 2.1
7.614GlyThr: 7.614 ± 3.285
2.583GlyVal: 2.583 ± 0.746
0.408GlyTrp: 0.408 ± 0.217
2.175GlyTyr: 2.175 ± 0.525
0.0GlyXaa: 0.0 ± 0.0
His
0.272HisAla: 0.272 ± 0.204
0.136HisCys: 0.136 ± 0.128
0.272HisAsp: 0.272 ± 0.181
0.408HisGlu: 0.408 ± 0.249
0.272HisPhe: 0.272 ± 0.181
0.0HisGly: 0.0 ± 0.0
0.272HisHis: 0.272 ± 0.225
0.408HisIle: 0.408 ± 0.222
0.952HisLys: 0.952 ± 0.425
0.816HisLeu: 0.816 ± 0.472
0.136HisMet: 0.136 ± 0.136
0.68HisAsn: 0.68 ± 0.403
0.408HisPro: 0.408 ± 0.229
0.68HisGln: 0.68 ± 0.265
0.544HisArg: 0.544 ± 0.285
0.544HisSer: 0.544 ± 0.387
0.816HisThr: 0.816 ± 0.336
0.408HisVal: 0.408 ± 0.253
0.272HisTrp: 0.272 ± 0.346
0.136HisTyr: 0.136 ± 0.14
0.0HisXaa: 0.0 ± 0.0
Ile
3.535IleAla: 3.535 ± 0.648
0.544IleCys: 0.544 ± 0.335
5.438IleAsp: 5.438 ± 1.29
5.574IleGlu: 5.574 ± 1.132
3.263IlePhe: 3.263 ± 0.748
6.798IleGly: 6.798 ± 1.888
0.68IleHis: 0.68 ± 0.372
6.254IleIle: 6.254 ± 0.762
8.158IleLys: 8.158 ± 1.863
7.206IleLeu: 7.206 ± 1.064
1.088IleMet: 1.088 ± 0.581
9.517IleAsn: 9.517 ± 1.489
4.487IlePro: 4.487 ± 0.715
2.311IleGln: 2.311 ± 0.508
3.535IleArg: 3.535 ± 0.886
5.031IleSer: 5.031 ± 1.051
6.934IleThr: 6.934 ± 2.195
3.807IleVal: 3.807 ± 0.565
0.408IleTrp: 0.408 ± 0.249
4.215IleTyr: 4.215 ± 0.897
0.0IleXaa: 0.0 ± 0.0
Lys
3.127LysAla: 3.127 ± 0.782
0.68LysCys: 0.68 ± 0.37
5.71LysAsp: 5.71 ± 1.416
8.294LysGlu: 8.294 ± 2.186
2.855LysPhe: 2.855 ± 0.795
2.583LysGly: 2.583 ± 0.705
0.68LysHis: 0.68 ± 0.327
8.43LysIle: 8.43 ± 1.975
12.508LysLys: 12.508 ± 2.591
7.342LysLeu: 7.342 ± 1.482
2.991LysMet: 2.991 ± 0.95
6.934LysAsn: 6.934 ± 1.376
2.719LysPro: 2.719 ± 0.811
4.215LysGln: 4.215 ± 1.124
3.943LysArg: 3.943 ± 0.875
4.215LysSer: 4.215 ± 1.241
4.351LysThr: 4.351 ± 0.792
4.215LysVal: 4.215 ± 1.257
0.816LysTrp: 0.816 ± 0.372
4.623LysTyr: 4.623 ± 1.277
0.0LysXaa: 0.0 ± 0.0
Leu
4.079LeuAla: 4.079 ± 0.823
1.36LeuCys: 1.36 ± 0.377
3.943LeuAsp: 3.943 ± 0.796
7.07LeuGlu: 7.07 ± 1.611
3.943LeuPhe: 3.943 ± 0.965
4.079LeuGly: 4.079 ± 0.942
0.68LeuHis: 0.68 ± 0.338
7.206LeuIle: 7.206 ± 1.093
8.702LeuLys: 8.702 ± 1.95
6.39LeuLeu: 6.39 ± 1.087
1.36LeuMet: 1.36 ± 0.448
7.07LeuAsn: 7.07 ± 0.847
3.671LeuPro: 3.671 ± 0.841
3.399LeuGln: 3.399 ± 0.663
2.447LeuArg: 2.447 ± 0.566
8.294LeuSer: 8.294 ± 1.082
6.798LeuThr: 6.798 ± 1.299
3.807LeuVal: 3.807 ± 0.859
0.408LeuTrp: 0.408 ± 0.256
4.215LeuTyr: 4.215 ± 0.89
0.0LeuXaa: 0.0 ± 0.0
Met
0.408MetAla: 0.408 ± 0.22
0.136MetCys: 0.136 ± 0.113
0.68MetAsp: 0.68 ± 0.271
0.544MetGlu: 0.544 ± 0.384
0.272MetPhe: 0.272 ± 0.207
1.088MetGly: 1.088 ± 0.348
0.272MetHis: 0.272 ± 0.181
1.224MetIle: 1.224 ± 0.506
2.039MetLys: 2.039 ± 0.662
0.952MetLeu: 0.952 ± 0.362
0.544MetMet: 0.544 ± 0.285
2.039MetAsn: 2.039 ± 0.52
0.544MetPro: 0.544 ± 0.344
0.544MetGln: 0.544 ± 0.354
0.136MetArg: 0.136 ± 0.146
2.311MetSer: 2.311 ± 0.678
1.088MetThr: 1.088 ± 0.441
1.36MetVal: 1.36 ± 0.471
0.272MetTrp: 0.272 ± 0.214
0.272MetTyr: 0.272 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
1.768AsnAla: 1.768 ± 0.535
1.36AsnCys: 1.36 ± 0.571
3.535AsnAsp: 3.535 ± 0.703
3.263AsnGlu: 3.263 ± 0.705
2.991AsnPhe: 2.991 ± 0.761
3.399AsnGly: 3.399 ± 0.878
0.136AsnHis: 0.136 ± 0.154
8.566AsnIle: 8.566 ± 1.665
9.517AsnLys: 9.517 ± 2.559
8.566AsnLeu: 8.566 ± 0.704
1.088AsnMet: 1.088 ± 0.333
8.022AsnAsn: 8.022 ± 1.009
2.583AsnPro: 2.583 ± 0.749
2.175AsnGln: 2.175 ± 0.474
2.719AsnArg: 2.719 ± 0.516
7.206AsnSer: 7.206 ± 0.823
5.031AsnThr: 5.031 ± 1.157
3.127AsnVal: 3.127 ± 0.928
0.816AsnTrp: 0.816 ± 0.293
4.351AsnTyr: 4.351 ± 0.632
0.0AsnXaa: 0.0 ± 0.0
Pro
3.127ProAla: 3.127 ± 0.982
0.272ProCys: 0.272 ± 0.195
2.175ProAsp: 2.175 ± 0.871
1.36ProGlu: 1.36 ± 0.545
1.903ProPhe: 1.903 ± 0.497
0.0ProGly: 0.0 ± 0.0
0.272ProHis: 0.272 ± 0.196
3.127ProIle: 3.127 ± 0.733
2.311ProLys: 2.311 ± 0.931
2.719ProLeu: 2.719 ± 0.613
0.272ProMet: 0.272 ± 0.221
2.583ProAsn: 2.583 ± 0.608
1.903ProPro: 1.903 ± 0.633
1.224ProGln: 1.224 ± 0.428
1.088ProArg: 1.088 ± 0.39
3.127ProSer: 3.127 ± 0.787
2.719ProThr: 2.719 ± 0.83
2.583ProVal: 2.583 ± 0.601
0.272ProTrp: 0.272 ± 0.214
0.952ProTyr: 0.952 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
1.632GlnAla: 1.632 ± 0.555
0.408GlnCys: 0.408 ± 0.215
1.088GlnAsp: 1.088 ± 0.375
2.447GlnGlu: 2.447 ± 0.835
1.224GlnPhe: 1.224 ± 0.361
1.224GlnGly: 1.224 ± 0.35
0.0GlnHis: 0.0 ± 0.0
3.263GlnIle: 3.263 ± 0.568
3.535GlnLys: 3.535 ± 1.064
3.263GlnLeu: 3.263 ± 0.772
0.272GlnMet: 0.272 ± 0.23
1.632GlnAsn: 1.632 ± 0.419
0.544GlnPro: 0.544 ± 0.308
1.36GlnGln: 1.36 ± 0.478
1.496GlnArg: 1.496 ± 0.447
3.399GlnSer: 3.399 ± 0.814
2.039GlnThr: 2.039 ± 0.603
1.224GlnVal: 1.224 ± 0.336
0.0GlnTrp: 0.0 ± 0.0
2.855GlnTyr: 2.855 ± 0.643
0.0GlnXaa: 0.0 ± 0.0
Arg
1.36ArgAla: 1.36 ± 0.507
0.816ArgCys: 0.816 ± 0.383
1.496ArgAsp: 1.496 ± 0.505
2.175ArgGlu: 2.175 ± 0.813
1.36ArgPhe: 1.36 ± 0.355
1.496ArgGly: 1.496 ± 0.499
0.544ArgHis: 0.544 ± 0.303
4.079ArgIle: 4.079 ± 0.699
3.127ArgLys: 3.127 ± 0.992
2.855ArgLeu: 2.855 ± 0.696
1.36ArgMet: 1.36 ± 0.444
2.719ArgAsn: 2.719 ± 0.519
0.816ArgPro: 0.816 ± 0.295
0.68ArgGln: 0.68 ± 0.308
2.039ArgArg: 2.039 ± 0.604
1.632ArgSer: 1.632 ± 0.43
1.903ArgThr: 1.903 ± 0.503
0.952ArgVal: 0.952 ± 0.324
0.272ArgTrp: 0.272 ± 0.186
0.952ArgTyr: 0.952 ± 0.552
0.0ArgXaa: 0.0 ± 0.0
Ser
5.71SerAla: 5.71 ± 1.899
0.136SerCys: 0.136 ± 0.139
3.127SerAsp: 3.127 ± 0.619
3.399SerGlu: 3.399 ± 0.621
3.399SerPhe: 3.399 ± 0.89
8.022SerGly: 8.022 ± 2.732
0.816SerHis: 0.816 ± 0.518
7.07SerIle: 7.07 ± 1.44
4.759SerLys: 4.759 ± 1.215
6.798SerLeu: 6.798 ± 0.923
0.952SerMet: 0.952 ± 0.572
6.118SerAsn: 6.118 ± 1.009
2.039SerPro: 2.039 ± 0.499
3.943SerGln: 3.943 ± 0.611
1.768SerArg: 1.768 ± 0.482
8.566SerSer: 8.566 ± 1.908
8.838SerThr: 8.838 ± 2.241
3.535SerVal: 3.535 ± 0.739
1.088SerTrp: 1.088 ± 0.27
3.263SerTyr: 3.263 ± 0.951
0.0SerXaa: 0.0 ± 0.0
Thr
7.206ThrAla: 7.206 ± 2.836
0.136ThrCys: 0.136 ± 0.173
4.623ThrAsp: 4.623 ± 0.747
1.903ThrGlu: 1.903 ± 0.613
3.127ThrPhe: 3.127 ± 0.559
7.07ThrGly: 7.07 ± 2.631
0.68ThrHis: 0.68 ± 0.33
6.798ThrIle: 6.798 ± 1.411
3.263ThrLys: 3.263 ± 0.76
7.342ThrLeu: 7.342 ± 1.745
0.544ThrMet: 0.544 ± 0.238
3.943ThrAsn: 3.943 ± 0.569
2.175ThrPro: 2.175 ± 0.62
2.311ThrGln: 2.311 ± 0.542
2.039ThrArg: 2.039 ± 0.558
7.07ThrSer: 7.07 ± 2.604
9.789ThrThr: 9.789 ± 3.196
4.351ThrVal: 4.351 ± 1.077
0.544ThrTrp: 0.544 ± 0.232
3.263ThrTyr: 3.263 ± 0.636
0.0ThrXaa: 0.0 ± 0.0
Val
2.175ValAla: 2.175 ± 0.686
0.68ValCys: 0.68 ± 0.547
2.855ValAsp: 2.855 ± 0.695
2.039ValGlu: 2.039 ± 0.546
1.224ValPhe: 1.224 ± 0.363
4.079ValGly: 4.079 ± 0.688
0.408ValHis: 0.408 ± 0.218
4.215ValIle: 4.215 ± 0.83
4.215ValLys: 4.215 ± 0.905
6.118ValLeu: 6.118 ± 1.237
1.496ValMet: 1.496 ± 0.55
3.671ValAsn: 3.671 ± 0.572
2.447ValPro: 2.447 ± 0.543
1.224ValGln: 1.224 ± 0.387
1.088ValArg: 1.088 ± 0.364
5.303ValSer: 5.303 ± 0.977
1.36ValThr: 1.36 ± 0.507
2.991ValVal: 2.991 ± 0.811
0.408ValTrp: 0.408 ± 0.198
1.496ValTyr: 1.496 ± 0.467
0.0ValXaa: 0.0 ± 0.0
Trp
0.272TrpAla: 0.272 ± 0.152
0.272TrpCys: 0.272 ± 0.181
0.544TrpAsp: 0.544 ± 0.241
0.136TrpGlu: 0.136 ± 0.113
0.408TrpPhe: 0.408 ± 0.267
0.272TrpGly: 0.272 ± 0.211
0.136TrpHis: 0.136 ± 0.158
1.496TrpIle: 1.496 ± 0.429
0.816TrpLys: 0.816 ± 0.322
0.272TrpLeu: 0.272 ± 0.214
0.136TrpMet: 0.136 ± 0.131
0.408TrpAsn: 0.408 ± 0.219
0.0TrpPro: 0.0 ± 0.0
0.408TrpGln: 0.408 ± 0.221
0.408TrpArg: 0.408 ± 0.226
0.408TrpSer: 0.408 ± 0.271
0.68TrpThr: 0.68 ± 0.405
1.088TrpVal: 1.088 ± 0.357
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.36TyrAla: 1.36 ± 0.535
0.272TyrCys: 0.272 ± 0.138
2.175TyrAsp: 2.175 ± 0.572
2.039TyrGlu: 2.039 ± 0.644
1.632TyrPhe: 1.632 ± 0.375
1.224TyrGly: 1.224 ± 0.326
0.408TyrHis: 0.408 ± 0.236
5.846TyrIle: 5.846 ± 0.738
5.846TyrLys: 5.846 ± 1.238
5.303TyrLeu: 5.303 ± 0.795
0.272TyrMet: 0.272 ± 0.194
4.623TyrAsn: 4.623 ± 1.074
1.36TyrPro: 1.36 ± 0.513
1.768TyrGln: 1.768 ± 0.638
1.36TyrArg: 1.36 ± 0.412
3.671TyrSer: 3.671 ± 0.518
2.991TyrThr: 2.991 ± 0.622
1.36TyrVal: 1.36 ± 0.454
0.816TyrTrp: 0.816 ± 0.327
2.991TyrTyr: 2.991 ± 0.793
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 29 proteins (7356 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski