Amino acid dipeptide frequency for Roseovarius Plymouth podovirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
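The counting described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: it assumes one-letter sequences, overlapping windows of two residues, and that pairs never span two proteins. The function name `dipeptide_per_mille` and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Assumption: dipeptides are overlapping residue pairs counted within
    each protein separately (a pair never spans two proteins).
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input: "MAAK" gives MA, AA, AK; "AAG" gives AA, AG.
toy_proteins = ["MAAK", "AAG"]
freqs = dipeptide_per_mille(toy_proteins)

# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform_expectation = 1000.0 / 400

for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
print(f"uniform expectation: {uniform_expectation} per mille")
```

With the toy input, AA accounts for two of the five pairs, so its frequency is far above the uniform expectation of 2.5‰; real proteomes show smaller but systematic deviations of the same kind.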

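All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions. For example, the AlaAla value of 7.563‰ corresponds to a fraction of 0.007563, i.e. about 0.76%.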
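As a small reading aid, the sketch below converts a few per mille values taken from the table into fractions and compares them with the uniform expectation of 2.5‰. The plain threshold comparison is an assumption: the page does not state the exact rule behind the red and blue marking (for instance, whether the error estimate is taken into account). The helper names are hypothetical.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille value against the expectation.

    Assumption: a plain comparison, ignoring the reported error.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table in this document (per mille).
table_values = {"AlaAla": 7.563, "CysCys": 0.085, "TrpLeu": 2.028}

for name, value in table_values.items():
    fraction = per_mille_to_fraction(value)
    print(f"{name}: {value} per mille = {fraction:.6f} ({representation(value)})")
```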

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 7.563 ± 0.83
AlaCys: 0.38 ± 0.165
AlaAsp: 5.493 ± 0.507
AlaGlu: 6.296 ± 0.685
AlaPhe: 2.366 ± 0.341
AlaGly: 6.0 ± 0.652
AlaHis: 1.268 ± 0.227
AlaIle: 4.31 ± 0.54
AlaLys: 5.028 ± 0.385
AlaLeu: 6.169 ± 0.461
AlaMet: 2.408 ± 0.287
AlaAsn: 3.549 ± 0.395
AlaPro: 3.634 ± 0.471
AlaGln: 2.831 ± 0.449
AlaArg: 3.507 ± 0.464
AlaSer: 4.225 ± 0.41
AlaThr: 4.986 ± 0.493
AlaVal: 5.789 ± 0.655
AlaTrp: 1.141 ± 0.207
AlaTyr: 2.873 ± 0.408
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.38 ± 0.138
CysCys: 0.085 ± 0.053
CysAsp: 0.465 ± 0.184
CysGlu: 0.718 ± 0.199
CysPhe: 0.296 ± 0.115
CysGly: 0.676 ± 0.234
CysHis: 0.38 ± 0.119
CysIle: 0.127 ± 0.108
CysLys: 0.676 ± 0.242
CysLeu: 0.465 ± 0.169
CysMet: 0.169 ± 0.101
CysAsn: 0.169 ± 0.081
CysPro: 0.38 ± 0.165
CysGln: 0.423 ± 0.151
CysArg: 0.38 ± 0.187
CysSer: 0.38 ± 0.159
CysThr: 0.423 ± 0.191
CysVal: 0.254 ± 0.123
CysTrp: 0.127 ± 0.08
CysTyr: 0.254 ± 0.116
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.859 ± 0.402
AspCys: 0.549 ± 0.196
AspAsp: 3.972 ± 0.521
AspGlu: 4.648 ± 0.496
AspPhe: 3.169 ± 0.398
AspGly: 4.606 ± 0.395
AspHis: 1.268 ± 0.269
AspIle: 4.268 ± 0.51
AspLys: 3.465 ± 0.403
AspLeu: 6.929 ± 0.525
AspMet: 1.986 ± 0.355
AspAsn: 3.042 ± 0.32
AspPro: 4.099 ± 0.499
AspGln: 2.493 ± 0.431
AspArg: 2.831 ± 0.322
AspSer: 3.042 ± 0.434
AspThr: 4.225 ± 0.398
AspVal: 4.183 ± 0.681
AspTrp: 0.93 ± 0.167
AspTyr: 2.028 ± 0.299
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.873 ± 0.611
GluCys: 0.169 ± 0.081
GluAsp: 5.07 ± 0.388
GluGlu: 6.803 ± 0.653
GluPhe: 2.62 ± 0.429
GluGly: 4.986 ± 0.729
GluHis: 1.056 ± 0.199
GluIle: 4.775 ± 0.43
GluLys: 3.676 ± 0.376
GluLeu: 5.831 ± 0.52
GluMet: 2.915 ± 0.327
GluAsn: 3.38 ± 0.352
GluPro: 2.451 ± 0.375
GluGln: 3.591 ± 0.439
GluArg: 3.38 ± 0.573
GluSer: 3.972 ± 0.476
GluThr: 4.352 ± 0.506
GluVal: 5.028 ± 0.529
GluTrp: 1.014 ± 0.224
GluTyr: 2.239 ± 0.325
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.493 ± 0.308
PheCys: 0.127 ± 0.067
PheAsp: 2.535 ± 0.393
PheGlu: 2.493 ± 0.332
PhePhe: 1.563 ± 0.248
PheGly: 2.831 ± 0.369
PheHis: 0.634 ± 0.202
PheIle: 2.282 ± 0.461
PheLys: 2.62 ± 0.257
PheLeu: 2.915 ± 0.38
PheMet: 1.69 ± 0.277
PheAsn: 2.07 ± 0.346
PhePro: 1.521 ± 0.209
PheGln: 1.69 ± 0.251
PheArg: 1.352 ± 0.204
PheSer: 2.155 ± 0.267
PheThr: 2.239 ± 0.29
PheVal: 2.704 ± 0.392
PheTrp: 0.423 ± 0.119
PheTyr: 1.268 ± 0.246
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.831 ± 0.555
GlyCys: 0.592 ± 0.239
GlyAsp: 3.803 ± 0.366
GlyGlu: 5.282 ± 0.496
GlyPhe: 3.042 ± 0.477
GlyGly: 5.62 ± 0.799
GlyHis: 1.31 ± 0.256
GlyIle: 4.31 ± 0.518
GlyLys: 4.141 ± 0.484
GlyLeu: 5.577 ± 0.581
GlyMet: 2.577 ± 0.411
GlyAsn: 3.211 ± 0.298
GlyPro: 3.211 ± 0.81
GlyGln: 2.662 ± 0.281
GlyArg: 3.169 ± 0.352
GlySer: 4.648 ± 0.496
GlyThr: 5.155 ± 0.605
GlyVal: 5.155 ± 0.535
GlyTrp: 1.099 ± 0.243
GlyTyr: 2.789 ± 0.39
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.31 ± 0.215
HisCys: 0.211 ± 0.095
HisAsp: 0.93 ± 0.246
HisGlu: 1.521 ± 0.282
HisPhe: 1.225 ± 0.326
HisGly: 1.225 ± 0.327
HisHis: 0.761 ± 0.242
HisIle: 1.31 ± 0.263
HisLys: 1.141 ± 0.251
HisLeu: 1.648 ± 0.342
HisMet: 0.338 ± 0.116
HisAsn: 0.845 ± 0.175
HisPro: 0.803 ± 0.205
HisGln: 0.254 ± 0.103
HisArg: 1.183 ± 0.258
HisSer: 0.93 ± 0.196
HisThr: 1.014 ± 0.207
HisVal: 1.31 ± 0.292
HisTrp: 0.254 ± 0.106
HisTyr: 1.056 ± 0.24
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.437 ± 0.466
IleCys: 0.254 ± 0.116
IleAsp: 3.676 ± 0.35
IleGlu: 3.591 ± 0.428
IlePhe: 2.197 ± 0.35
IleGly: 3.422 ± 0.41
IleHis: 1.31 ± 0.255
IleIle: 2.915 ± 0.358
IleLys: 2.62 ± 0.389
IleLeu: 4.479 ± 0.474
IleMet: 2.028 ± 0.315
IleAsn: 2.451 ± 0.282
IlePro: 3.0 ± 0.392
IleGln: 2.915 ± 0.378
IleArg: 3.127 ± 0.368
IleSer: 3.0 ± 0.346
IleThr: 3.591 ± 0.36
IleVal: 3.507 ± 0.384
IleTrp: 0.803 ± 0.227
IleTyr: 2.324 ± 0.362
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.648 ± 0.496
LysCys: 0.507 ± 0.177
LysAsp: 3.127 ± 0.441
LysGlu: 4.817 ± 0.424
LysPhe: 1.437 ± 0.234
LysGly: 3.972 ± 0.349
LysHis: 1.31 ± 0.331
LysIle: 2.873 ± 0.332
LysLys: 3.549 ± 0.523
LysLeu: 4.479 ± 0.567
LysMet: 1.901 ± 0.246
LysAsn: 2.662 ± 0.354
LysPro: 2.662 ± 0.404
LysGln: 2.282 ± 0.28
LysArg: 2.662 ± 0.374
LysSer: 3.38 ± 0.419
LysThr: 3.718 ± 0.488
LysVal: 3.549 ± 0.385
LysTrp: 0.887 ± 0.181
LysTyr: 1.394 ± 0.264
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.789 ± 0.832
LeuCys: 1.099 ± 0.318
LeuAsp: 5.324 ± 0.48
LeuGlu: 5.535 ± 0.428
LeuPhe: 2.704 ± 0.287
LeuGly: 4.775 ± 0.492
LeuHis: 1.352 ± 0.277
LeuIle: 4.014 ± 0.436
LeuLys: 4.606 ± 0.497
LeuLeu: 5.746 ± 0.639
LeuMet: 2.915 ± 0.353
LeuAsn: 4.69 ± 0.385
LeuPro: 3.845 ± 0.362
LeuGln: 4.099 ± 0.347
LeuArg: 3.887 ± 0.403
LeuSer: 5.789 ± 0.497
LeuThr: 5.535 ± 0.61
LeuVal: 4.268 ± 0.516
LeuTrp: 1.183 ± 0.233
LeuTyr: 2.366 ± 0.459
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.958 ± 0.389
MetCys: 0.254 ± 0.17
MetAsp: 1.986 ± 0.283
MetGlu: 1.901 ± 0.263
MetPhe: 1.014 ± 0.212
MetGly: 2.155 ± 0.33
MetHis: 0.338 ± 0.139
MetIle: 1.817 ± 0.283
MetLys: 1.69 ± 0.238
MetLeu: 2.535 ± 0.281
MetMet: 0.972 ± 0.225
MetAsn: 2.408 ± 0.434
MetPro: 1.521 ± 0.257
MetGln: 1.986 ± 0.37
MetArg: 1.69 ± 0.235
MetSer: 2.239 ± 0.358
MetThr: 1.69 ± 0.267
MetVal: 2.577 ± 0.393
MetTrp: 0.338 ± 0.124
MetTyr: 0.845 ± 0.155
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.591 ± 0.531
AsnCys: 0.338 ± 0.112
AsnAsp: 3.803 ± 0.403
AsnGlu: 3.127 ± 0.483
AsnPhe: 2.324 ± 0.302
AsnGly: 3.718 ± 0.517
AsnHis: 0.972 ± 0.237
AsnIle: 2.451 ± 0.337
AsnLys: 2.789 ± 0.443
AsnLeu: 4.099 ± 0.37
AsnMet: 1.521 ± 0.307
AsnAsn: 2.113 ± 0.338
AsnPro: 2.62 ± 0.32
AsnGln: 2.831 ± 0.353
AsnArg: 2.535 ± 0.313
AsnSer: 1.986 ± 0.365
AsnThr: 3.465 ± 0.385
AsnVal: 3.042 ± 0.34
AsnTrp: 0.634 ± 0.167
AsnTyr: 1.394 ± 0.309
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.253 ± 0.448
ProCys: 0.38 ± 0.132
ProAsp: 3.549 ± 0.389
ProGlu: 3.93 ± 0.394
ProPhe: 1.859 ± 0.326
ProGly: 3.0 ± 0.295
ProHis: 0.803 ± 0.168
ProIle: 2.239 ± 0.308
ProLys: 2.873 ± 0.34
ProLeu: 2.704 ± 0.399
ProMet: 1.183 ± 0.196
ProAsn: 1.901 ± 0.314
ProPro: 1.268 ± 0.239
ProGln: 2.535 ± 0.33
ProArg: 1.521 ± 0.224
ProSer: 3.0 ± 0.398
ProThr: 3.253 ± 0.31
ProVal: 2.915 ± 0.32
ProTrp: 0.592 ± 0.148
ProTyr: 0.887 ± 0.228
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.479 ± 0.48
GlnCys: 0.211 ± 0.093
GlnAsp: 2.662 ± 0.263
GlnGlu: 3.169 ± 0.304
GlnPhe: 1.521 ± 0.258
GlnGly: 4.014 ± 0.717
GlnHis: 0.761 ± 0.167
GlnIle: 2.535 ± 0.325
GlnLys: 2.451 ± 0.301
GlnLeu: 3.169 ± 0.421
GlnMet: 1.521 ± 0.273
GlnAsn: 2.282 ± 0.216
GlnPro: 1.648 ± 0.393
GlnGln: 1.563 ± 0.278
GlnArg: 2.789 ± 0.393
GlnSer: 1.944 ± 0.344
GlnThr: 2.746 ± 0.41
GlnVal: 2.789 ± 0.357
GlnTrp: 0.549 ± 0.158
GlnTyr: 1.606 ± 0.273
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.211 ± 0.378
ArgCys: 0.38 ± 0.165
ArgAsp: 3.042 ± 0.343
ArgGlu: 3.676 ± 0.587
ArgPhe: 2.239 ± 0.324
ArgGly: 2.958 ± 0.302
ArgHis: 0.887 ± 0.204
ArgIle: 2.831 ± 0.388
ArgLys: 2.662 ± 0.358
ArgLeu: 4.563 ± 0.429
ArgMet: 2.282 ± 0.33
ArgAsn: 2.577 ± 0.335
ArgPro: 1.225 ± 0.231
ArgGln: 2.493 ± 0.35
ArgArg: 2.662 ± 0.322
ArgSer: 2.704 ± 0.446
ArgThr: 3.084 ± 0.356
ArgVal: 3.084 ± 0.417
ArgTrp: 0.718 ± 0.173
ArgTyr: 1.775 ± 0.228
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.056 ± 0.491
SerCys: 0.296 ± 0.114
SerAsp: 3.93 ± 0.492
SerGlu: 3.845 ± 0.432
SerPhe: 1.859 ± 0.275
SerGly: 5.113 ± 0.456
SerHis: 0.972 ± 0.222
SerIle: 2.662 ± 0.312
SerLys: 2.746 ± 0.346
SerLeu: 4.521 ± 0.437
SerMet: 1.437 ± 0.243
SerAsn: 2.831 ± 0.303
SerPro: 2.535 ± 0.341
SerGln: 2.197 ± 0.468
SerArg: 3.042 ± 0.353
SerSer: 3.338 ± 0.382
SerThr: 3.676 ± 0.442
SerVal: 4.648 ± 0.436
SerTrp: 0.634 ± 0.214
SerTyr: 1.732 ± 0.277
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.915 ± 0.518
ThrCys: 0.296 ± 0.13
ThrAsp: 4.31 ± 0.316
ThrGlu: 4.014 ± 0.455
ThrPhe: 2.282 ± 0.31
ThrGly: 5.831 ± 0.525
ThrHis: 1.31 ± 0.303
ThrIle: 3.507 ± 0.38
ThrLys: 3.718 ± 0.382
ThrLeu: 4.732 ± 0.372
ThrMet: 1.775 ± 0.29
ThrAsn: 3.042 ± 0.383
ThrPro: 2.831 ± 0.43
ThrGln: 2.577 ± 0.35
ThrArg: 3.253 ± 0.301
ThrSer: 3.169 ± 0.355
ThrThr: 3.465 ± 0.382
ThrVal: 4.437 ± 0.348
ThrTrp: 0.887 ± 0.171
ThrTyr: 2.366 ± 0.27
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.408 ± 0.558
ValCys: 0.549 ± 0.167
ValAsp: 4.732 ± 0.386
ValGlu: 4.31 ± 0.421
ValPhe: 2.113 ± 0.306
ValGly: 5.028 ± 0.75
ValHis: 1.521 ± 0.282
ValIle: 3.676 ± 0.439
ValLys: 3.169 ± 0.435
ValLeu: 4.859 ± 0.494
ValMet: 2.324 ± 0.369
ValAsn: 3.422 ± 0.435
ValPro: 2.873 ± 0.315
ValGln: 2.451 ± 0.308
ValArg: 3.803 ± 0.39
ValSer: 3.93 ± 0.425
ValThr: 4.394 ± 0.52
ValVal: 3.845 ± 0.465
ValTrp: 0.972 ± 0.185
ValTyr: 2.535 ± 0.336
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.014 ± 0.216
TrpCys: 0.169 ± 0.088
TrpAsp: 1.31 ± 0.253
TrpGlu: 0.972 ± 0.225
TrpPhe: 0.634 ± 0.153
TrpGly: 1.014 ± 0.274
TrpHis: 0.338 ± 0.148
TrpIle: 0.761 ± 0.179
TrpLys: 0.803 ± 0.189
TrpLeu: 2.028 ± 0.434
TrpMet: 0.127 ± 0.086
TrpAsn: 0.718 ± 0.168
TrpPro: 0.211 ± 0.087
TrpGln: 0.676 ± 0.165
TrpArg: 0.676 ± 0.189
TrpSer: 0.423 ± 0.133
TrpThr: 0.887 ± 0.204
TrpVal: 0.761 ± 0.2
TrpTrp: 0.296 ± 0.13
TrpTyr: 0.254 ± 0.084
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.746 ± 0.318
TyrCys: 0.38 ± 0.133
TyrAsp: 2.873 ± 0.35
TyrGlu: 2.535 ± 0.38
TyrPhe: 1.141 ± 0.225
TyrGly: 2.239 ± 0.423
TyrHis: 0.718 ± 0.196
TyrIle: 1.986 ± 0.325
TyrLys: 1.394 ± 0.226
TyrLeu: 2.07 ± 0.301
TyrMet: 0.803 ± 0.218
TyrAsn: 1.986 ± 0.259
TyrPro: 1.31 ± 0.27
TyrGln: 1.859 ± 0.33
TyrArg: 1.648 ± 0.218
TyrSer: 1.817 ± 0.27
TyrThr: 1.817 ± 0.314
TyrVal: 2.07 ± 0.397
TyrTrp: 0.592 ± 0.21
TyrTyr: 1.014 ± 0.189
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 91 proteins (23668 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
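A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the per mille frequency is recomputed for each resample, and the reported error is taken to be the sample standard deviation of those estimates. The page does not say which spread statistic is used, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the spread is summarised as a sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code.
toy_proteins = ["MAAK", "AAG", "MKKAA", "GAGA"]
point_estimate = pair_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point_estimate:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.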

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski