Amino acid dipepetide frequency for Dinoroseobacter phage DFL12phi1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.663AlaAla: 7.663 ± 1.123
0.379AlaCys: 0.379 ± 0.161
5.558AlaAsp: 5.558 ± 0.569
6.653AlaGlu: 6.653 ± 0.608
2.611AlaPhe: 2.611 ± 0.307
5.769AlaGly: 5.769 ± 0.517
1.263AlaHis: 1.263 ± 0.192
3.832AlaIle: 3.832 ± 0.394
5.305AlaLys: 5.305 ± 0.581
6.99AlaLeu: 6.99 ± 0.575
2.653AlaMet: 2.653 ± 0.382
4.463AlaAsn: 4.463 ± 0.475
3.074AlaPro: 3.074 ± 0.373
3.874AlaGln: 3.874 ± 0.619
3.453AlaArg: 3.453 ± 0.527
4.421AlaSer: 4.421 ± 0.445
5.053AlaThr: 5.053 ± 0.494
5.558AlaVal: 5.558 ± 0.598
0.8AlaTrp: 0.8 ± 0.21
2.611AlaTyr: 2.611 ± 0.231
0.0AlaXaa: 0.0 ± 0.0
Cys
0.505CysAla: 0.505 ± 0.151
0.084CysCys: 0.084 ± 0.063
0.379CysAsp: 0.379 ± 0.14
0.632CysGlu: 0.632 ± 0.18
0.253CysPhe: 0.253 ± 0.102
0.589CysGly: 0.589 ± 0.202
0.211CysHis: 0.211 ± 0.094
0.211CysIle: 0.211 ± 0.105
0.632CysLys: 0.632 ± 0.19
0.379CysLeu: 0.379 ± 0.134
0.168CysMet: 0.168 ± 0.092
0.211CysAsn: 0.211 ± 0.102
0.379CysPro: 0.379 ± 0.191
0.295CysGln: 0.295 ± 0.103
0.295CysArg: 0.295 ± 0.11
0.337CysSer: 0.337 ± 0.129
0.589CysThr: 0.589 ± 0.18
0.211CysVal: 0.211 ± 0.097
0.211CysTrp: 0.211 ± 0.093
0.421CysTyr: 0.421 ± 0.145
0.0CysXaa: 0.0 ± 0.0
Asp
4.8AspAla: 4.8 ± 0.48
0.379AspCys: 0.379 ± 0.13
3.621AspAsp: 3.621 ± 0.507
5.263AspGlu: 5.263 ± 0.446
3.074AspPhe: 3.074 ± 0.358
4.927AspGly: 4.927 ± 0.416
1.347AspHis: 1.347 ± 0.228
4.169AspIle: 4.169 ± 0.372
2.947AspLys: 2.947 ± 0.433
6.611AspLeu: 6.611 ± 0.554
2.063AspMet: 2.063 ± 0.268
2.4AspAsn: 2.4 ± 0.302
4.253AspPro: 4.253 ± 0.494
2.442AspGln: 2.442 ± 0.293
3.116AspArg: 3.116 ± 0.338
3.453AspSer: 3.453 ± 0.456
4.126AspThr: 4.126 ± 0.49
4.0AspVal: 4.0 ± 0.444
1.305AspTrp: 1.305 ± 0.251
2.063AspTyr: 2.063 ± 0.363
0.0AspXaa: 0.0 ± 0.0
Glu
6.19GluAla: 6.19 ± 0.726
0.463GluCys: 0.463 ± 0.176
4.548GluAsp: 4.548 ± 0.504
7.032GluGlu: 7.032 ± 0.662
3.032GluPhe: 3.032 ± 0.32
4.548GluGly: 4.548 ± 0.547
1.137GluHis: 1.137 ± 0.256
4.884GluIle: 4.884 ± 0.281
3.663GluLys: 3.663 ± 0.442
6.274GluLeu: 6.274 ± 0.569
3.326GluMet: 3.326 ± 0.416
3.958GluAsn: 3.958 ± 0.626
1.937GluPro: 1.937 ± 0.374
3.958GluGln: 3.958 ± 0.426
3.411GluArg: 3.411 ± 0.398
3.453GluSer: 3.453 ± 0.394
4.632GluThr: 4.632 ± 0.648
4.758GluVal: 4.758 ± 0.492
0.8GluTrp: 0.8 ± 0.176
2.274GluTyr: 2.274 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
2.653PheAla: 2.653 ± 0.295
0.379PheCys: 0.379 ± 0.141
2.4PheAsp: 2.4 ± 0.302
2.316PheGlu: 2.316 ± 0.305
1.642PhePhe: 1.642 ± 0.36
2.653PheGly: 2.653 ± 0.455
0.674PheHis: 0.674 ± 0.174
2.695PheIle: 2.695 ± 0.417
2.779PheLys: 2.779 ± 0.35
2.737PheLeu: 2.737 ± 0.326
1.474PheMet: 1.474 ± 0.272
2.442PheAsn: 2.442 ± 0.36
1.558PhePro: 1.558 ± 0.357
1.811PheGln: 1.811 ± 0.25
1.558PheArg: 1.558 ± 0.207
2.316PheSer: 2.316 ± 0.324
2.19PheThr: 2.19 ± 0.309
2.147PheVal: 2.147 ± 0.29
0.295PheTrp: 0.295 ± 0.134
1.305PheTyr: 1.305 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
5.516GlyAla: 5.516 ± 0.613
0.337GlyCys: 0.337 ± 0.147
4.379GlyAsp: 4.379 ± 0.449
4.969GlyGlu: 4.969 ± 0.622
3.2GlyPhe: 3.2 ± 0.358
6.527GlyGly: 6.527 ± 0.696
0.8GlyHis: 0.8 ± 0.182
3.663GlyIle: 3.663 ± 0.436
4.084GlyLys: 4.084 ± 0.427
5.811GlyLeu: 5.811 ± 0.481
1.937GlyMet: 1.937 ± 0.244
3.537GlyAsn: 3.537 ± 0.398
2.442GlyPro: 2.442 ± 0.485
3.074GlyGln: 3.074 ± 0.371
2.863GlyArg: 2.863 ± 0.391
4.884GlySer: 4.884 ± 0.404
5.179GlyThr: 5.179 ± 0.556
5.179GlyVal: 5.179 ± 0.529
1.39GlyTrp: 1.39 ± 0.309
2.232GlyTyr: 2.232 ± 0.258
0.0GlyXaa: 0.0 ± 0.0
His
1.474HisAla: 1.474 ± 0.305
0.042HisCys: 0.042 ± 0.047
1.179HisAsp: 1.179 ± 0.254
1.221HisGlu: 1.221 ± 0.251
0.884HisPhe: 0.884 ± 0.202
1.011HisGly: 1.011 ± 0.207
0.379HisHis: 0.379 ± 0.145
1.137HisIle: 1.137 ± 0.18
1.305HisLys: 1.305 ± 0.288
1.684HisLeu: 1.684 ± 0.336
0.379HisMet: 0.379 ± 0.13
0.8HisAsn: 0.8 ± 0.193
0.8HisPro: 0.8 ± 0.258
0.758HisGln: 0.758 ± 0.174
0.968HisArg: 0.968 ± 0.231
1.053HisSer: 1.053 ± 0.227
0.716HisThr: 0.716 ± 0.205
0.968HisVal: 0.968 ± 0.196
0.211HisTrp: 0.211 ± 0.093
0.968HisTyr: 0.968 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
4.674IleAla: 4.674 ± 0.56
0.379IleCys: 0.379 ± 0.165
3.705IleAsp: 3.705 ± 0.43
4.169IleGlu: 4.169 ± 0.328
1.937IlePhe: 1.937 ± 0.328
4.084IleGly: 4.084 ± 0.479
1.137IleHis: 1.137 ± 0.236
2.905IleIle: 2.905 ± 0.413
2.737IleLys: 2.737 ± 0.401
4.295IleLeu: 4.295 ± 0.315
1.558IleMet: 1.558 ± 0.205
2.905IleAsn: 2.905 ± 0.303
3.158IlePro: 3.158 ± 0.331
2.569IleGln: 2.569 ± 0.254
3.116IleArg: 3.116 ± 0.44
3.369IleSer: 3.369 ± 0.413
3.663IleThr: 3.663 ± 0.508
2.863IleVal: 2.863 ± 0.382
0.968IleTrp: 0.968 ± 0.235
2.105IleTyr: 2.105 ± 0.359
0.0IleXaa: 0.0 ± 0.0
Lys
5.137LysAla: 5.137 ± 0.555
0.379LysCys: 0.379 ± 0.156
3.453LysAsp: 3.453 ± 0.337
4.253LysGlu: 4.253 ± 0.468
1.684LysPhe: 1.684 ± 0.287
3.495LysGly: 3.495 ± 0.441
1.179LysHis: 1.179 ± 0.247
3.032LysIle: 3.032 ± 0.495
4.169LysLys: 4.169 ± 0.616
4.632LysLeu: 4.632 ± 0.521
1.726LysMet: 1.726 ± 0.216
2.611LysAsn: 2.611 ± 0.402
2.4LysPro: 2.4 ± 0.39
2.232LysGln: 2.232 ± 0.256
2.147LysArg: 2.147 ± 0.287
3.326LysSer: 3.326 ± 0.438
3.705LysThr: 3.705 ± 0.493
3.537LysVal: 3.537 ± 0.505
0.758LysTrp: 0.758 ± 0.184
1.432LysTyr: 1.432 ± 0.227
0.0LysXaa: 0.0 ± 0.0
Leu
6.569LeuAla: 6.569 ± 0.83
0.968LeuCys: 0.968 ± 0.283
5.348LeuAsp: 5.348 ± 0.45
5.769LeuGlu: 5.769 ± 0.536
2.484LeuPhe: 2.484 ± 0.348
5.558LeuGly: 5.558 ± 0.495
1.516LeuHis: 1.516 ± 0.274
4.463LeuIle: 4.463 ± 0.396
4.8LeuLys: 4.8 ± 0.567
4.674LeuLeu: 4.674 ± 0.527
2.611LeuMet: 2.611 ± 0.343
5.221LeuAsn: 5.221 ± 0.458
3.537LeuPro: 3.537 ± 0.548
3.579LeuGln: 3.579 ± 0.361
3.958LeuArg: 3.958 ± 0.389
5.011LeuSer: 5.011 ± 0.421
5.053LeuThr: 5.053 ± 0.401
4.632LeuVal: 4.632 ± 0.467
0.968LeuTrp: 0.968 ± 0.297
2.274LeuTyr: 2.274 ± 0.413
0.0LeuXaa: 0.0 ± 0.0
Met
3.242MetAla: 3.242 ± 0.357
0.337MetCys: 0.337 ± 0.123
2.063MetAsp: 2.063 ± 0.27
1.811MetGlu: 1.811 ± 0.291
1.263MetPhe: 1.263 ± 0.208
2.19MetGly: 2.19 ± 0.239
0.211MetHis: 0.211 ± 0.095
1.811MetIle: 1.811 ± 0.33
1.516MetLys: 1.516 ± 0.267
2.569MetLeu: 2.569 ± 0.388
0.716MetMet: 0.716 ± 0.263
1.979MetAsn: 1.979 ± 0.216
1.642MetPro: 1.642 ± 0.277
1.937MetGln: 1.937 ± 0.3
1.558MetArg: 1.558 ± 0.271
1.895MetSer: 1.895 ± 0.234
2.569MetThr: 2.569 ± 0.321
1.811MetVal: 1.811 ± 0.248
0.505MetTrp: 0.505 ± 0.138
0.884MetTyr: 0.884 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
3.579AsnAla: 3.579 ± 0.418
0.421AsnCys: 0.421 ± 0.149
3.369AsnAsp: 3.369 ± 0.366
3.621AsnGlu: 3.621 ± 0.573
1.979AsnPhe: 1.979 ± 0.237
3.032AsnGly: 3.032 ± 0.399
0.968AsnHis: 0.968 ± 0.203
2.526AsnIle: 2.526 ± 0.322
2.442AsnLys: 2.442 ± 0.361
4.674AsnLeu: 4.674 ± 0.553
1.179AsnMet: 1.179 ± 0.214
2.063AsnAsn: 2.063 ± 0.321
3.2AsnPro: 3.2 ± 0.34
3.369AsnGln: 3.369 ± 0.381
2.947AsnArg: 2.947 ± 0.352
3.074AsnSer: 3.074 ± 0.332
3.116AsnThr: 3.116 ± 0.417
3.663AsnVal: 3.663 ± 0.462
0.632AsnTrp: 0.632 ± 0.238
2.19AsnTyr: 2.19 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
3.032ProAla: 3.032 ± 0.332
0.211ProCys: 0.211 ± 0.096
3.874ProAsp: 3.874 ± 0.49
4.126ProGlu: 4.126 ± 0.504
1.895ProPhe: 1.895 ± 0.354
2.274ProGly: 2.274 ± 0.423
0.632ProHis: 0.632 ± 0.196
2.147ProIle: 2.147 ± 0.345
2.526ProLys: 2.526 ± 0.393
2.653ProLeu: 2.653 ± 0.331
1.6ProMet: 1.6 ± 0.253
2.274ProAsn: 2.274 ± 0.279
1.305ProPro: 1.305 ± 0.278
1.937ProGln: 1.937 ± 0.236
1.6ProArg: 1.6 ± 0.311
2.779ProSer: 2.779 ± 0.369
2.863ProThr: 2.863 ± 0.349
3.411ProVal: 3.411 ± 0.388
0.379ProTrp: 0.379 ± 0.13
1.095ProTyr: 1.095 ± 0.238
0.0ProXaa: 0.0 ± 0.0
Gln
3.705GlnAla: 3.705 ± 0.471
0.295GlnCys: 0.295 ± 0.103
2.863GlnAsp: 2.863 ± 0.305
3.537GlnGlu: 3.537 ± 0.415
1.347GlnPhe: 1.347 ± 0.314
3.2GlnGly: 3.2 ± 0.403
0.926GlnHis: 0.926 ± 0.204
3.158GlnIle: 3.158 ± 0.316
2.905GlnLys: 2.905 ± 0.418
3.369GlnLeu: 3.369 ± 0.382
1.895GlnMet: 1.895 ± 0.314
2.147GlnAsn: 2.147 ± 0.289
1.6GlnPro: 1.6 ± 0.262
1.853GlnGln: 1.853 ± 0.331
2.19GlnArg: 2.19 ± 0.366
2.316GlnSer: 2.316 ± 0.313
3.2GlnThr: 3.2 ± 0.402
3.2GlnVal: 3.2 ± 0.393
0.674GlnTrp: 0.674 ± 0.185
1.6GlnTyr: 1.6 ± 0.253
0.0GlnXaa: 0.0 ± 0.0
Arg
3.411ArgAla: 3.411 ± 0.554
0.421ArgCys: 0.421 ± 0.145
2.905ArgAsp: 2.905 ± 0.413
3.284ArgGlu: 3.284 ± 0.504
2.147ArgPhe: 2.147 ± 0.341
3.032ArgGly: 3.032 ± 0.446
0.758ArgHis: 0.758 ± 0.172
2.779ArgIle: 2.779 ± 0.351
2.147ArgLys: 2.147 ± 0.368
4.421ArgLeu: 4.421 ± 0.344
1.937ArgMet: 1.937 ± 0.298
2.737ArgAsn: 2.737 ± 0.398
1.39ArgPro: 1.39 ± 0.228
2.779ArgGln: 2.779 ± 0.407
2.358ArgArg: 2.358 ± 0.319
2.653ArgSer: 2.653 ± 0.337
2.611ArgThr: 2.611 ± 0.297
2.611ArgVal: 2.611 ± 0.346
0.716ArgTrp: 0.716 ± 0.163
1.516ArgTyr: 1.516 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
4.59SerAla: 4.59 ± 0.473
0.337SerCys: 0.337 ± 0.12
4.042SerAsp: 4.042 ± 0.395
3.874SerGlu: 3.874 ± 0.313
2.147SerPhe: 2.147 ± 0.32
5.558SerGly: 5.558 ± 0.551
0.926SerHis: 0.926 ± 0.253
3.2SerIle: 3.2 ± 0.35
3.326SerLys: 3.326 ± 0.38
4.0SerLeu: 4.0 ± 0.377
2.147SerMet: 2.147 ± 0.309
3.032SerAsn: 3.032 ± 0.354
2.4SerPro: 2.4 ± 0.313
2.021SerGln: 2.021 ± 0.28
2.695SerArg: 2.695 ± 0.304
3.074SerSer: 3.074 ± 0.351
4.042SerThr: 4.042 ± 0.37
3.874SerVal: 3.874 ± 0.341
0.505SerTrp: 0.505 ± 0.133
1.726SerTyr: 1.726 ± 0.373
0.0SerXaa: 0.0 ± 0.0
Thr
6.021ThrAla: 6.021 ± 0.782
0.295ThrCys: 0.295 ± 0.131
4.169ThrAsp: 4.169 ± 0.369
3.453ThrGlu: 3.453 ± 0.349
2.611ThrPhe: 2.611 ± 0.468
6.106ThrGly: 6.106 ± 0.573
1.39ThrHis: 1.39 ± 0.306
3.874ThrIle: 3.874 ± 0.336
2.905ThrLys: 2.905 ± 0.481
4.842ThrLeu: 4.842 ± 0.544
1.811ThrMet: 1.811 ± 0.261
3.032ThrAsn: 3.032 ± 0.287
3.284ThrPro: 3.284 ± 0.33
2.905ThrGln: 2.905 ± 0.427
2.695ThrArg: 2.695 ± 0.346
3.621ThrSer: 3.621 ± 0.418
3.958ThrThr: 3.958 ± 0.516
4.674ThrVal: 4.674 ± 0.555
0.758ThrTrp: 0.758 ± 0.162
2.021ThrTyr: 2.021 ± 0.456
0.0ThrXaa: 0.0 ± 0.0
Val
5.516ValAla: 5.516 ± 0.747
0.421ValCys: 0.421 ± 0.161
4.126ValAsp: 4.126 ± 0.455
4.884ValGlu: 4.884 ± 0.487
2.105ValPhe: 2.105 ± 0.318
4.379ValGly: 4.379 ± 0.499
1.305ValHis: 1.305 ± 0.267
3.242ValIle: 3.242 ± 0.384
3.369ValLys: 3.369 ± 0.397
4.842ValLeu: 4.842 ± 0.397
2.4ValMet: 2.4 ± 0.417
3.832ValAsn: 3.832 ± 0.474
2.863ValPro: 2.863 ± 0.309
2.021ValGln: 2.021 ± 0.247
3.453ValArg: 3.453 ± 0.395
4.126ValSer: 4.126 ± 0.423
4.379ValThr: 4.379 ± 0.466
3.579ValVal: 3.579 ± 0.46
0.926ValTrp: 0.926 ± 0.197
2.063ValTyr: 2.063 ± 0.351
0.0ValXaa: 0.0 ± 0.0
Trp
0.968TrpAla: 0.968 ± 0.245
0.126TrpCys: 0.126 ± 0.067
1.305TrpAsp: 1.305 ± 0.264
1.179TrpGlu: 1.179 ± 0.229
0.632TrpPhe: 0.632 ± 0.152
1.053TrpGly: 1.053 ± 0.232
0.505TrpHis: 0.505 ± 0.208
0.674TrpIle: 0.674 ± 0.15
0.8TrpLys: 0.8 ± 0.185
1.053TrpLeu: 1.053 ± 0.213
0.295TrpMet: 0.295 ± 0.122
0.674TrpAsn: 0.674 ± 0.175
0.295TrpPro: 0.295 ± 0.109
0.632TrpGln: 0.632 ± 0.155
0.589TrpArg: 0.589 ± 0.131
0.8TrpSer: 0.8 ± 0.192
0.842TrpThr: 0.842 ± 0.19
0.758TrpVal: 0.758 ± 0.146
0.211TrpTrp: 0.211 ± 0.13
0.505TrpTyr: 0.505 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.737TyrAla: 2.737 ± 0.313
0.337TyrCys: 0.337 ± 0.107
3.074TyrAsp: 3.074 ± 0.47
2.232TyrGlu: 2.232 ± 0.329
1.095TyrPhe: 1.095 ± 0.229
2.021TyrGly: 2.021 ± 0.279
0.674TyrHis: 0.674 ± 0.199
1.937TyrIle: 1.937 ± 0.426
0.926TyrLys: 0.926 ± 0.203
2.484TyrLeu: 2.484 ± 0.394
0.589TyrMet: 0.589 ± 0.152
1.726TyrAsn: 1.726 ± 0.234
1.095TyrPro: 1.095 ± 0.183
1.937TyrGln: 1.937 ± 0.335
1.642TyrArg: 1.642 ± 0.294
1.558TyrSer: 1.558 ± 0.268
1.937TyrThr: 1.937 ± 0.318
2.4TyrVal: 2.4 ± 0.301
0.884TyrTrp: 0.884 ± 0.219
1.137TyrTyr: 1.137 ± 0.223
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (23750 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski