Amino acid dipepetide frequency for Roseobacter phage RDJL Phi 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.592AlaAla: 11.592 ± 1.623
1.104AlaCys: 1.104 ± 0.285
6.348AlaAsp: 6.348 ± 0.618
8.114AlaGlu: 8.114 ± 0.694
3.312AlaPhe: 3.312 ± 0.418
8.39AlaGly: 8.39 ± 0.728
1.601AlaHis: 1.601 ± 0.322
5.686AlaIle: 5.686 ± 0.602
5.134AlaLys: 5.134 ± 0.912
9.66AlaLeu: 9.66 ± 1.362
3.422AlaMet: 3.422 ± 0.579
3.533AlaAsn: 3.533 ± 0.551
3.864AlaPro: 3.864 ± 0.516
3.036AlaGln: 3.036 ± 0.634
5.906AlaArg: 5.906 ± 0.714
5.741AlaSer: 5.741 ± 0.733
4.968AlaThr: 4.968 ± 0.717
5.52AlaVal: 5.52 ± 0.594
1.987AlaTrp: 1.987 ± 0.422
2.484AlaTyr: 2.484 ± 0.344
0.0AlaXaa: 0.0 ± 0.0
Cys
0.938CysAla: 0.938 ± 0.246
0.055CysCys: 0.055 ± 0.068
0.828CysAsp: 0.828 ± 0.239
0.828CysGlu: 0.828 ± 0.314
0.331CysPhe: 0.331 ± 0.117
0.828CysGly: 0.828 ± 0.221
0.331CysHis: 0.331 ± 0.136
0.276CysIle: 0.276 ± 0.103
0.552CysLys: 0.552 ± 0.177
0.718CysLeu: 0.718 ± 0.24
0.221CysMet: 0.221 ± 0.127
0.386CysAsn: 0.386 ± 0.145
0.718CysPro: 0.718 ± 0.249
0.276CysGln: 0.276 ± 0.139
0.607CysArg: 0.607 ± 0.227
0.662CysSer: 0.662 ± 0.233
0.386CysThr: 0.386 ± 0.122
0.276CysVal: 0.276 ± 0.141
0.221CysTrp: 0.221 ± 0.106
0.276CysTyr: 0.276 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
6.9AspAla: 6.9 ± 0.821
0.718AspCys: 0.718 ± 0.223
4.747AspAsp: 4.747 ± 0.635
6.514AspGlu: 6.514 ± 0.94
3.312AspPhe: 3.312 ± 0.397
6.79AspGly: 6.79 ± 0.852
1.214AspHis: 1.214 ± 0.291
2.539AspIle: 2.539 ± 0.441
2.981AspLys: 2.981 ± 0.407
6.955AspLeu: 6.955 ± 0.656
1.877AspMet: 1.877 ± 0.331
2.318AspAsn: 2.318 ± 0.299
4.03AspPro: 4.03 ± 0.489
2.65AspGln: 2.65 ± 0.575
3.202AspArg: 3.202 ± 0.491
2.208AspSer: 2.208 ± 0.438
4.526AspThr: 4.526 ± 0.421
5.189AspVal: 5.189 ± 0.56
1.711AspTrp: 1.711 ± 0.3
1.822AspTyr: 1.822 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
8.998GluAla: 8.998 ± 0.847
0.828GluCys: 0.828 ± 0.234
4.968GluAsp: 4.968 ± 0.579
6.79GluGlu: 6.79 ± 0.85
2.926GluPhe: 2.926 ± 0.428
5.41GluGly: 5.41 ± 0.557
1.435GluHis: 1.435 ± 0.285
2.815GluIle: 2.815 ± 0.463
2.981GluLys: 2.981 ± 0.49
7.783GluLeu: 7.783 ± 0.641
2.263GluMet: 2.263 ± 0.362
1.822GluAsn: 1.822 ± 0.296
3.091GluPro: 3.091 ± 0.608
2.815GluGln: 2.815 ± 0.373
4.416GluArg: 4.416 ± 0.694
1.987GluSer: 1.987 ± 0.367
4.582GluThr: 4.582 ± 0.522
5.244GluVal: 5.244 ± 0.487
1.38GluTrp: 1.38 ± 0.32
1.711GluTyr: 1.711 ± 0.33
0.0GluXaa: 0.0 ± 0.0
Phe
2.87PheAla: 2.87 ± 0.449
0.442PheCys: 0.442 ± 0.166
3.367PheAsp: 3.367 ± 0.368
3.533PheGlu: 3.533 ± 0.441
0.938PhePhe: 0.938 ± 0.209
2.76PheGly: 2.76 ± 0.388
0.552PheHis: 0.552 ± 0.182
2.484PheIle: 2.484 ± 0.365
2.042PheLys: 2.042 ± 0.247
2.318PheLeu: 2.318 ± 0.424
0.883PheMet: 0.883 ± 0.216
1.38PheAsn: 1.38 ± 0.267
1.711PhePro: 1.711 ± 0.291
1.49PheGln: 1.49 ± 0.306
2.098PheArg: 2.098 ± 0.355
2.153PheSer: 2.153 ± 0.442
2.484PheThr: 2.484 ± 0.463
1.546PheVal: 1.546 ± 0.325
0.552PheTrp: 0.552 ± 0.183
0.883PheTyr: 0.883 ± 0.232
0.0PheXaa: 0.0 ± 0.0
Gly
6.679GlyAla: 6.679 ± 1.08
0.883GlyCys: 0.883 ± 0.266
4.858GlyAsp: 4.858 ± 0.501
4.968GlyGlu: 4.968 ± 0.566
3.809GlyPhe: 3.809 ± 0.471
7.066GlyGly: 7.066 ± 0.728
1.27GlyHis: 1.27 ± 0.301
3.367GlyIle: 3.367 ± 0.392
3.974GlyLys: 3.974 ± 0.459
6.514GlyLeu: 6.514 ± 0.526
2.153GlyMet: 2.153 ± 0.347
2.65GlyAsn: 2.65 ± 0.43
2.705GlyPro: 2.705 ± 0.379
2.65GlyGln: 2.65 ± 0.398
5.851GlyArg: 5.851 ± 0.534
5.906GlySer: 5.906 ± 0.836
5.686GlyThr: 5.686 ± 0.63
5.686GlyVal: 5.686 ± 0.568
1.27GlyTrp: 1.27 ± 0.274
1.987GlyTyr: 1.987 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
1.766HisAla: 1.766 ± 0.318
0.497HisCys: 0.497 ± 0.141
1.435HisAsp: 1.435 ± 0.274
1.325HisGlu: 1.325 ± 0.268
0.442HisPhe: 0.442 ± 0.151
1.38HisGly: 1.38 ± 0.328
0.883HisHis: 0.883 ± 0.229
0.994HisIle: 0.994 ± 0.209
0.662HisLys: 0.662 ± 0.205
1.546HisLeu: 1.546 ± 0.282
0.442HisMet: 0.442 ± 0.145
0.497HisAsn: 0.497 ± 0.187
1.159HisPro: 1.159 ± 0.266
0.552HisGln: 0.552 ± 0.226
1.27HisArg: 1.27 ± 0.23
0.994HisSer: 0.994 ± 0.217
0.883HisThr: 0.883 ± 0.232
0.994HisVal: 0.994 ± 0.254
0.386HisTrp: 0.386 ± 0.159
0.442HisTyr: 0.442 ± 0.156
0.0HisXaa: 0.0 ± 0.0
Ile
4.913IleAla: 4.913 ± 0.692
0.331IleCys: 0.331 ± 0.129
3.864IleAsp: 3.864 ± 0.501
3.974IleGlu: 3.974 ± 0.525
1.214IlePhe: 1.214 ± 0.214
3.257IleGly: 3.257 ± 0.436
0.662IleHis: 0.662 ± 0.194
1.38IleIle: 1.38 ± 0.239
2.042IleLys: 2.042 ± 0.366
3.864IleLeu: 3.864 ± 0.527
1.049IleMet: 1.049 ± 0.246
1.656IleAsn: 1.656 ± 0.32
2.042IlePro: 2.042 ± 0.372
1.932IleGln: 1.932 ± 0.368
3.312IleArg: 3.312 ± 0.474
2.042IleSer: 2.042 ± 0.374
2.981IleThr: 2.981 ± 0.392
2.981IleVal: 2.981 ± 0.404
1.159IleTrp: 1.159 ± 0.242
0.994IleTyr: 0.994 ± 0.256
0.0IleXaa: 0.0 ± 0.0
Lys
6.569LysAla: 6.569 ± 0.859
0.276LysCys: 0.276 ± 0.106
3.202LysAsp: 3.202 ± 0.527
2.705LysGlu: 2.705 ± 0.428
1.325LysPhe: 1.325 ± 0.238
3.257LysGly: 3.257 ± 0.523
0.773LysHis: 0.773 ± 0.197
1.325LysIle: 1.325 ± 0.274
2.208LysLys: 2.208 ± 0.467
3.698LysLeu: 3.698 ± 0.51
1.435LysMet: 1.435 ± 0.32
1.711LysAsn: 1.711 ± 0.345
2.429LysPro: 2.429 ± 0.45
2.153LysGln: 2.153 ± 0.355
3.367LysArg: 3.367 ± 0.481
2.76LysSer: 2.76 ± 0.477
3.146LysThr: 3.146 ± 0.441
3.478LysVal: 3.478 ± 0.491
1.049LysTrp: 1.049 ± 0.284
0.773LysTyr: 0.773 ± 0.207
0.0LysXaa: 0.0 ± 0.0
Leu
8.832LeuAla: 8.832 ± 1.191
0.607LeuCys: 0.607 ± 0.185
5.741LeuAsp: 5.741 ± 0.534
5.575LeuGlu: 5.575 ± 0.49
2.208LeuPhe: 2.208 ± 0.287
7.286LeuGly: 7.286 ± 0.769
1.932LeuHis: 1.932 ± 0.445
3.698LeuIle: 3.698 ± 0.503
3.974LeuLys: 3.974 ± 0.636
6.403LeuLeu: 6.403 ± 0.792
2.042LeuMet: 2.042 ± 0.373
3.202LeuAsn: 3.202 ± 0.43
3.864LeuPro: 3.864 ± 0.566
2.65LeuGln: 2.65 ± 0.446
5.741LeuArg: 5.741 ± 0.574
4.306LeuSer: 4.306 ± 0.486
7.286LeuThr: 7.286 ± 0.781
5.299LeuVal: 5.299 ± 0.536
1.049LeuTrp: 1.049 ± 0.262
1.932LeuTyr: 1.932 ± 0.323
0.0LeuXaa: 0.0 ± 0.0
Met
2.87MetAla: 2.87 ± 0.441
0.166MetCys: 0.166 ± 0.132
2.153MetAsp: 2.153 ± 0.347
1.601MetGlu: 1.601 ± 0.379
0.386MetPhe: 0.386 ± 0.15
1.877MetGly: 1.877 ± 0.382
0.442MetHis: 0.442 ± 0.174
1.435MetIle: 1.435 ± 0.216
1.38MetLys: 1.38 ± 0.244
1.601MetLeu: 1.601 ± 0.281
0.607MetMet: 0.607 ± 0.191
0.994MetAsn: 0.994 ± 0.233
1.27MetPro: 1.27 ± 0.218
1.049MetGln: 1.049 ± 0.224
1.601MetArg: 1.601 ± 0.347
1.932MetSer: 1.932 ± 0.392
2.153MetThr: 2.153 ± 0.433
1.159MetVal: 1.159 ± 0.233
0.331MetTrp: 0.331 ± 0.133
0.607MetTyr: 0.607 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
3.422AsnAla: 3.422 ± 0.468
0.221AsnCys: 0.221 ± 0.103
2.374AsnAsp: 2.374 ± 0.416
2.208AsnGlu: 2.208 ± 0.328
1.601AsnPhe: 1.601 ± 0.36
2.76AsnGly: 2.76 ± 0.327
0.828AsnHis: 0.828 ± 0.225
1.711AsnIle: 1.711 ± 0.485
1.546AsnLys: 1.546 ± 0.292
2.705AsnLeu: 2.705 ± 0.363
0.883AsnMet: 0.883 ± 0.211
1.435AsnAsn: 1.435 ± 0.392
2.098AsnPro: 2.098 ± 0.357
1.214AsnGln: 1.214 ± 0.321
2.484AsnArg: 2.484 ± 0.395
1.766AsnSer: 1.766 ± 0.292
1.877AsnThr: 1.877 ± 0.381
1.711AsnVal: 1.711 ± 0.288
0.994AsnTrp: 0.994 ± 0.231
0.994AsnTyr: 0.994 ± 0.233
0.0AsnXaa: 0.0 ± 0.0
Pro
4.03ProAla: 4.03 ± 0.458
0.552ProCys: 0.552 ± 0.22
4.471ProAsp: 4.471 ± 0.55
4.085ProGlu: 4.085 ± 0.59
1.38ProPhe: 1.38 ± 0.288
3.588ProGly: 3.588 ± 0.424
0.662ProHis: 0.662 ± 0.254
2.153ProIle: 2.153 ± 0.389
2.374ProLys: 2.374 ± 0.34
2.926ProLeu: 2.926 ± 0.455
0.718ProMet: 0.718 ± 0.212
1.656ProAsn: 1.656 ± 0.297
1.711ProPro: 1.711 ± 0.385
1.159ProGln: 1.159 ± 0.306
2.374ProArg: 2.374 ± 0.397
3.146ProSer: 3.146 ± 0.716
3.146ProThr: 3.146 ± 0.429
2.981ProVal: 2.981 ± 0.389
0.883ProTrp: 0.883 ± 0.205
1.159ProTyr: 1.159 ± 0.279
0.0ProXaa: 0.0 ± 0.0
Gln
4.306GlnAla: 4.306 ± 0.605
0.331GlnCys: 0.331 ± 0.14
1.766GlnAsp: 1.766 ± 0.293
2.429GlnGlu: 2.429 ± 0.377
1.601GlnPhe: 1.601 ± 0.402
1.932GlnGly: 1.932 ± 0.335
0.497GlnHis: 0.497 ± 0.195
1.822GlnIle: 1.822 ± 0.323
1.822GlnLys: 1.822 ± 0.258
3.698GlnLeu: 3.698 ± 0.473
0.607GlnMet: 0.607 ± 0.174
1.049GlnAsn: 1.049 ± 0.228
1.38GlnPro: 1.38 ± 0.304
1.49GlnGln: 1.49 ± 0.284
1.711GlnArg: 1.711 ± 0.295
1.766GlnSer: 1.766 ± 0.295
1.932GlnThr: 1.932 ± 0.286
2.981GlnVal: 2.981 ± 0.373
0.442GlnTrp: 0.442 ± 0.152
0.552GlnTyr: 0.552 ± 0.168
0.0GlnXaa: 0.0 ± 0.0
Arg
5.63ArgAla: 5.63 ± 0.732
0.497ArgCys: 0.497 ± 0.193
4.526ArgAsp: 4.526 ± 0.561
3.974ArgGlu: 3.974 ± 0.425
3.146ArgPhe: 3.146 ± 0.427
4.306ArgGly: 4.306 ± 0.468
1.27ArgHis: 1.27 ± 0.27
3.091ArgIle: 3.091 ± 0.403
3.257ArgLys: 3.257 ± 0.678
5.244ArgLeu: 5.244 ± 0.636
1.656ArgMet: 1.656 ± 0.423
2.042ArgAsn: 2.042 ± 0.334
2.208ArgPro: 2.208 ± 0.303
2.208ArgGln: 2.208 ± 0.272
4.306ArgArg: 4.306 ± 0.603
3.643ArgSer: 3.643 ± 0.481
3.754ArgThr: 3.754 ± 0.467
3.422ArgVal: 3.422 ± 0.458
1.325ArgTrp: 1.325 ± 0.343
2.318ArgTyr: 2.318 ± 0.456
0.0ArgXaa: 0.0 ± 0.0
Ser
5.023SerAla: 5.023 ± 0.553
0.442SerCys: 0.442 ± 0.162
3.864SerAsp: 3.864 ± 0.68
2.594SerGlu: 2.594 ± 0.407
2.263SerPhe: 2.263 ± 0.33
4.968SerGly: 4.968 ± 0.518
1.049SerHis: 1.049 ± 0.238
3.091SerIle: 3.091 ± 0.407
2.87SerLys: 2.87 ± 0.429
4.195SerLeu: 4.195 ± 0.427
1.159SerMet: 1.159 ± 0.216
1.766SerAsn: 1.766 ± 0.378
2.594SerPro: 2.594 ± 0.394
1.601SerGln: 1.601 ± 0.402
2.87SerArg: 2.87 ± 0.419
3.698SerSer: 3.698 ± 0.686
3.036SerThr: 3.036 ± 0.622
4.637SerVal: 4.637 ± 0.646
1.38SerTrp: 1.38 ± 0.296
1.822SerTyr: 1.822 ± 0.315
0.0SerXaa: 0.0 ± 0.0
Thr
6.458ThrAla: 6.458 ± 0.89
0.607ThrCys: 0.607 ± 0.169
4.14ThrAsp: 4.14 ± 0.596
4.747ThrGlu: 4.747 ± 0.419
2.042ThrPhe: 2.042 ± 0.36
6.127ThrGly: 6.127 ± 0.787
0.994ThrHis: 0.994 ± 0.302
4.03ThrIle: 4.03 ± 0.583
2.594ThrLys: 2.594 ± 0.332
5.189ThrLeu: 5.189 ± 0.623
1.214ThrMet: 1.214 ± 0.22
2.318ThrAsn: 2.318 ± 0.402
3.643ThrPro: 3.643 ± 0.449
1.877ThrGln: 1.877 ± 0.323
3.257ThrArg: 3.257 ± 0.44
3.588ThrSer: 3.588 ± 0.545
4.913ThrThr: 4.913 ± 0.912
4.802ThrVal: 4.802 ± 0.645
1.104ThrTrp: 1.104 ± 0.294
1.932ThrTyr: 1.932 ± 0.26
0.0ThrXaa: 0.0 ± 0.0
Val
5.354ValAla: 5.354 ± 0.667
0.276ValCys: 0.276 ± 0.14
5.686ValAsp: 5.686 ± 0.478
5.078ValGlu: 5.078 ± 0.574
2.76ValPhe: 2.76 ± 0.3
4.526ValGly: 4.526 ± 0.517
0.938ValHis: 0.938 ± 0.258
2.208ValIle: 2.208 ± 0.332
3.091ValLys: 3.091 ± 0.449
5.63ValLeu: 5.63 ± 0.478
1.325ValMet: 1.325 ± 0.269
2.815ValAsn: 2.815 ± 0.494
2.76ValPro: 2.76 ± 0.482
2.153ValGln: 2.153 ± 0.402
3.974ValArg: 3.974 ± 0.421
3.919ValSer: 3.919 ± 0.536
5.134ValThr: 5.134 ± 0.613
4.085ValVal: 4.085 ± 0.787
0.938ValTrp: 0.938 ± 0.237
1.38ValTyr: 1.38 ± 0.253
0.0ValXaa: 0.0 ± 0.0
Trp
1.766TrpAla: 1.766 ± 0.312
0.331TrpCys: 0.331 ± 0.134
1.766TrpAsp: 1.766 ± 0.318
1.104TrpGlu: 1.104 ± 0.212
0.773TrpPhe: 0.773 ± 0.245
1.435TrpGly: 1.435 ± 0.349
0.552TrpHis: 0.552 ± 0.172
0.442TrpIle: 0.442 ± 0.163
1.104TrpLys: 1.104 ± 0.273
1.546TrpLeu: 1.546 ± 0.367
0.497TrpMet: 0.497 ± 0.213
0.662TrpAsn: 0.662 ± 0.172
0.607TrpPro: 0.607 ± 0.188
0.607TrpGln: 0.607 ± 0.149
1.214TrpArg: 1.214 ± 0.288
1.104TrpSer: 1.104 ± 0.311
1.214TrpThr: 1.214 ± 0.267
1.27TrpVal: 1.27 ± 0.282
0.552TrpTrp: 0.552 ± 0.179
0.552TrpTyr: 0.552 ± 0.173
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.484TyrAla: 2.484 ± 0.48
0.552TyrCys: 0.552 ± 0.209
2.098TyrAsp: 2.098 ± 0.37
2.098TyrGlu: 2.098 ± 0.363
0.718TyrPhe: 0.718 ± 0.281
1.987TyrGly: 1.987 ± 0.469
0.662TyrHis: 0.662 ± 0.225
1.104TyrIle: 1.104 ± 0.261
1.049TyrLys: 1.049 ± 0.241
1.49TyrLeu: 1.49 ± 0.268
1.104TyrMet: 1.104 ± 0.268
0.938TyrAsn: 0.938 ± 0.211
1.27TyrPro: 1.27 ± 0.283
0.552TyrGln: 0.552 ± 0.179
2.263TyrArg: 2.263 ± 0.4
1.601TyrSer: 1.601 ± 0.267
1.49TyrThr: 1.49 ± 0.27
0.773TyrVal: 0.773 ± 0.188
0.386TyrTrp: 0.386 ± 0.135
0.773TyrTyr: 0.773 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 76 proteins (18117 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski