Amino acid dipepetide frequency for Ralstonia phage Cimandef

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.957AlaAla: 20.957 ± 1.357
1.097AlaCys: 1.097 ± 0.37
6.582AlaAsp: 6.582 ± 0.468
8.198AlaGlu: 8.198 ± 0.944
3.579AlaPhe: 3.579 ± 0.448
11.72AlaGly: 11.72 ± 1.272
2.021AlaHis: 2.021 ± 0.289
5.311AlaIle: 5.311 ± 0.643
6.177AlaLys: 6.177 ± 0.732
9.988AlaLeu: 9.988 ± 0.692
3.753AlaMet: 3.753 ± 0.392
4.503AlaAsn: 4.503 ± 0.547
7.736AlaPro: 7.736 ± 1.002
6.524AlaGln: 6.524 ± 0.619
7.39AlaArg: 7.39 ± 0.784
7.101AlaSer: 7.101 ± 0.809
5.6AlaThr: 5.6 ± 0.515
7.505AlaVal: 7.505 ± 0.581
1.559AlaTrp: 1.559 ± 0.284
3.349AlaTyr: 3.349 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.404CysAla: 0.404 ± 0.175
0.289CysCys: 0.289 ± 0.144
0.52CysAsp: 0.52 ± 0.302
0.289CysGlu: 0.289 ± 0.155
0.231CysPhe: 0.231 ± 0.197
0.635CysGly: 0.635 ± 0.277
0.289CysHis: 0.289 ± 0.165
0.346CysIle: 0.346 ± 0.149
0.404CysLys: 0.404 ± 0.189
0.635CysLeu: 0.635 ± 0.271
0.346CysMet: 0.346 ± 0.184
0.231CysAsn: 0.231 ± 0.132
0.231CysPro: 0.231 ± 0.141
0.058CysGln: 0.058 ± 0.068
0.808CysArg: 0.808 ± 0.37
0.693CysSer: 0.693 ± 0.287
0.173CysThr: 0.173 ± 0.138
0.52CysVal: 0.52 ± 0.207
0.115CysTrp: 0.115 ± 0.075
0.404CysTyr: 0.404 ± 0.276
0.0CysXaa: 0.0 ± 0.0
Asp
8.314AspAla: 8.314 ± 0.826
0.346AspCys: 0.346 ± 0.207
3.579AspAsp: 3.579 ± 0.718
3.984AspGlu: 3.984 ± 0.508
1.963AspPhe: 1.963 ± 0.319
7.39AspGly: 7.39 ± 0.865
1.328AspHis: 1.328 ± 0.21
2.829AspIle: 2.829 ± 0.372
2.021AspLys: 2.021 ± 0.304
4.099AspLeu: 4.099 ± 0.5
1.559AspMet: 1.559 ± 0.231
2.136AspAsn: 2.136 ± 0.358
2.771AspPro: 2.771 ± 0.335
1.674AspGln: 1.674 ± 0.293
4.33AspArg: 4.33 ± 0.434
2.598AspSer: 2.598 ± 0.359
3.002AspThr: 3.002 ± 0.357
4.272AspVal: 4.272 ± 0.505
0.924AspTrp: 0.924 ± 0.224
2.194AspTyr: 2.194 ± 0.327
0.0AspXaa: 0.0 ± 0.0
Glu
7.217GluAla: 7.217 ± 0.955
0.577GluCys: 0.577 ± 0.229
3.637GluAsp: 3.637 ± 0.462
3.984GluGlu: 3.984 ± 0.606
3.06GluPhe: 3.06 ± 0.493
4.445GluGly: 4.445 ± 0.692
1.501GluHis: 1.501 ± 0.311
3.291GluIle: 3.291 ± 0.318
2.829GluLys: 2.829 ± 0.725
5.485GluLeu: 5.485 ± 0.441
1.443GluMet: 1.443 ± 0.214
2.367GluAsn: 2.367 ± 0.359
3.002GluPro: 3.002 ± 0.662
3.753GluGln: 3.753 ± 0.386
6.582GluArg: 6.582 ± 0.806
2.829GluSer: 2.829 ± 0.426
2.54GluThr: 2.54 ± 0.369
3.002GluVal: 3.002 ± 0.474
0.866GluTrp: 0.866 ± 0.199
1.79GluTyr: 1.79 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
3.291PheAla: 3.291 ± 0.373
0.231PheCys: 0.231 ± 0.131
3.233PheAsp: 3.233 ± 0.364
1.79PheGlu: 1.79 ± 0.295
0.751PhePhe: 0.751 ± 0.263
2.829PheGly: 2.829 ± 0.37
0.52PheHis: 0.52 ± 0.163
1.617PheIle: 1.617 ± 0.429
1.386PheLys: 1.386 ± 0.275
1.963PheLeu: 1.963 ± 0.338
1.039PheMet: 1.039 ± 0.217
1.212PheAsn: 1.212 ± 0.243
1.097PhePro: 1.097 ± 0.217
1.039PheGln: 1.039 ± 0.199
1.79PheArg: 1.79 ± 0.399
2.194PheSer: 2.194 ± 0.371
1.27PheThr: 1.27 ± 0.245
2.598PheVal: 2.598 ± 0.382
0.289PheTrp: 0.289 ± 0.119
1.212PheTyr: 1.212 ± 0.197
0.0PheXaa: 0.0 ± 0.0
Gly
12.066GlyAla: 12.066 ± 1.314
0.52GlyCys: 0.52 ± 0.235
5.889GlyAsp: 5.889 ± 0.645
5.773GlyGlu: 5.773 ± 0.672
2.425GlyPhe: 2.425 ± 0.379
8.949GlyGly: 8.949 ± 1.309
1.617GlyHis: 1.617 ± 0.338
3.233GlyIle: 3.233 ± 0.326
5.023GlyLys: 5.023 ± 0.582
5.081GlyLeu: 5.081 ± 0.576
2.252GlyMet: 2.252 ± 0.47
3.233GlyAsn: 3.233 ± 0.694
2.021GlyPro: 2.021 ± 0.304
3.175GlyGln: 3.175 ± 0.496
6.062GlyArg: 6.062 ± 0.682
4.561GlySer: 4.561 ± 0.676
5.831GlyThr: 5.831 ± 0.732
5.658GlyVal: 5.658 ± 0.597
1.097GlyTrp: 1.097 ± 0.258
1.963GlyTyr: 1.963 ± 0.314
0.0GlyXaa: 0.0 ± 0.0
His
2.367HisAla: 2.367 ± 0.339
0.231HisCys: 0.231 ± 0.136
1.27HisAsp: 1.27 ± 0.268
1.617HisGlu: 1.617 ± 0.296
0.866HisPhe: 0.866 ± 0.214
1.27HisGly: 1.27 ± 0.25
0.52HisHis: 0.52 ± 0.231
0.693HisIle: 0.693 ± 0.195
0.808HisLys: 0.808 ± 0.25
0.866HisLeu: 0.866 ± 0.206
0.577HisMet: 0.577 ± 0.187
0.52HisAsn: 0.52 ± 0.195
0.635HisPro: 0.635 ± 0.155
0.52HisGln: 0.52 ± 0.139
1.559HisArg: 1.559 ± 0.297
0.924HisSer: 0.924 ± 0.23
0.693HisThr: 0.693 ± 0.179
1.155HisVal: 1.155 ± 0.27
0.173HisTrp: 0.173 ± 0.125
0.577HisTyr: 0.577 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
6.177IleAla: 6.177 ± 0.594
0.231IleCys: 0.231 ± 0.169
2.656IleAsp: 2.656 ± 0.321
4.503IleGlu: 4.503 ± 0.507
0.981IlePhe: 0.981 ± 0.222
3.926IleGly: 3.926 ± 0.437
0.52IleHis: 0.52 ± 0.148
1.559IleIle: 1.559 ± 0.308
2.021IleLys: 2.021 ± 0.351
2.078IleLeu: 2.078 ± 0.305
1.155IleMet: 1.155 ± 0.347
2.021IleAsn: 2.021 ± 0.289
1.847IlePro: 1.847 ± 0.271
1.905IleGln: 1.905 ± 0.34
3.175IleArg: 3.175 ± 0.488
2.309IleSer: 2.309 ± 0.367
2.425IleThr: 2.425 ± 0.376
3.406IleVal: 3.406 ± 0.463
0.231IleTrp: 0.231 ± 0.087
1.097IleTyr: 1.097 ± 0.283
0.0IleXaa: 0.0 ± 0.0
Lys
7.217LysAla: 7.217 ± 0.539
0.404LysCys: 0.404 ± 0.224
2.944LysAsp: 2.944 ± 0.382
2.887LysGlu: 2.887 ± 0.746
1.212LysPhe: 1.212 ± 0.282
3.291LysGly: 3.291 ± 0.586
0.462LysHis: 0.462 ± 0.186
1.963LysIle: 1.963 ± 0.369
2.367LysLys: 2.367 ± 0.35
3.868LysLeu: 3.868 ± 0.53
1.212LysMet: 1.212 ± 0.343
1.386LysAsn: 1.386 ± 0.308
2.713LysPro: 2.713 ± 0.436
2.829LysGln: 2.829 ± 0.428
3.81LysArg: 3.81 ± 0.561
2.54LysSer: 2.54 ± 0.394
2.887LysThr: 2.887 ± 0.419
3.002LysVal: 3.002 ± 0.381
0.866LysTrp: 0.866 ± 0.245
0.924LysTyr: 0.924 ± 0.18
0.0LysXaa: 0.0 ± 0.0
Leu
6.697LeuAla: 6.697 ± 0.54
0.404LeuCys: 0.404 ± 0.172
5.658LeuAsp: 5.658 ± 0.605
3.637LeuGlu: 3.637 ± 0.589
2.021LeuPhe: 2.021 ± 0.278
6.12LeuGly: 6.12 ± 0.584
1.212LeuHis: 1.212 ± 0.208
3.579LeuIle: 3.579 ± 0.564
4.388LeuLys: 4.388 ± 0.638
3.984LeuLeu: 3.984 ± 0.564
1.79LeuMet: 1.79 ± 0.339
2.367LeuAsn: 2.367 ± 0.329
4.215LeuPro: 4.215 ± 0.803
2.194LeuGln: 2.194 ± 0.318
5.138LeuArg: 5.138 ± 0.599
4.503LeuSer: 4.503 ± 0.404
4.445LeuThr: 4.445 ± 0.436
3.926LeuVal: 3.926 ± 0.612
0.52LeuTrp: 0.52 ± 0.213
1.674LeuTyr: 1.674 ± 0.282
0.0LeuXaa: 0.0 ± 0.0
Met
2.656MetAla: 2.656 ± 0.364
0.173MetCys: 0.173 ± 0.131
1.386MetAsp: 1.386 ± 0.296
1.039MetGlu: 1.039 ± 0.22
0.866MetPhe: 0.866 ± 0.217
2.194MetGly: 2.194 ± 0.378
0.404MetHis: 0.404 ± 0.186
0.866MetIle: 0.866 ± 0.201
2.078MetLys: 2.078 ± 0.328
1.905MetLeu: 1.905 ± 0.293
0.866MetMet: 0.866 ± 0.175
1.155MetAsn: 1.155 ± 0.267
1.847MetPro: 1.847 ± 0.275
1.386MetGln: 1.386 ± 0.371
2.194MetArg: 2.194 ± 0.421
2.136MetSer: 2.136 ± 0.367
1.905MetThr: 1.905 ± 0.284
1.27MetVal: 1.27 ± 0.284
0.462MetTrp: 0.462 ± 0.167
0.52MetTyr: 0.52 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
4.099AsnAla: 4.099 ± 0.499
0.289AsnCys: 0.289 ± 0.179
2.309AsnAsp: 2.309 ± 0.298
2.194AsnGlu: 2.194 ± 0.367
1.212AsnPhe: 1.212 ± 0.285
4.099AsnGly: 4.099 ± 0.599
0.808AsnHis: 0.808 ± 0.237
1.617AsnIle: 1.617 ± 0.275
1.328AsnLys: 1.328 ± 0.299
2.078AsnLeu: 2.078 ± 0.309
0.751AsnMet: 0.751 ± 0.219
1.328AsnAsn: 1.328 ± 0.273
2.367AsnPro: 2.367 ± 0.361
1.501AsnGln: 1.501 ± 0.306
2.194AsnArg: 2.194 ± 0.291
1.847AsnSer: 1.847 ± 0.29
2.252AsnThr: 2.252 ± 0.518
3.233AsnVal: 3.233 ± 0.366
0.693AsnTrp: 0.693 ± 0.15
0.577AsnTyr: 0.577 ± 0.255
0.0AsnXaa: 0.0 ± 0.0
Pro
8.545ProAla: 8.545 ± 1.324
0.289ProCys: 0.289 ± 0.136
3.118ProAsp: 3.118 ± 0.323
3.291ProGlu: 3.291 ± 0.457
1.328ProPhe: 1.328 ± 0.357
3.695ProGly: 3.695 ± 0.673
0.577ProHis: 0.577 ± 0.252
1.905ProIle: 1.905 ± 0.361
2.598ProLys: 2.598 ± 0.347
2.829ProLeu: 2.829 ± 0.341
0.866ProMet: 0.866 ± 0.258
1.501ProAsn: 1.501 ± 0.383
2.021ProPro: 2.021 ± 0.314
1.79ProGln: 1.79 ± 0.363
2.771ProArg: 2.771 ± 0.379
3.579ProSer: 3.579 ± 0.487
2.887ProThr: 2.887 ± 0.464
2.829ProVal: 2.829 ± 0.352
0.289ProTrp: 0.289 ± 0.123
1.039ProTyr: 1.039 ± 0.315
0.0ProXaa: 0.0 ± 0.0
Gln
6.582GlnAla: 6.582 ± 0.769
0.115GlnCys: 0.115 ± 0.084
1.905GlnAsp: 1.905 ± 0.336
2.367GlnGlu: 2.367 ± 0.396
1.097GlnPhe: 1.097 ± 0.222
3.349GlnGly: 3.349 ± 0.549
0.808GlnHis: 0.808 ± 0.196
2.194GlnIle: 2.194 ± 0.338
1.963GlnLys: 1.963 ± 0.377
3.118GlnLeu: 3.118 ± 0.379
1.501GlnMet: 1.501 ± 0.349
1.386GlnAsn: 1.386 ± 0.396
1.674GlnPro: 1.674 ± 0.246
4.041GlnGln: 4.041 ± 0.885
2.829GlnArg: 2.829 ± 0.438
1.732GlnSer: 1.732 ± 0.354
1.847GlnThr: 1.847 ± 0.421
2.713GlnVal: 2.713 ± 0.403
0.635GlnTrp: 0.635 ± 0.205
1.097GlnTyr: 1.097 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
8.949ArgAla: 8.949 ± 0.768
0.462ArgCys: 0.462 ± 0.236
3.926ArgAsp: 3.926 ± 0.401
4.619ArgGlu: 4.619 ± 0.665
3.522ArgPhe: 3.522 ± 0.595
4.33ArgGly: 4.33 ± 0.558
1.27ArgHis: 1.27 ± 0.274
3.522ArgIle: 3.522 ± 0.365
3.406ArgLys: 3.406 ± 0.421
5.023ArgLeu: 5.023 ± 0.5
2.194ArgMet: 2.194 ± 0.342
3.175ArgAsn: 3.175 ± 0.351
2.483ArgPro: 2.483 ± 0.407
3.349ArgGln: 3.349 ± 0.56
4.676ArgArg: 4.676 ± 0.529
3.637ArgSer: 3.637 ± 0.417
2.771ArgThr: 2.771 ± 0.399
4.734ArgVal: 4.734 ± 0.603
1.155ArgTrp: 1.155 ± 0.285
2.887ArgTyr: 2.887 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
7.159SerAla: 7.159 ± 0.881
0.577SerCys: 0.577 ± 0.236
3.579SerAsp: 3.579 ± 0.452
3.06SerGlu: 3.06 ± 0.368
1.847SerPhe: 1.847 ± 0.292
5.196SerGly: 5.196 ± 0.622
1.155SerHis: 1.155 ± 0.279
2.425SerIle: 2.425 ± 0.305
2.54SerLys: 2.54 ± 0.329
3.753SerLeu: 3.753 ± 0.386
1.212SerMet: 1.212 ± 0.362
2.194SerAsn: 2.194 ± 0.38
2.598SerPro: 2.598 ± 0.404
2.252SerGln: 2.252 ± 0.365
3.637SerArg: 3.637 ± 0.466
2.598SerSer: 2.598 ± 0.432
3.637SerThr: 3.637 ± 0.456
3.753SerVal: 3.753 ± 0.415
1.097SerTrp: 1.097 ± 0.28
1.27SerTyr: 1.27 ± 0.311
0.0SerXaa: 0.0 ± 0.0
Thr
6.351ThrAla: 6.351 ± 0.535
0.231ThrCys: 0.231 ± 0.139
2.54ThrAsp: 2.54 ± 0.383
3.868ThrGlu: 3.868 ± 0.44
1.386ThrPhe: 1.386 ± 0.321
5.311ThrGly: 5.311 ± 0.582
1.039ThrHis: 1.039 ± 0.207
2.887ThrIle: 2.887 ± 0.38
2.887ThrLys: 2.887 ± 0.32
4.503ThrLeu: 4.503 ± 0.571
1.617ThrMet: 1.617 ± 0.332
1.79ThrAsn: 1.79 ± 0.329
3.118ThrPro: 3.118 ± 0.382
1.443ThrGln: 1.443 ± 0.304
2.829ThrArg: 2.829 ± 0.395
2.771ThrSer: 2.771 ± 0.628
3.349ThrThr: 3.349 ± 0.534
4.33ThrVal: 4.33 ± 0.577
0.635ThrTrp: 0.635 ± 0.155
1.617ThrTyr: 1.617 ± 0.336
0.0ThrXaa: 0.0 ± 0.0
Val
7.448ValAla: 7.448 ± 0.596
0.808ValCys: 0.808 ± 0.352
3.868ValAsp: 3.868 ± 0.425
4.503ValGlu: 4.503 ± 0.546
1.501ValPhe: 1.501 ± 0.338
4.85ValGly: 4.85 ± 0.504
0.981ValHis: 0.981 ± 0.266
2.54ValIle: 2.54 ± 0.484
3.118ValLys: 3.118 ± 0.618
3.349ValLeu: 3.349 ± 0.353
2.021ValMet: 2.021 ± 0.319
2.598ValAsn: 2.598 ± 0.394
3.926ValPro: 3.926 ± 0.425
2.367ValGln: 2.367 ± 0.285
4.965ValArg: 4.965 ± 0.498
4.676ValSer: 4.676 ± 0.511
4.33ValThr: 4.33 ± 0.571
3.926ValVal: 3.926 ± 0.51
0.981ValTrp: 0.981 ± 0.338
1.674ValTyr: 1.674 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
1.155TrpAla: 1.155 ± 0.244
0.173TrpCys: 0.173 ± 0.093
0.577TrpAsp: 0.577 ± 0.222
0.52TrpGlu: 0.52 ± 0.207
0.635TrpPhe: 0.635 ± 0.24
1.039TrpGly: 1.039 ± 0.18
0.404TrpHis: 0.404 ± 0.205
0.693TrpIle: 0.693 ± 0.206
0.462TrpLys: 0.462 ± 0.154
1.27TrpLeu: 1.27 ± 0.307
0.52TrpMet: 0.52 ± 0.14
0.577TrpAsn: 0.577 ± 0.222
0.462TrpPro: 0.462 ± 0.203
0.404TrpGln: 0.404 ± 0.124
1.27TrpArg: 1.27 ± 0.308
0.751TrpSer: 0.751 ± 0.259
0.866TrpThr: 0.866 ± 0.227
0.866TrpVal: 0.866 ± 0.169
0.231TrpTrp: 0.231 ± 0.168
0.231TrpTyr: 0.231 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.002TyrAla: 3.002 ± 0.374
0.173TyrCys: 0.173 ± 0.112
2.136TyrAsp: 2.136 ± 0.222
2.078TyrGlu: 2.078 ± 0.289
0.866TyrPhe: 0.866 ± 0.227
1.79TyrGly: 1.79 ± 0.374
0.404TyrHis: 0.404 ± 0.142
1.155TyrIle: 1.155 ± 0.26
1.039TyrLys: 1.039 ± 0.198
2.598TyrLeu: 2.598 ± 0.42
0.577TyrMet: 0.577 ± 0.179
1.097TyrAsn: 1.097 ± 0.239
0.981TyrPro: 0.981 ± 0.25
0.751TyrGln: 0.751 ± 0.168
2.078TyrArg: 2.078 ± 0.353
1.559TyrSer: 1.559 ± 0.334
1.79TyrThr: 1.79 ± 0.318
1.732TyrVal: 1.732 ± 0.356
0.289TyrTrp: 0.289 ± 0.217
0.577TyrTyr: 0.577 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (17322 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski