Amino acid dipeptide frequency for Rhodococcus phage REQ2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
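As an illustration of how such a frequency can be computed, here is a minimal Python sketch. It is not the code used by Proteome-pI. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that pairs never span two proteins, and that sequences use one-letter codes (the table on this page uses three-letter codes). The function name dipeptide_frequencies and the toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and no pair is formed across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the REQ2 proteome).
freqs = dipeptide_frequencies(["MAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The two toy sequences contain five dipeptides in total (MA, AA, AL, AA, AK), so all frequencies together sum to 1000 per mille.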

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
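The arithmetic behind the expected value and the unit conversion can be sketched in a few lines of Python. This is only an illustration: the page does not state the exact rule used for colouring, so the sketch assumes a plain comparison against the uniform expectation over the 400 standard dipeptides. The names classify and permille_to_fraction are hypothetical; the two example values are AlaAla and CysCys from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa not counted

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 21.739  # AlaAla, from the table
cys_cys = 0.131   # CysCys, from the table

print(expected_permille)
print(classify(ala_ala), classify(cys_cys))
print(permille_to_fraction(ala_ala))
```

Under this assumption AlaAla counts as overrepresented and CysCys as underrepresented.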

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.739 ± 2.147
AlaCys: 0.657 ± 0.249
AlaAsp: 8.407 ± 1.064
AlaGlu: 10.049 ± 1.155
AlaPhe: 2.758 ± 0.488
AlaGly: 9.326 ± 0.85
AlaHis: 2.364 ± 0.437
AlaIle: 6.239 ± 0.557
AlaLys: 3.744 ± 0.833
AlaLeu: 10.64 ± 0.841
AlaMet: 3.612 ± 0.399
AlaAsn: 3.218 ± 0.473
AlaPro: 5.517 ± 0.588
AlaGln: 4.926 ± 0.575
AlaArg: 8.341 ± 0.808
AlaSer: 7.159 ± 0.626
AlaThr: 8.735 ± 0.9
AlaVal: 8.078 ± 0.851
AlaTrp: 2.496 ± 0.45
AlaTyr: 2.43 ± 0.591
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.722 ± 0.229
CysCys: 0.131 ± 0.085
CysAsp: 0.197 ± 0.106
CysGlu: 0.46 ± 0.157
CysPhe: 0.263 ± 0.112
CysGly: 0.788 ± 0.261
CysHis: 0.394 ± 0.207
CysIle: 0.263 ± 0.124
CysLys: 0.197 ± 0.117
CysLeu: 0.657 ± 0.257
CysMet: 0.066 ± 0.061
CysAsn: 0.328 ± 0.133
CysPro: 0.46 ± 0.174
CysGln: 0.263 ± 0.153
CysArg: 1.117 ± 0.321
CysSer: 0.525 ± 0.19
CysThr: 0.394 ± 0.171
CysVal: 0.788 ± 0.253
CysTrp: 0.131 ± 0.086
CysTyr: 0.131 ± 0.09
CysXaa: 0.0 ± 0.0
Asp
AspAla: 9.392 ± 0.826
AspCys: 0.591 ± 0.223
AspAsp: 4.86 ± 0.774
AspGlu: 5.057 ± 0.783
AspPhe: 1.576 ± 0.358
AspGly: 5.78 ± 0.547
AspHis: 1.97 ± 0.402
AspIle: 3.087 ± 0.518
AspLys: 1.642 ± 0.337
AspLeu: 4.663 ± 0.579
AspMet: 1.576 ± 0.324
AspAsn: 1.511 ± 0.291
AspPro: 3.284 ± 0.513
AspGln: 2.102 ± 0.311
AspArg: 5.057 ± 0.636
AspSer: 3.941 ± 0.496
AspThr: 4.072 ± 0.479
AspVal: 3.547 ± 0.383
AspTrp: 1.182 ± 0.293
AspTyr: 1.314 ± 0.337
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.75 ± 0.807
GluCys: 0.525 ± 0.17
GluAsp: 3.087 ± 0.707
GluGlu: 2.561 ± 0.357
GluPhe: 1.379 ± 0.302
GluGly: 4.269 ± 0.548
GluHis: 2.364 ± 0.454
GluIle: 3.021 ± 0.39
GluLys: 1.839 ± 0.406
GluLeu: 5.123 ± 0.645
GluMet: 1.314 ± 0.372
GluAsn: 1.905 ± 0.284
GluPro: 2.299 ± 0.593
GluGln: 3.744 ± 0.606
GluArg: 4.138 ± 0.565
GluSer: 2.824 ± 0.413
GluThr: 3.744 ± 0.491
GluVal: 3.284 ± 0.427
GluTrp: 0.985 ± 0.209
GluTyr: 1.248 ± 0.241
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.693 ± 0.416
PheCys: 0.0 ± 0.0
PheAsp: 2.758 ± 0.495
PheGlu: 2.299 ± 0.4
PhePhe: 1.051 ± 0.332
PheGly: 2.693 ± 0.59
PheHis: 0.46 ± 0.154
PheIle: 1.314 ± 0.257
PheLys: 0.854 ± 0.203
PheLeu: 1.642 ± 0.339
PheMet: 0.328 ± 0.152
PheAsn: 0.394 ± 0.14
PhePro: 1.708 ± 0.344
PheGln: 0.525 ± 0.167
PheArg: 1.773 ± 0.392
PheSer: 1.314 ± 0.4
PheThr: 2.102 ± 0.307
PheVal: 1.379 ± 0.315
PheTrp: 0.328 ± 0.127
PheTyr: 0.46 ± 0.143
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.816 ± 0.929
GlyCys: 0.788 ± 0.275
GlyAsp: 3.547 ± 0.498
GlyGlu: 4.269 ± 0.607
GlyPhe: 2.693 ± 0.453
GlyGly: 7.29 ± 0.681
GlyHis: 1.445 ± 0.392
GlyIle: 4.466 ± 0.657
GlyLys: 3.021 ± 0.441
GlyLeu: 7.881 ± 1.137
GlyMet: 1.379 ± 0.239
GlyAsn: 2.89 ± 0.478
GlyPro: 3.547 ± 0.434
GlyGln: 2.561 ± 0.537
GlyArg: 5.517 ± 0.657
GlySer: 4.138 ± 0.617
GlyThr: 6.568 ± 0.83
GlyVal: 6.633 ± 0.726
GlyTrp: 1.97 ± 0.336
GlyTyr: 2.693 ± 0.371
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.233 ± 0.435
HisCys: 0.131 ± 0.101
HisAsp: 1.248 ± 0.322
HisGlu: 1.511 ± 0.332
HisPhe: 0.854 ± 0.195
HisGly: 1.379 ± 0.26
HisHis: 0.525 ± 0.188
HisIle: 0.722 ± 0.239
HisLys: 0.919 ± 0.372
HisLeu: 2.824 ± 0.541
HisMet: 0.46 ± 0.2
HisAsn: 0.591 ± 0.212
HisPro: 1.379 ± 0.397
HisGln: 0.788 ± 0.218
HisArg: 1.839 ± 0.361
HisSer: 0.854 ± 0.237
HisThr: 1.511 ± 0.387
HisVal: 1.576 ± 0.314
HisTrp: 0.46 ± 0.171
HisTyr: 0.263 ± 0.135
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.962 ± 0.606
IleCys: 0.46 ± 0.154
IleAsp: 3.678 ± 0.499
IleGlu: 2.758 ± 0.488
IlePhe: 1.248 ± 0.312
IleGly: 4.729 ± 0.756
IleHis: 1.117 ± 0.274
IleIle: 1.708 ± 0.287
IleLys: 2.299 ± 0.71
IleLeu: 2.89 ± 0.437
IleMet: 0.525 ± 0.186
IleAsn: 1.182 ± 0.274
IlePro: 2.496 ± 0.437
IleGln: 1.576 ± 0.263
IleArg: 3.547 ± 0.51
IleSer: 2.89 ± 0.458
IleThr: 3.284 ± 0.433
IleVal: 3.809 ± 0.382
IleTrp: 0.985 ± 0.317
IleTyr: 0.985 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.532 ± 0.652
LysCys: 0.197 ± 0.106
LysAsp: 1.839 ± 0.375
LysGlu: 1.379 ± 0.325
LysPhe: 0.985 ± 0.253
LysGly: 1.839 ± 0.344
LysHis: 0.985 ± 0.265
LysIle: 1.773 ± 0.301
LysLys: 1.379 ± 0.303
LysLeu: 3.284 ± 0.566
LysMet: 0.985 ± 0.26
LysAsn: 1.379 ± 0.337
LysPro: 2.496 ± 0.396
LysGln: 0.854 ± 0.291
LysArg: 3.284 ± 0.536
LysSer: 1.773 ± 0.299
LysThr: 1.576 ± 0.354
LysVal: 2.693 ± 0.454
LysTrp: 0.722 ± 0.19
LysTyr: 0.328 ± 0.162
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.311 ± 0.966
LeuCys: 0.788 ± 0.217
LeuAsp: 5.254 ± 0.612
LeuGlu: 4.729 ± 0.56
LeuPhe: 1.314 ± 0.318
LeuGly: 6.305 ± 0.93
LeuHis: 1.773 ± 0.44
LeuIle: 4.597 ± 0.454
LeuLys: 2.89 ± 0.606
LeuLeu: 7.224 ± 0.63
LeuMet: 1.511 ± 0.292
LeuAsn: 2.233 ± 0.4
LeuPro: 3.612 ± 0.502
LeuGln: 2.955 ± 0.345
LeuArg: 5.123 ± 0.658
LeuSer: 4.4 ± 0.461
LeuThr: 5.714 ± 0.623
LeuVal: 6.896 ± 0.704
LeuTrp: 1.379 ± 0.328
LeuTyr: 1.182 ± 0.349
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.89 ± 0.429
MetCys: 0.066 ± 0.068
MetAsp: 0.788 ± 0.24
MetGlu: 0.722 ± 0.224
MetPhe: 0.328 ± 0.129
MetGly: 1.314 ± 0.328
MetHis: 0.722 ± 0.264
MetIle: 0.788 ± 0.193
MetLys: 1.117 ± 0.262
MetLeu: 1.445 ± 0.298
MetMet: 0.328 ± 0.118
MetAsn: 1.051 ± 0.205
MetPro: 1.576 ± 0.28
MetGln: 0.46 ± 0.134
MetArg: 1.511 ± 0.269
MetSer: 1.839 ± 0.367
MetThr: 3.415 ± 0.419
MetVal: 1.379 ± 0.311
MetTrp: 0.394 ± 0.179
MetTyr: 0.131 ± 0.078
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.153 ± 0.426
AsnCys: 0.066 ± 0.076
AsnAsp: 1.511 ± 0.327
AsnGlu: 1.379 ± 0.274
AsnPhe: 0.657 ± 0.203
AsnGly: 2.824 ± 0.398
AsnHis: 0.263 ± 0.128
AsnIle: 1.642 ± 0.373
AsnLys: 0.591 ± 0.173
AsnLeu: 2.43 ± 0.443
AsnMet: 0.722 ± 0.191
AsnAsn: 0.46 ± 0.152
AsnPro: 2.364 ± 0.418
AsnGln: 1.248 ± 0.231
AsnArg: 2.036 ± 0.299
AsnSer: 1.708 ± 0.42
AsnThr: 1.511 ± 0.259
AsnVal: 1.905 ± 0.326
AsnTrp: 1.117 ± 0.223
AsnTyr: 0.657 ± 0.194
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.487 ± 0.739
ProCys: 0.722 ± 0.189
ProAsp: 4.269 ± 0.584
ProGlu: 3.087 ± 0.434
ProPhe: 1.248 ± 0.29
ProGly: 3.744 ± 0.495
ProHis: 1.117 ± 0.255
ProIle: 2.299 ± 0.331
ProLys: 1.773 ± 0.334
ProLeu: 2.89 ± 0.364
ProMet: 1.905 ± 0.304
ProAsn: 1.511 ± 0.376
ProPro: 2.89 ± 0.608
ProGln: 1.248 ± 0.252
ProArg: 3.35 ± 0.595
ProSer: 2.496 ± 0.428
ProThr: 3.547 ± 0.557
ProVal: 3.547 ± 0.467
ProTrp: 1.117 ± 0.25
ProTyr: 1.117 ± 0.333
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.481 ± 0.439
GlnCys: 0.46 ± 0.193
GlnAsp: 1.445 ± 0.26
GlnGlu: 1.445 ± 0.282
GlnPhe: 1.379 ± 0.302
GlnGly: 2.561 ± 0.357
GlnHis: 0.854 ± 0.222
GlnIle: 2.693 ± 0.396
GlnLys: 1.051 ± 0.249
GlnLeu: 3.612 ± 0.471
GlnMet: 0.854 ± 0.232
GlnAsn: 0.788 ± 0.266
GlnPro: 1.576 ± 0.285
GlnGln: 1.576 ± 0.306
GlnArg: 3.284 ± 0.512
GlnSer: 1.708 ± 0.409
GlnThr: 2.43 ± 0.341
GlnVal: 1.708 ± 0.244
GlnTrp: 0.919 ± 0.29
GlnTyr: 0.657 ± 0.205
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.407 ± 0.991
ArgCys: 0.525 ± 0.223
ArgAsp: 5.188 ± 0.614
ArgGlu: 4.335 ± 0.542
ArgPhe: 1.773 ± 0.335
ArgGly: 5.583 ± 0.686
ArgHis: 1.182 ± 0.328
ArgIle: 3.218 ± 0.477
ArgLys: 2.955 ± 0.504
ArgLeu: 5.254 ± 0.685
ArgMet: 1.773 ± 0.371
ArgAsn: 1.905 ± 0.342
ArgPro: 3.678 ± 0.582
ArgGln: 2.824 ± 0.383
ArgArg: 7.093 ± 1.056
ArgSer: 4.466 ± 0.577
ArgThr: 4.86 ± 0.725
ArgVal: 4.926 ± 0.539
ArgTrp: 1.511 ± 0.363
ArgTyr: 1.445 ± 0.334
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.224 ± 0.777
SerCys: 0.394 ± 0.169
SerAsp: 3.481 ± 0.576
SerGlu: 2.693 ± 0.442
SerPhe: 2.364 ± 0.319
SerGly: 5.32 ± 0.721
SerHis: 1.379 ± 0.318
SerIle: 2.627 ± 0.388
SerLys: 2.036 ± 0.344
SerLeu: 3.415 ± 0.362
SerMet: 1.314 ± 0.279
SerAsn: 1.708 ± 0.349
SerPro: 2.89 ± 0.413
SerGln: 1.839 ± 0.288
SerArg: 3.941 ± 0.447
SerSer: 3.415 ± 0.708
SerThr: 3.678 ± 0.511
SerVal: 4.006 ± 0.485
SerTrp: 1.051 ± 0.274
SerTyr: 1.314 ± 0.289
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.786 ± 1.042
ThrCys: 0.657 ± 0.228
ThrAsp: 5.911 ± 0.723
ThrGlu: 3.284 ± 0.522
ThrPhe: 1.576 ± 0.344
ThrGly: 4.729 ± 0.618
ThrHis: 1.051 ± 0.283
ThrIle: 3.612 ± 0.323
ThrLys: 2.496 ± 0.395
ThrLeu: 5.386 ± 0.708
ThrMet: 1.379 ± 0.305
ThrAsn: 1.642 ± 0.312
ThrPro: 4.138 ± 0.554
ThrGln: 1.511 ± 0.277
ThrArg: 4.86 ± 0.413
ThrSer: 3.612 ± 0.416
ThrThr: 4.663 ± 0.892
ThrVal: 5.386 ± 0.658
ThrTrp: 1.182 ± 0.257
ThrTyr: 1.445 ± 0.366
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.917 ± 0.827
ValCys: 0.394 ± 0.144
ValAsp: 5.977 ± 0.712
ValGlu: 3.678 ± 0.502
ValPhe: 1.773 ± 0.528
ValGly: 7.356 ± 0.829
ValHis: 1.314 ± 0.326
ValIle: 3.021 ± 0.483
ValLys: 2.496 ± 0.414
ValLeu: 5.188 ± 0.678
ValMet: 1.576 ± 0.351
ValAsn: 1.773 ± 0.306
ValPro: 3.087 ± 0.499
ValGln: 1.839 ± 0.318
ValArg: 4.335 ± 0.51
ValSer: 4.466 ± 0.621
ValThr: 3.875 ± 0.449
ValVal: 5.451 ± 0.639
ValTrp: 1.839 ± 0.544
ValTyr: 1.511 ± 0.301
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.248 ± 0.309
TrpCys: 0.525 ± 0.187
TrpAsp: 1.708 ± 0.433
TrpGlu: 0.985 ± 0.333
TrpPhe: 0.525 ± 0.195
TrpGly: 1.379 ± 0.316
TrpHis: 0.46 ± 0.185
TrpIle: 1.314 ± 0.344
TrpLys: 0.46 ± 0.165
TrpLeu: 1.839 ± 0.334
TrpMet: 0.263 ± 0.119
TrpAsn: 1.379 ± 0.334
TrpPro: 1.248 ± 0.303
TrpGln: 0.722 ± 0.275
TrpArg: 1.511 ± 0.332
TrpSer: 1.117 ± 0.264
TrpThr: 1.379 ± 0.266
TrpVal: 1.839 ± 0.389
TrpTrp: 0.394 ± 0.246
TrpTyr: 0.263 ± 0.134
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.693 ± 0.344
TyrCys: 0.197 ± 0.125
TyrAsp: 1.051 ± 0.227
TyrGlu: 0.722 ± 0.269
TyrPhe: 0.328 ± 0.129
TyrGly: 1.905 ± 0.406
TyrHis: 0.328 ± 0.139
TyrIle: 0.722 ± 0.245
TyrLys: 0.591 ± 0.184
TyrLeu: 1.905 ± 0.271
TyrMet: 0.328 ± 0.15
TyrAsn: 0.394 ± 0.135
TyrPro: 1.117 ± 0.278
TyrGln: 0.919 ± 0.265
TyrArg: 1.248 ± 0.327
TyrSer: 1.445 ± 0.382
TyrThr: 1.248 ± 0.313
TyrVal: 2.036 ± 0.374
TyrTrp: 0.394 ± 0.178
TyrTyr: 0.197 ± 0.109
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (15227 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
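To illustrate what bootstrapping at the protein level means, here is a hedged Python sketch. It is not the Proteome-pI implementation. It assumes that whole proteins are resampled with replacement, that 100 rounds are used as stated in the note, and that the reported error is the standard deviation of the resampled frequencies (the page does not say which statistic is used). The names dipeptide_permille and bootstrap_error, the seed parameter and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs within each protein)."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation of the frequency
    over n_rounds bootstrap replicates of the protein list.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy = ["MAAL", "AAK", "MKAA", "GAAG"]
value = dipeptide_permille(toy)["AA"]
error = bootstrap_error(toy, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than single residues keeps each sequence intact, so the spread reflects how much the frequency depends on which proteins are in the set.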

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski