Amino acid dipeptide frequency for Aeribacillus phage AP45

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
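As an illustration of what such a frequency means, the following is a minimal sketch and not the code used by Proteome-pI. It assumes that dipeptides are counted as overlapping pairs of adjacent residues, that pairs never span two proteins, and that frequencies are normalised over all pairs in the proteome. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes in the table.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein only and normalised
    by the total number of pairs in the whole set of proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs: residues (0,1), (1,2), ..., (n-2, n-1).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome: "MKKLA" gives MK, KK, KL, LA; "AKK" gives AK, KK.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
print(freqs["KK"])  # 2 of the 6 pairs are KK
```

Counting overlapping pairs means a protein of n residues contributes n - 1 dipeptides, so short proteins add little to the totals.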

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
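The arithmetic behind the expected value can be sketched as follows. This is only an illustration: the function names are hypothetical, and the exact rule the page uses to colour a cell is not stated, so a plain comparison against the uniform expectation is assumed here.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide if all are equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

print(uniform_expectation_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(classify(5.227))  # AlaAla value from the table -> "over"
print(classify(0.446))  # AlaCys value from the table -> "under"
```

With Xaa counted as a 21st residue the expectation drops to 1000/441, roughly 2.27‰.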

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Each block below lists the dipeptides starting with one first residue, as dipeptide: frequency ± error (‰).
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.227 ± 1.123
AlaCys: 0.446 ± 0.149
AlaAsp: 3.888 ± 0.696
AlaGlu: 5.036 ± 0.613
AlaPhe: 2.358 ± 0.356
AlaGly: 3.506 ± 0.511
AlaHis: 0.637 ± 0.231
AlaIle: 4.398 ± 0.565
AlaLys: 6.247 ± 0.658
AlaLeu: 5.546 ± 0.833
AlaMet: 2.295 ± 0.423
AlaAsn: 4.016 ± 0.636
AlaPro: 1.211 ± 0.234
AlaGln: 2.04 ± 0.375
AlaArg: 2.422 ± 0.473
AlaSer: 3.315 ± 0.582
AlaThr: 3.251 ± 0.49
AlaVal: 4.08 ± 0.601
AlaTrp: 0.956 ± 0.272
AlaTyr: 1.976 ± 0.317
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.127 ± 0.094
CysCys: 0.0 ± 0.0
CysAsp: 0.319 ± 0.234
CysGlu: 0.446 ± 0.153
CysPhe: 0.446 ± 0.222
CysGly: 0.637 ± 0.208
CysHis: 0.127 ± 0.09
CysIle: 0.382 ± 0.176
CysLys: 0.319 ± 0.142
CysLeu: 0.446 ± 0.196
CysMet: 0.191 ± 0.13
CysAsn: 0.382 ± 0.194
CysPro: 0.382 ± 0.165
CysGln: 0.255 ± 0.142
CysArg: 0.382 ± 0.159
CysSer: 0.701 ± 0.264
CysThr: 0.574 ± 0.186
CysVal: 0.255 ± 0.114
CysTrp: 0.191 ± 0.113
CysTyr: 0.319 ± 0.129
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.761 ± 0.648
AspCys: 0.446 ± 0.167
AspAsp: 3.697 ± 0.653
AspGlu: 6.438 ± 0.736
AspPhe: 2.805 ± 0.426
AspGly: 4.462 ± 0.69
AspHis: 0.892 ± 0.222
AspIle: 4.08 ± 0.509
AspLys: 3.57 ± 0.451
AspLeu: 4.016 ± 0.534
AspMet: 1.785 ± 0.283
AspAsn: 1.849 ± 0.313
AspPro: 2.231 ± 0.416
AspGln: 1.721 ± 0.401
AspArg: 3.251 ± 0.561
AspSer: 3.06 ± 0.433
AspThr: 2.358 ± 0.378
AspVal: 3.633 ± 0.47
AspTrp: 1.02 ± 0.322
AspTyr: 2.358 ± 0.348
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.271 ± 0.662
GluCys: 0.255 ± 0.177
GluAsp: 3.761 ± 0.593
GluGlu: 5.801 ± 0.93
GluPhe: 2.996 ± 0.42
GluGly: 4.844 ± 0.685
GluHis: 1.466 ± 0.327
GluIle: 6.502 ± 0.896
GluLys: 8.095 ± 1.051
GluLeu: 7.968 ± 0.797
GluMet: 2.295 ± 0.415
GluAsn: 6.119 ± 0.817
GluPro: 1.657 ± 0.288
GluGln: 3.123 ± 0.566
GluArg: 4.653 ± 0.738
GluSer: 3.251 ± 0.424
GluThr: 4.526 ± 0.486
GluVal: 5.291 ± 0.612
GluTrp: 1.402 ± 0.333
GluTyr: 2.486 ± 0.532
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.358 ± 0.375
PheCys: 0.446 ± 0.212
PheAsp: 3.123 ± 0.451
PheGlu: 2.677 ± 0.433
PhePhe: 1.53 ± 0.307
PheGly: 2.486 ± 0.392
PheHis: 0.574 ± 0.167
PheIle: 4.08 ± 0.594
PheLys: 4.08 ± 0.616
PheLeu: 4.335 ± 0.572
PheMet: 1.084 ± 0.267
PheAsn: 3.378 ± 0.452
PhePro: 1.466 ± 0.324
PheGln: 1.275 ± 0.334
PheArg: 1.785 ± 0.352
PheSer: 3.06 ± 0.436
PheThr: 2.231 ± 0.317
PheVal: 2.613 ± 0.374
PheTrp: 0.446 ± 0.146
PheTyr: 1.53 ± 0.225
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.506 ± 0.619
GlyCys: 0.127 ± 0.096
GlyAsp: 3.315 ± 0.494
GlyGlu: 3.825 ± 0.577
GlyPhe: 3.697 ± 0.439
GlyGly: 2.868 ± 0.62
GlyHis: 0.701 ± 0.172
GlyIle: 5.099 ± 0.644
GlyLys: 5.609 ± 0.608
GlyLeu: 5.609 ± 1.048
GlyMet: 1.53 ± 0.354
GlyAsn: 3.57 ± 0.443
GlyPro: 0.701 ± 0.304
GlyGln: 2.104 ± 0.471
GlyArg: 2.486 ± 0.388
GlySer: 3.761 ± 0.543
GlyThr: 3.378 ± 0.632
GlyVal: 4.844 ± 0.667
GlyTrp: 0.956 ± 0.281
GlyTyr: 2.741 ± 0.499
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.956 ± 0.345
HisCys: 0.127 ± 0.088
HisAsp: 0.765 ± 0.194
HisGlu: 1.211 ± 0.295
HisPhe: 0.765 ± 0.22
HisGly: 0.829 ± 0.207
HisHis: 0.255 ± 0.129
HisIle: 1.402 ± 0.327
HisLys: 1.339 ± 0.271
HisLeu: 1.466 ± 0.297
HisMet: 0.191 ± 0.124
HisAsn: 1.02 ± 0.22
HisPro: 1.147 ± 0.288
HisGln: 0.637 ± 0.23
HisArg: 0.892 ± 0.265
HisSer: 0.51 ± 0.218
HisThr: 0.51 ± 0.156
HisVal: 0.892 ± 0.224
HisTrp: 0.255 ± 0.172
HisTyr: 0.446 ± 0.206
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.207 ± 0.527
IleCys: 0.892 ± 0.253
IleAsp: 5.482 ± 0.888
IleGlu: 7.203 ± 0.905
IlePhe: 3.187 ± 0.523
IleGly: 3.442 ± 0.512
IleHis: 1.594 ± 0.389
IleIle: 4.844 ± 0.743
IleLys: 6.757 ± 0.686
IleLeu: 4.335 ± 0.495
IleMet: 1.785 ± 0.383
IleAsn: 5.354 ± 0.891
IlePro: 2.677 ± 0.509
IleGln: 3.633 ± 0.416
IleArg: 4.589 ± 0.392
IleSer: 4.781 ± 0.639
IleThr: 4.335 ± 0.621
IleVal: 4.398 ± 0.534
IleTrp: 0.637 ± 0.231
IleTyr: 1.976 ± 0.339
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.247 ± 0.68
LysCys: 0.191 ± 0.129
LysAsp: 4.717 ± 0.546
LysGlu: 7.713 ± 1.129
LysPhe: 2.55 ± 0.471
LysGly: 5.609 ± 0.656
LysHis: 1.785 ± 0.425
LysIle: 6.757 ± 0.698
LysLys: 8.414 ± 0.801
LysLeu: 7.522 ± 0.714
LysMet: 3.06 ± 0.46
LysAsn: 4.462 ± 0.501
LysPro: 2.55 ± 0.538
LysGln: 4.462 ± 0.698
LysArg: 5.418 ± 0.782
LysSer: 4.526 ± 0.461
LysThr: 3.633 ± 0.511
LysVal: 5.354 ± 0.582
LysTrp: 1.02 ± 0.252
LysTyr: 3.888 ± 0.561
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.099 ± 0.861
LeuCys: 0.701 ± 0.272
LeuAsp: 5.099 ± 0.517
LeuGlu: 6.502 ± 0.808
LeuPhe: 3.952 ± 0.489
LeuGly: 3.888 ± 0.491
LeuHis: 1.02 ± 0.265
LeuIle: 5.928 ± 0.699
LeuLys: 7.585 ± 0.735
LeuLeu: 6.247 ± 0.752
LeuMet: 1.721 ± 0.355
LeuAsn: 5.418 ± 0.532
LeuPro: 3.06 ± 0.451
LeuGln: 4.143 ± 0.644
LeuArg: 3.442 ± 0.468
LeuSer: 6.183 ± 0.64
LeuThr: 4.207 ± 0.598
LeuVal: 4.335 ± 0.592
LeuTrp: 0.892 ± 0.289
LeuTyr: 2.805 ± 0.464
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.231 ± 0.436
MetCys: 0.064 ± 0.074
MetAsp: 0.892 ± 0.269
MetGlu: 1.53 ± 0.37
MetPhe: 1.084 ± 0.285
MetGly: 1.147 ± 0.29
MetHis: 0.255 ± 0.119
MetIle: 2.104 ± 0.457
MetLys: 3.123 ± 0.484
MetLeu: 2.167 ± 0.338
MetMet: 0.829 ± 0.203
MetAsn: 1.849 ± 0.38
MetPro: 0.765 ± 0.211
MetGln: 0.829 ± 0.236
MetArg: 1.594 ± 0.366
MetSer: 1.785 ± 0.301
MetThr: 1.275 ± 0.213
MetVal: 1.084 ± 0.326
MetTrp: 0.382 ± 0.153
MetTyr: 0.574 ± 0.201
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.08 ± 0.978
AsnCys: 0.51 ± 0.196
AsnAsp: 3.442 ± 0.435
AsnGlu: 5.163 ± 0.64
AsnPhe: 2.805 ± 0.43
AsnGly: 4.653 ± 0.505
AsnHis: 0.829 ± 0.268
AsnIle: 4.143 ± 0.652
AsnLys: 4.462 ± 0.581
AsnLeu: 4.143 ± 0.477
AsnMet: 1.02 ± 0.227
AsnAsn: 3.888 ± 0.571
AsnPro: 2.295 ± 0.34
AsnGln: 2.295 ± 0.355
AsnArg: 2.932 ± 0.427
AsnSer: 2.932 ± 0.55
AsnThr: 2.422 ± 0.371
AsnVal: 4.207 ± 0.622
AsnTrp: 0.765 ± 0.23
AsnTyr: 2.358 ± 0.404
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.805 ± 0.41
ProCys: 0.0 ± 0.0
ProAsp: 1.53 ± 0.282
ProGlu: 3.187 ± 0.49
ProPhe: 1.785 ± 0.331
ProGly: 1.466 ± 0.366
ProHis: 0.637 ± 0.2
ProIle: 2.104 ± 0.438
ProLys: 2.805 ± 0.34
ProLeu: 2.231 ± 0.385
ProMet: 0.446 ± 0.209
ProAsn: 1.657 ± 0.387
ProPro: 1.084 ± 0.323
ProGln: 0.829 ± 0.233
ProArg: 0.892 ± 0.226
ProSer: 2.358 ± 0.398
ProThr: 1.785 ± 0.334
ProVal: 1.594 ± 0.316
ProTrp: 0.255 ± 0.117
ProTyr: 1.339 ± 0.246
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.315 ± 0.587
GlnCys: 0.191 ± 0.1
GlnAsp: 1.849 ± 0.323
GlnGlu: 3.06 ± 0.462
GlnPhe: 1.594 ± 0.314
GlnGly: 2.231 ± 0.546
GlnHis: 0.765 ± 0.198
GlnIle: 3.57 ± 0.489
GlnLys: 3.697 ± 0.493
GlnLeu: 4.016 ± 0.78
GlnMet: 1.275 ± 0.347
GlnAsn: 1.785 ± 0.313
GlnPro: 1.211 ± 0.354
GlnGln: 1.976 ± 0.364
GlnArg: 1.275 ± 0.316
GlnSer: 2.55 ± 0.376
GlnThr: 2.104 ± 0.368
GlnVal: 2.295 ± 0.344
GlnTrp: 0.382 ± 0.163
GlnTyr: 1.084 ± 0.247
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.231 ± 0.394
ArgCys: 0.574 ± 0.21
ArgAsp: 1.912 ± 0.265
ArgGlu: 4.08 ± 0.469
ArgPhe: 2.231 ± 0.331
ArgGly: 2.677 ± 0.465
ArgHis: 0.701 ± 0.223
ArgIle: 3.187 ± 0.474
ArgLys: 5.227 ± 0.522
ArgLeu: 3.761 ± 0.62
ArgMet: 1.211 ± 0.281
ArgAsn: 2.422 ± 0.372
ArgPro: 1.147 ± 0.292
ArgGln: 2.358 ± 0.421
ArgArg: 1.657 ± 0.465
ArgSer: 2.741 ± 0.524
ArgThr: 1.976 ± 0.347
ArgVal: 3.06 ± 0.382
ArgTrp: 0.446 ± 0.17
ArgTyr: 2.104 ± 0.39
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.761 ± 0.649
SerCys: 0.446 ± 0.176
SerAsp: 3.187 ± 0.512
SerGlu: 3.952 ± 0.659
SerPhe: 3.187 ± 0.483
SerGly: 5.418 ± 0.877
SerHis: 0.892 ± 0.267
SerIle: 4.717 ± 0.692
SerLys: 4.335 ± 0.505
SerLeu: 4.143 ± 0.606
SerMet: 1.53 ± 0.253
SerAsn: 3.697 ± 0.568
SerPro: 1.849 ± 0.359
SerGln: 2.613 ± 0.427
SerArg: 1.976 ± 0.399
SerSer: 4.972 ± 1.151
SerThr: 2.932 ± 0.411
SerVal: 3.761 ± 0.531
SerTrp: 0.446 ± 0.153
SerTyr: 2.04 ± 0.394
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.358 ± 0.533
ThrCys: 0.382 ± 0.177
ThrAsp: 2.677 ± 0.417
ThrGlu: 4.207 ± 0.583
ThrPhe: 2.613 ± 0.358
ThrGly: 3.378 ± 0.419
ThrHis: 0.637 ± 0.208
ThrIle: 4.335 ± 0.613
ThrLys: 4.589 ± 0.368
ThrLeu: 4.844 ± 0.78
ThrMet: 0.892 ± 0.243
ThrAsn: 3.06 ± 0.359
ThrPro: 2.04 ± 0.398
ThrGln: 1.657 ± 0.337
ThrArg: 1.594 ± 0.343
ThrSer: 2.613 ± 0.514
ThrThr: 2.422 ± 0.419
ThrVal: 4.08 ± 0.564
ThrTrp: 0.574 ± 0.207
ThrTyr: 1.912 ± 0.313
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.506 ± 0.512
ValCys: 0.637 ± 0.207
ValAsp: 4.844 ± 0.598
ValGlu: 4.271 ± 0.517
ValPhe: 2.741 ± 0.48
ValGly: 3.442 ± 0.665
ValHis: 0.829 ± 0.236
ValIle: 4.08 ± 0.584
ValLys: 5.928 ± 0.672
ValLeu: 5.291 ± 0.568
ValMet: 1.02 ± 0.293
ValAsn: 2.486 ± 0.363
ValPro: 2.295 ± 0.345
ValGln: 2.613 ± 0.409
ValArg: 2.295 ± 0.351
ValSer: 3.697 ± 0.437
ValThr: 4.653 ± 0.653
ValVal: 3.633 ± 0.57
ValTrp: 0.956 ± 0.353
ValTyr: 2.55 ± 0.475
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.765 ± 0.228
TrpCys: 0.0 ± 0.0
TrpAsp: 0.892 ± 0.324
TrpGlu: 1.211 ± 0.326
TrpPhe: 0.701 ± 0.193
TrpGly: 0.701 ± 0.206
TrpHis: 0.319 ± 0.14
TrpIle: 0.382 ± 0.166
TrpLys: 1.275 ± 0.299
TrpLeu: 1.084 ± 0.312
TrpMet: 0.446 ± 0.149
TrpAsn: 1.211 ± 0.372
TrpPro: 0.191 ± 0.112
TrpGln: 0.191 ± 0.106
TrpArg: 0.574 ± 0.148
TrpSer: 0.637 ± 0.244
TrpThr: 0.574 ± 0.15
TrpVal: 0.637 ± 0.177
TrpTrp: 0.127 ± 0.125
TrpTyr: 0.574 ± 0.182
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.04 ± 0.409
TyrCys: 0.446 ± 0.17
TyrAsp: 2.04 ± 0.413
TyrGlu: 3.251 ± 0.644
TyrPhe: 1.53 ± 0.317
TyrGly: 2.741 ± 0.451
TyrHis: 0.701 ± 0.256
TyrIle: 4.016 ± 0.608
TyrLys: 2.422 ± 0.457
TyrLeu: 3.06 ± 0.427
TyrMet: 0.892 ± 0.234
TyrAsn: 1.721 ± 0.336
TyrPro: 0.829 ± 0.293
TyrGln: 1.53 ± 0.289
TyrArg: 1.594 ± 0.349
TyrSer: 2.422 ± 0.482
TyrThr: 1.721 ± 0.274
TyrVal: 1.721 ± 0.306
TyrTrp: 0.382 ± 0.179
TyrTyr: 1.594 ± 0.397
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (15689 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
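A protein-level bootstrap of this kind can be sketched as below. This is a hedged illustration, not the database's implementation: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The names `pair_freqs` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_freqs(proteins):
    """Per-mille frequency of each overlapping residue pair in the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}

def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Return (mean, standard deviation) of a pair's frequency over bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freqs(sample).get(pair, 0.0))
    return mean(estimates), stdev(estimates)

# Hypothetical toy proteome in one-letter code.
avg, err = bootstrap_error(["MKKLA", "AKK", "GKKG"], "KK")
print(f"KK: {avg:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.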

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski