Amino acid dipepetide frequency for Bacillus phage vB_BpsS-140

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.388AlaAla: 0.388 ± 0.149
0.259AlaCys: 0.259 ± 0.119
3.559AlaAsp: 3.559 ± 0.551
4.142AlaGlu: 4.142 ± 0.485
2.459AlaPhe: 2.459 ± 0.461
3.43AlaGly: 3.43 ± 0.551
1.165AlaHis: 1.165 ± 0.289
6.148AlaIle: 6.148 ± 1.046
5.048AlaLys: 5.048 ± 0.487
5.695AlaLeu: 5.695 ± 0.609
2.33AlaMet: 2.33 ± 0.342
3.106AlaAsn: 3.106 ± 0.628
2.2AlaPro: 2.2 ± 0.601
2.653AlaGln: 2.653 ± 0.469
2.977AlaArg: 2.977 ± 0.505
3.689AlaSer: 3.689 ± 0.413
3.753AlaThr: 3.753 ± 0.424
3.494AlaVal: 3.494 ± 0.613
0.841AlaTrp: 0.841 ± 0.228
2.33AlaTyr: 2.33 ± 0.47
0.0AlaXaa: 0.0 ± 0.0
Cys
0.259CysAla: 0.259 ± 0.133
0.129CysCys: 0.129 ± 0.092
0.518CysAsp: 0.518 ± 0.225
0.388CysGlu: 0.388 ± 0.153
0.324CysPhe: 0.324 ± 0.172
0.324CysGly: 0.324 ± 0.171
0.324CysHis: 0.324 ± 0.189
0.194CysIle: 0.194 ± 0.126
0.324CysLys: 0.324 ± 0.128
0.388CysLeu: 0.388 ± 0.172
0.065CysMet: 0.065 ± 0.068
0.259CysAsn: 0.259 ± 0.137
0.259CysPro: 0.259 ± 0.156
0.129CysGln: 0.129 ± 0.094
0.194CysArg: 0.194 ± 0.101
0.388CysSer: 0.388 ± 0.173
0.518CysThr: 0.518 ± 0.183
0.388CysVal: 0.388 ± 0.176
0.065CysTrp: 0.065 ± 0.065
0.129CysTyr: 0.129 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
3.494AspAla: 3.494 ± 0.473
0.324AspCys: 0.324 ± 0.191
5.759AspAsp: 5.759 ± 0.614
4.724AspGlu: 4.724 ± 0.485
2.071AspPhe: 2.071 ± 0.359
4.918AspGly: 4.918 ± 0.572
1.1AspHis: 1.1 ± 0.404
5.824AspIle: 5.824 ± 0.55
4.789AspLys: 4.789 ± 0.519
5.501AspLeu: 5.501 ± 0.763
2.265AspMet: 2.265 ± 0.395
4.012AspAsn: 4.012 ± 0.567
1.941AspPro: 1.941 ± 0.328
2.265AspGln: 2.265 ± 0.349
3.689AspArg: 3.689 ± 0.532
4.4AspSer: 4.4 ± 0.469
4.4AspThr: 4.4 ± 0.677
4.595AspVal: 4.595 ± 0.565
0.841AspTrp: 0.841 ± 0.204
3.041AspTyr: 3.041 ± 0.579
0.0AspXaa: 0.0 ± 0.0
Glu
3.689GluAla: 3.689 ± 0.512
0.518GluCys: 0.518 ± 0.145
5.759GluAsp: 5.759 ± 0.675
6.665GluGlu: 6.665 ± 0.762
3.106GluPhe: 3.106 ± 0.374
4.53GluGly: 4.53 ± 0.417
2.136GluHis: 2.136 ± 0.49
5.242GluIle: 5.242 ± 0.671
4.465GluLys: 4.465 ± 0.73
6.924GluLeu: 6.924 ± 0.7
2.2GluMet: 2.2 ± 0.335
3.494GluAsn: 3.494 ± 0.627
1.359GluPro: 1.359 ± 0.346
3.041GluGln: 3.041 ± 0.448
3.559GluArg: 3.559 ± 0.848
3.818GluSer: 3.818 ± 0.468
3.171GluThr: 3.171 ± 0.482
4.012GluVal: 4.012 ± 0.457
1.23GluTrp: 1.23 ± 0.279
2.459GluTyr: 2.459 ± 0.353
0.0GluXaa: 0.0 ± 0.0
Phe
1.877PheAla: 1.877 ± 0.333
0.194PheCys: 0.194 ± 0.101
3.236PheAsp: 3.236 ± 0.619
2.459PheGlu: 2.459 ± 0.412
1.23PhePhe: 1.23 ± 0.287
2.265PheGly: 2.265 ± 0.413
0.777PheHis: 0.777 ± 0.256
3.041PheIle: 3.041 ± 0.511
1.618PheLys: 1.618 ± 0.295
2.394PheLeu: 2.394 ± 0.42
0.777PheMet: 0.777 ± 0.219
2.2PheAsn: 2.2 ± 0.417
0.777PhePro: 0.777 ± 0.223
1.1PheGln: 1.1 ± 0.269
2.459PheArg: 2.459 ± 0.355
2.653PheSer: 2.653 ± 0.441
2.136PheThr: 2.136 ± 0.423
2.459PheVal: 2.459 ± 0.398
0.388PheTrp: 0.388 ± 0.19
1.747PheTyr: 1.747 ± 0.37
0.0PheXaa: 0.0 ± 0.0
Gly
4.53GlyAla: 4.53 ± 0.856
0.259GlyCys: 0.259 ± 0.122
3.947GlyAsp: 3.947 ± 0.428
4.271GlyGlu: 4.271 ± 0.583
2.977GlyPhe: 2.977 ± 0.411
4.789GlyGly: 4.789 ± 0.683
1.1GlyHis: 1.1 ± 0.254
4.918GlyIle: 4.918 ± 0.568
5.565GlyLys: 5.565 ± 0.481
4.336GlyLeu: 4.336 ± 0.548
1.618GlyMet: 1.618 ± 0.274
3.559GlyAsn: 3.559 ± 0.612
0.712GlyPro: 0.712 ± 0.203
2.33GlyGln: 2.33 ± 0.446
3.559GlyArg: 3.559 ± 0.537
4.465GlySer: 4.465 ± 0.469
4.4GlyThr: 4.4 ± 0.579
3.947GlyVal: 3.947 ± 0.551
0.518GlyTrp: 0.518 ± 0.325
2.265GlyTyr: 2.265 ± 0.442
0.0GlyXaa: 0.0 ± 0.0
His
1.035HisAla: 1.035 ± 0.255
0.129HisCys: 0.129 ± 0.076
2.265HisAsp: 2.265 ± 0.453
1.812HisGlu: 1.812 ± 0.342
0.841HisPhe: 0.841 ± 0.271
1.747HisGly: 1.747 ± 0.314
0.777HisHis: 0.777 ± 0.342
1.23HisIle: 1.23 ± 0.291
1.812HisLys: 1.812 ± 0.343
1.618HisLeu: 1.618 ± 0.282
0.712HisMet: 0.712 ± 0.202
1.035HisAsn: 1.035 ± 0.318
0.647HisPro: 0.647 ± 0.193
0.647HisGln: 0.647 ± 0.228
1.23HisArg: 1.23 ± 0.361
1.294HisSer: 1.294 ± 0.281
1.035HisThr: 1.035 ± 0.292
1.1HisVal: 1.1 ± 0.239
0.129HisTrp: 0.129 ± 0.095
0.518HisTyr: 0.518 ± 0.171
0.0HisXaa: 0.0 ± 0.0
Ile
5.759IleAla: 5.759 ± 0.47
0.324IleCys: 0.324 ± 0.152
7.118IleAsp: 7.118 ± 0.722
5.824IleGlu: 5.824 ± 0.514
1.812IlePhe: 1.812 ± 0.329
4.918IleGly: 4.918 ± 0.567
1.941IleHis: 1.941 ± 0.354
5.889IleIle: 5.889 ± 0.762
6.342IleLys: 6.342 ± 0.899
4.336IleLeu: 4.336 ± 0.411
2.006IleMet: 2.006 ± 0.42
5.759IleAsn: 5.759 ± 0.585
2.2IlePro: 2.2 ± 0.359
2.265IleGln: 2.265 ± 0.566
3.624IleArg: 3.624 ± 0.514
6.665IleSer: 6.665 ± 1.068
5.695IleThr: 5.695 ± 0.611
4.53IleVal: 4.53 ± 0.508
1.294IleTrp: 1.294 ± 0.538
2.2IleTyr: 2.2 ± 0.424
0.0IleXaa: 0.0 ± 0.0
Lys
4.142LysAla: 4.142 ± 0.501
0.647LysCys: 0.647 ± 0.242
3.818LysAsp: 3.818 ± 0.514
5.695LysGlu: 5.695 ± 0.819
3.365LysPhe: 3.365 ± 0.46
3.753LysGly: 3.753 ± 0.421
2.006LysHis: 2.006 ± 0.46
5.695LysIle: 5.695 ± 0.641
6.536LysLys: 6.536 ± 0.952
6.342LysLeu: 6.342 ± 0.793
2.265LysMet: 2.265 ± 0.541
2.653LysAsn: 2.653 ± 0.435
1.747LysPro: 1.747 ± 0.408
2.977LysGln: 2.977 ± 0.402
3.43LysArg: 3.43 ± 0.629
5.177LysSer: 5.177 ± 0.685
3.947LysThr: 3.947 ± 0.389
3.171LysVal: 3.171 ± 0.372
0.647LysTrp: 0.647 ± 0.243
2.653LysTyr: 2.653 ± 0.477
0.0LysXaa: 0.0 ± 0.0
Leu
5.63LeuAla: 5.63 ± 0.593
0.388LeuCys: 0.388 ± 0.175
4.983LeuAsp: 4.983 ± 0.583
7.312LeuGlu: 7.312 ± 0.622
3.171LeuPhe: 3.171 ± 0.38
4.983LeuGly: 4.983 ± 0.395
1.294LeuHis: 1.294 ± 0.278
4.853LeuIle: 4.853 ± 0.568
5.824LeuLys: 5.824 ± 0.826
6.342LeuLeu: 6.342 ± 0.704
2.588LeuMet: 2.588 ± 0.333
3.689LeuAsn: 3.689 ± 0.639
1.553LeuPro: 1.553 ± 0.292
3.559LeuGln: 3.559 ± 0.403
3.624LeuArg: 3.624 ± 0.433
5.695LeuSer: 5.695 ± 0.692
4.4LeuThr: 4.4 ± 0.484
4.465LeuVal: 4.465 ± 0.691
0.906LeuTrp: 0.906 ± 0.208
2.2LeuTyr: 2.2 ± 0.353
0.0LeuXaa: 0.0 ± 0.0
Met
2.394MetAla: 2.394 ± 0.567
0.065MetCys: 0.065 ± 0.063
2.136MetAsp: 2.136 ± 0.406
2.265MetGlu: 2.265 ± 0.362
0.841MetPhe: 0.841 ± 0.176
2.847MetGly: 2.847 ± 0.54
0.453MetHis: 0.453 ± 0.155
2.524MetIle: 2.524 ± 0.492
2.653MetLys: 2.653 ± 0.507
1.747MetLeu: 1.747 ± 0.469
0.518MetMet: 0.518 ± 0.177
1.488MetAsn: 1.488 ± 0.254
0.777MetPro: 0.777 ± 0.271
0.518MetGln: 0.518 ± 0.197
1.618MetArg: 1.618 ± 0.32
2.394MetSer: 2.394 ± 0.355
2.265MetThr: 2.265 ± 0.348
1.359MetVal: 1.359 ± 0.407
0.259MetTrp: 0.259 ± 0.107
0.582MetTyr: 0.582 ± 0.204
0.0MetXaa: 0.0 ± 0.0
Asn
3.883AsnAla: 3.883 ± 0.545
0.259AsnCys: 0.259 ± 0.14
3.624AsnAsp: 3.624 ± 0.637
2.653AsnGlu: 2.653 ± 0.374
2.006AsnPhe: 2.006 ± 0.423
4.789AsnGly: 4.789 ± 0.538
0.841AsnHis: 0.841 ± 0.259
4.142AsnIle: 4.142 ± 0.482
3.883AsnLys: 3.883 ± 0.506
3.106AsnLeu: 3.106 ± 0.388
2.136AsnMet: 2.136 ± 0.37
3.559AsnAsn: 3.559 ± 0.501
2.136AsnPro: 2.136 ± 0.437
2.33AsnGln: 2.33 ± 0.489
2.977AsnArg: 2.977 ± 0.593
3.106AsnSer: 3.106 ± 0.402
2.783AsnThr: 2.783 ± 0.451
3.041AsnVal: 3.041 ± 0.366
0.906AsnTrp: 0.906 ± 0.26
1.683AsnTyr: 1.683 ± 0.358
0.0AsnXaa: 0.0 ± 0.0
Pro
1.747ProAla: 1.747 ± 0.368
0.065ProCys: 0.065 ± 0.073
1.812ProAsp: 1.812 ± 0.401
1.812ProGlu: 1.812 ± 0.328
0.777ProPhe: 0.777 ± 0.237
0.0ProGly: 0.0 ± 0.0
0.647ProHis: 0.647 ± 0.237
3.041ProIle: 3.041 ± 0.442
1.359ProLys: 1.359 ± 0.346
1.941ProLeu: 1.941 ± 0.335
0.582ProMet: 0.582 ± 0.227
1.812ProAsn: 1.812 ± 0.372
2.006ProPro: 2.006 ± 0.663
0.971ProGln: 0.971 ± 0.224
1.359ProArg: 1.359 ± 0.273
2.783ProSer: 2.783 ± 0.439
2.524ProThr: 2.524 ± 0.494
1.618ProVal: 1.618 ± 0.394
0.324ProTrp: 0.324 ± 0.147
1.23ProTyr: 1.23 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
1.941GlnAla: 1.941 ± 0.43
0.259GlnCys: 0.259 ± 0.106
2.071GlnAsp: 2.071 ± 0.382
2.524GlnGlu: 2.524 ± 0.539
1.359GlnPhe: 1.359 ± 0.267
1.812GlnGly: 1.812 ± 0.324
0.582GlnHis: 0.582 ± 0.181
3.236GlnIle: 3.236 ± 0.414
2.847GlnLys: 2.847 ± 0.417
3.494GlnLeu: 3.494 ± 0.52
1.359GlnMet: 1.359 ± 0.272
1.359GlnAsn: 1.359 ± 0.278
1.035GlnPro: 1.035 ± 0.336
1.553GlnGln: 1.553 ± 0.301
2.006GlnArg: 2.006 ± 0.419
2.977GlnSer: 2.977 ± 0.361
1.877GlnThr: 1.877 ± 0.367
2.783GlnVal: 2.783 ± 0.417
0.388GlnTrp: 0.388 ± 0.169
1.683GlnTyr: 1.683 ± 0.307
0.0GlnXaa: 0.0 ± 0.0
Arg
2.653ArgAla: 2.653 ± 0.36
0.194ArgCys: 0.194 ± 0.135
3.041ArgAsp: 3.041 ± 0.453
4.336ArgGlu: 4.336 ± 0.664
1.618ArgPhe: 1.618 ± 0.326
3.106ArgGly: 3.106 ± 0.435
1.294ArgHis: 1.294 ± 0.375
4.4ArgIle: 4.4 ± 0.683
3.43ArgLys: 3.43 ± 0.731
4.4ArgLeu: 4.4 ± 0.631
2.265ArgMet: 2.265 ± 0.377
2.071ArgAsn: 2.071 ± 0.319
0.777ArgPro: 0.777 ± 0.191
1.294ArgGln: 1.294 ± 0.32
2.783ArgArg: 2.783 ± 0.553
3.3ArgSer: 3.3 ± 0.56
2.783ArgThr: 2.783 ± 0.433
3.883ArgVal: 3.883 ± 0.529
1.035ArgTrp: 1.035 ± 0.242
1.488ArgTyr: 1.488 ± 0.321
0.0ArgXaa: 0.0 ± 0.0
Ser
4.659SerAla: 4.659 ± 0.708
0.324SerCys: 0.324 ± 0.173
4.465SerAsp: 4.465 ± 0.515
3.818SerGlu: 3.818 ± 0.556
1.941SerPhe: 1.941 ± 0.331
5.048SerGly: 5.048 ± 0.698
1.683SerHis: 1.683 ± 0.333
5.759SerIle: 5.759 ± 0.602
3.883SerLys: 3.883 ± 0.546
5.759SerLeu: 5.759 ± 0.622
2.071SerMet: 2.071 ± 0.429
4.271SerAsn: 4.271 ± 0.535
2.33SerPro: 2.33 ± 0.35
3.43SerGln: 3.43 ± 0.444
3.818SerArg: 3.818 ± 0.519
5.63SerSer: 5.63 ± 1.117
4.4SerThr: 4.4 ± 0.786
5.177SerVal: 5.177 ± 0.621
0.906SerTrp: 0.906 ± 0.206
2.071SerTyr: 2.071 ± 0.478
0.0SerXaa: 0.0 ± 0.0
Thr
4.983ThrAla: 4.983 ± 0.574
0.388ThrCys: 0.388 ± 0.181
3.106ThrAsp: 3.106 ± 0.388
3.365ThrGlu: 3.365 ± 0.402
1.683ThrPhe: 1.683 ± 0.303
4.595ThrGly: 4.595 ± 0.554
1.165ThrHis: 1.165 ± 0.324
6.342ThrIle: 6.342 ± 0.704
2.847ThrLys: 2.847 ± 0.436
4.789ThrLeu: 4.789 ± 0.603
1.165ThrMet: 1.165 ± 0.215
3.171ThrAsn: 3.171 ± 0.501
3.236ThrPro: 3.236 ± 0.588
1.683ThrGln: 1.683 ± 0.319
2.136ThrArg: 2.136 ± 0.321
4.918ThrSer: 4.918 ± 0.827
5.177ThrThr: 5.177 ± 0.807
4.53ThrVal: 4.53 ± 0.609
0.777ThrTrp: 0.777 ± 0.189
2.33ThrTyr: 2.33 ± 0.471
0.0ThrXaa: 0.0 ± 0.0
Val
4.142ValAla: 4.142 ± 0.501
0.453ValCys: 0.453 ± 0.179
5.177ValAsp: 5.177 ± 0.624
4.271ValGlu: 4.271 ± 0.488
1.683ValPhe: 1.683 ± 0.427
3.365ValGly: 3.365 ± 0.617
1.424ValHis: 1.424 ± 0.302
5.436ValIle: 5.436 ± 0.576
3.624ValLys: 3.624 ± 0.574
4.4ValLeu: 4.4 ± 0.5
1.488ValMet: 1.488 ± 0.242
2.588ValAsn: 2.588 ± 0.413
1.294ValPro: 1.294 ± 0.323
2.977ValGln: 2.977 ± 0.392
2.459ValArg: 2.459 ± 0.35
4.659ValSer: 4.659 ± 0.579
4.142ValThr: 4.142 ± 0.691
4.53ValVal: 4.53 ± 0.618
1.165ValTrp: 1.165 ± 0.367
2.588ValTyr: 2.588 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
0.647TrpAla: 0.647 ± 0.212
0.065TrpCys: 0.065 ± 0.07
1.165TrpAsp: 1.165 ± 0.301
0.712TrpGlu: 0.712 ± 0.204
0.388TrpPhe: 0.388 ± 0.201
0.906TrpGly: 0.906 ± 0.276
0.194TrpHis: 0.194 ± 0.093
0.841TrpIle: 0.841 ± 0.359
1.1TrpLys: 1.1 ± 0.34
1.1TrpLeu: 1.1 ± 0.258
0.194TrpMet: 0.194 ± 0.09
1.1TrpAsn: 1.1 ± 0.336
0.0TrpPro: 0.0 ± 0.0
0.453TrpGln: 0.453 ± 0.16
0.388TrpArg: 0.388 ± 0.212
0.971TrpSer: 0.971 ± 0.396
1.294TrpThr: 1.294 ± 0.36
0.971TrpVal: 0.971 ± 0.242
0.324TrpTrp: 0.324 ± 0.169
0.647TrpTyr: 0.647 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.812TyrAla: 1.812 ± 0.41
0.324TyrCys: 0.324 ± 0.159
2.394TyrAsp: 2.394 ± 0.391
2.33TyrGlu: 2.33 ± 0.555
2.136TyrPhe: 2.136 ± 0.39
1.941TyrGly: 1.941 ± 0.37
0.712TyrHis: 0.712 ± 0.206
1.747TyrIle: 1.747 ± 0.331
2.718TyrLys: 2.718 ± 0.464
2.847TyrLeu: 2.847 ± 0.352
1.035TyrMet: 1.035 ± 0.279
2.653TyrAsn: 2.653 ± 0.409
1.424TyrPro: 1.424 ± 0.317
0.971TyrGln: 0.971 ± 0.314
2.33TyrArg: 2.33 ± 0.397
2.459TyrSer: 2.459 ± 0.331
1.683TyrThr: 1.683 ± 0.303
1.812TyrVal: 1.812 ± 0.401
0.518TyrTrp: 0.518 ± 0.164
1.1TyrTyr: 1.1 ± 0.26
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (15454 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski