Amino acid dipeptide frequency for Microbacterium phage McGalleon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
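
As a rough illustration of the idea above, the following Python sketch counts dipeptides and compares them with the uniform expectation of 1000/400 = 2.5‰. It is only a sketch under assumptions: the function names are hypothetical, sequences use one-letter codes (the table uses three-letter codes), and dipeptides are counted within each protein without spanning protein boundaries. The exact counting convention used by the database is not stated on this page.

```python
from collections import Counter

# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping dipeptides are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"

# Toy input (not real phage proteins): dipeptides MA, AA, AK, AA, AL.
toy_freqs = dipeptide_per_mille(["MAAK", "AAL"])
```

With the values from the table, `classify(10.507)` (AlaAla) gives "over" and `classify(0.75)` (AlaCys) gives "under".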

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
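
The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the fold value is simply the ratio to the uniform 2.5‰ expectation, which is an illustrative measure rather than one reported by the database.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def fold_vs_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation (1000/n per mille)."""
    return value_per_mille / (1000.0 / n_dipeptides)

# AlaAla from the table: 10.507 per mille
ala_ala_fraction = per_mille_to_fraction(10.507)  # about 0.010507, i.e. about 1.05%
ala_ala_fold = fold_vs_uniform(10.507)            # about 4.2 times the uniform expectation
```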

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.507 ± 1.198
AlaCys: 0.75 ± 0.261
AlaAsp: 5.779 ± 0.628
AlaGlu: 6.004 ± 0.673
AlaPhe: 2.702 ± 0.471
AlaGly: 7.505 ± 1.232
AlaHis: 1.876 ± 0.302
AlaIle: 6.454 ± 0.895
AlaLys: 5.553 ± 0.731
AlaLeu: 9.456 ± 1.003
AlaMet: 1.951 ± 0.409
AlaAsn: 2.777 ± 0.45
AlaPro: 4.503 ± 0.674
AlaGln: 3.977 ± 0.524
AlaArg: 4.878 ± 0.654
AlaSer: 5.403 ± 0.75
AlaThr: 6.829 ± 0.807
AlaVal: 6.829 ± 0.735
AlaTrp: 2.326 ± 0.497
AlaTyr: 2.552 ± 0.514
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.525 ± 0.206
CysCys: 0.0 ± 0.0
CysAsp: 0.375 ± 0.14
CysGlu: 0.3 ± 0.175
CysPhe: 0.075 ± 0.067
CysGly: 0.525 ± 0.172
CysHis: 0.15 ± 0.099
CysIle: 0.075 ± 0.074
CysLys: 0.45 ± 0.167
CysLeu: 0.45 ± 0.191
CysMet: 0.075 ± 0.082
CysAsn: 0.375 ± 0.161
CysPro: 0.375 ± 0.17
CysGln: 0.075 ± 0.073
CysArg: 0.525 ± 0.174
CysSer: 0.375 ± 0.14
CysThr: 0.6 ± 0.205
CysVal: 0.375 ± 0.176
CysTrp: 0.15 ± 0.094
CysTyr: 0.3 ± 0.157
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.854 ± 0.742
AspCys: 0.826 ± 0.255
AspAsp: 4.053 ± 0.674
AspGlu: 6.079 ± 1.46
AspPhe: 2.326 ± 0.392
AspGly: 4.278 ± 0.563
AspHis: 1.201 ± 0.294
AspIle: 3.602 ± 0.438
AspLys: 2.627 ± 0.509
AspLeu: 5.103 ± 0.755
AspMet: 1.201 ± 0.278
AspAsn: 2.176 ± 0.486
AspPro: 4.203 ± 0.646
AspGln: 2.176 ± 0.366
AspArg: 4.503 ± 0.626
AspSer: 3.152 ± 0.427
AspThr: 3.302 ± 0.611
AspVal: 4.503 ± 0.588
AspTrp: 1.576 ± 0.332
AspTyr: 2.477 ± 0.423
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.679 ± 0.833
GluCys: 0.225 ± 0.159
GluAsp: 5.103 ± 1.347
GluGlu: 6.004 ± 1.464
GluPhe: 1.951 ± 0.382
GluGly: 5.028 ± 0.783
GluHis: 1.051 ± 0.238
GluIle: 2.627 ± 0.424
GluLys: 2.852 ± 0.535
GluLeu: 5.253 ± 0.646
GluMet: 2.101 ± 0.408
GluAsn: 2.026 ± 0.464
GluPro: 2.251 ± 0.472
GluGln: 3.152 ± 0.506
GluArg: 3.902 ± 0.712
GluSer: 2.477 ± 0.433
GluThr: 3.902 ± 0.492
GluVal: 4.728 ± 0.71
GluTrp: 1.351 ± 0.306
GluTyr: 2.101 ± 0.325
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.627 ± 0.444
PheCys: 0.075 ± 0.07
PheAsp: 2.477 ± 0.461
PheGlu: 2.026 ± 0.386
PhePhe: 0.75 ± 0.229
PheGly: 2.927 ± 0.437
PheHis: 0.525 ± 0.185
PheIle: 1.426 ± 0.359
PheLys: 1.501 ± 0.326
PheLeu: 1.951 ± 0.474
PheMet: 1.126 ± 0.387
PheAsn: 0.901 ± 0.248
PhePro: 0.976 ± 0.244
PheGln: 1.576 ± 0.361
PheArg: 2.702 ± 0.515
PheSer: 2.326 ± 0.375
PheThr: 1.951 ± 0.404
PheVal: 1.726 ± 0.417
PheTrp: 0.675 ± 0.242
PheTyr: 0.675 ± 0.255
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.379 ± 0.842
GlyCys: 0.675 ± 0.243
GlyAsp: 3.302 ± 0.407
GlyGlu: 4.353 ± 0.583
GlyPhe: 3.302 ± 0.366
GlyGly: 5.779 ± 0.824
GlyHis: 0.976 ± 0.253
GlyIle: 5.103 ± 0.832
GlyLys: 4.653 ± 0.688
GlyLeu: 6.829 ± 1.031
GlyMet: 2.026 ± 0.284
GlyAsn: 2.402 ± 0.453
GlyPro: 3.077 ± 0.531
GlyGln: 3.527 ± 0.655
GlyArg: 4.953 ± 0.66
GlySer: 5.178 ± 0.712
GlyThr: 6.604 ± 0.901
GlyVal: 5.779 ± 0.758
GlyTrp: 1.051 ± 0.231
GlyTyr: 2.477 ± 0.568
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.351 ± 0.346
HisCys: 0.0 ± 0.0
HisAsp: 0.826 ± 0.232
HisGlu: 1.351 ± 0.344
HisPhe: 0.6 ± 0.222
HisGly: 1.951 ± 0.403
HisHis: 0.45 ± 0.197
HisIle: 0.75 ± 0.264
HisLys: 1.351 ± 0.41
HisLeu: 1.201 ± 0.267
HisMet: 0.525 ± 0.232
HisAsn: 0.675 ± 0.232
HisPro: 0.901 ± 0.231
HisGln: 0.826 ± 0.227
HisArg: 0.375 ± 0.156
HisSer: 1.201 ± 0.258
HisThr: 0.826 ± 0.219
HisVal: 1.576 ± 0.337
HisTrp: 0.375 ± 0.144
HisTyr: 0.75 ± 0.2
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.578 ± 0.453
IleCys: 0.45 ± 0.164
IleAsp: 5.028 ± 0.512
IleGlu: 3.827 ± 0.51
IlePhe: 0.6 ± 0.198
IleGly: 3.977 ± 0.841
IleHis: 0.976 ± 0.28
IleIle: 3.077 ± 0.774
IleLys: 2.777 ± 0.619
IleLeu: 2.852 ± 0.509
IleMet: 1.126 ± 0.251
IleAsn: 2.627 ± 0.595
IlePro: 3.227 ± 0.661
IleGln: 2.777 ± 0.534
IleArg: 2.326 ± 0.469
IleSer: 3.077 ± 0.425
IleThr: 3.452 ± 0.896
IleVal: 3.302 ± 0.591
IleTrp: 0.75 ± 0.218
IleTyr: 1.501 ± 0.341
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.878 ± 0.733
LysCys: 0.15 ± 0.112
LysAsp: 3.152 ± 0.496
LysGlu: 2.852 ± 0.478
LysPhe: 1.126 ± 0.328
LysGly: 3.752 ± 0.57
LysHis: 0.901 ± 0.233
LysIle: 2.026 ± 0.426
LysLys: 2.777 ± 0.528
LysLeu: 4.353 ± 0.571
LysMet: 0.826 ± 0.278
LysAsn: 1.651 ± 0.388
LysPro: 3.752 ± 0.822
LysGln: 1.951 ± 0.474
LysArg: 2.927 ± 0.618
LysSer: 2.326 ± 0.428
LysThr: 2.176 ± 0.352
LysVal: 4.128 ± 0.687
LysTrp: 0.976 ± 0.301
LysTyr: 0.901 ± 0.259
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.606 ± 0.828
LeuCys: 0.6 ± 0.188
LeuAsp: 5.478 ± 0.547
LeuGlu: 5.028 ± 0.706
LeuPhe: 2.026 ± 0.337
LeuGly: 6.529 ± 0.606
LeuHis: 1.426 ± 0.348
LeuIle: 5.328 ± 1.219
LeuLys: 4.128 ± 0.663
LeuLeu: 7.805 ± 0.93
LeuMet: 2.402 ± 0.432
LeuAsn: 3.077 ± 0.415
LeuPro: 3.752 ± 0.495
LeuGln: 2.627 ± 0.413
LeuArg: 5.553 ± 0.677
LeuSer: 5.178 ± 0.535
LeuThr: 4.803 ± 0.671
LeuVal: 6.529 ± 0.866
LeuTrp: 1.651 ± 0.312
LeuTyr: 1.876 ± 0.315
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.927 ± 0.497
MetCys: 0.375 ± 0.163
MetAsp: 1.501 ± 0.372
MetGlu: 1.051 ± 0.247
MetPhe: 1.051 ± 0.253
MetGly: 1.726 ± 0.378
MetHis: 0.225 ± 0.124
MetIle: 0.75 ± 0.242
MetLys: 0.375 ± 0.181
MetLeu: 2.176 ± 0.395
MetMet: 0.3 ± 0.169
MetAsn: 0.976 ± 0.272
MetPro: 1.351 ± 0.311
MetGln: 0.675 ± 0.227
MetArg: 0.75 ± 0.229
MetSer: 2.026 ± 0.355
MetThr: 2.101 ± 0.351
MetVal: 1.201 ± 0.263
MetTrp: 0.225 ± 0.142
MetTyr: 0.375 ± 0.163
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.602 ± 0.832
AsnCys: 0.075 ± 0.086
AsnAsp: 2.101 ± 0.353
AsnGlu: 1.951 ± 0.504
AsnPhe: 0.901 ± 0.218
AsnGly: 3.227 ± 0.402
AsnHis: 0.6 ± 0.196
AsnIle: 2.176 ± 0.601
AsnLys: 1.576 ± 0.346
AsnLeu: 2.702 ± 0.498
AsnMet: 0.375 ± 0.164
AsnAsn: 1.501 ± 0.453
AsnPro: 1.426 ± 0.315
AsnGln: 1.426 ± 0.305
AsnArg: 1.651 ± 0.359
AsnSer: 1.951 ± 0.418
AsnThr: 1.801 ± 0.387
AsnVal: 2.176 ± 0.359
AsnTrp: 0.826 ± 0.253
AsnTyr: 1.351 ± 0.356
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.929 ± 0.811
ProCys: 0.15 ± 0.102
ProAsp: 3.077 ± 0.506
ProGlu: 4.053 ± 0.87
ProPhe: 1.726 ± 0.36
ProGly: 3.452 ± 0.627
ProHis: 0.45 ± 0.17
ProIle: 1.876 ± 0.27
ProLys: 2.026 ± 0.295
ProLeu: 3.827 ± 0.454
ProMet: 0.75 ± 0.231
ProAsn: 1.201 ± 0.285
ProPro: 1.351 ± 0.37
ProGln: 2.627 ± 0.425
ProArg: 2.026 ± 0.469
ProSer: 3.527 ± 0.452
ProThr: 3.827 ± 0.595
ProVal: 4.878 ± 0.552
ProTrp: 0.826 ± 0.242
ProTyr: 1.126 ± 0.227
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.953 ± 0.591
GlnCys: 0.15 ± 0.093
GlnAsp: 1.876 ± 0.282
GlnGlu: 3.002 ± 0.494
GlnPhe: 0.75 ± 0.216
GlnGly: 2.702 ± 0.493
GlnHis: 1.051 ± 0.375
GlnIle: 1.501 ± 0.375
GlnLys: 1.576 ± 0.331
GlnLeu: 3.902 ± 0.624
GlnMet: 0.6 ± 0.237
GlnAsn: 1.426 ± 0.401
GlnPro: 2.176 ± 0.536
GlnGln: 2.627 ± 0.557
GlnArg: 2.627 ± 0.445
GlnSer: 1.726 ± 0.293
GlnThr: 2.326 ± 0.405
GlnVal: 3.077 ± 0.517
GlnTrp: 0.901 ± 0.244
GlnTyr: 1.651 ± 0.362
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.328 ± 0.698
ArgCys: 0.45 ± 0.172
ArgAsp: 3.977 ± 0.434
ArgGlu: 3.077 ± 0.481
ArgPhe: 2.251 ± 0.441
ArgGly: 3.227 ± 0.449
ArgHis: 0.75 ± 0.238
ArgIle: 3.302 ± 0.408
ArgLys: 3.677 ± 0.747
ArgLeu: 5.854 ± 0.676
ArgMet: 1.801 ± 0.341
ArgAsn: 2.176 ± 0.446
ArgPro: 3.077 ± 0.599
ArgGln: 1.801 ± 0.318
ArgArg: 3.302 ± 0.581
ArgSer: 3.452 ± 0.492
ArgThr: 3.152 ± 0.627
ArgVal: 4.428 ± 0.755
ArgTrp: 0.75 ± 0.243
ArgTyr: 2.026 ± 0.332
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.779 ± 0.615
SerCys: 0.075 ± 0.073
SerAsp: 4.053 ± 0.46
SerGlu: 2.402 ± 0.5
SerPhe: 2.026 ± 0.357
SerGly: 5.028 ± 0.663
SerHis: 1.351 ± 0.404
SerIle: 3.227 ± 0.599
SerLys: 2.777 ± 0.483
SerLeu: 5.629 ± 0.495
SerMet: 1.351 ± 0.441
SerAsn: 1.876 ± 0.364
SerPro: 2.402 ± 0.386
SerGln: 2.176 ± 0.363
SerArg: 3.377 ± 0.513
SerSer: 3.827 ± 0.597
SerThr: 4.053 ± 0.711
SerVal: 4.578 ± 0.596
SerTrp: 1.651 ± 0.395
SerTyr: 1.651 ± 0.345
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.304 ± 1.11
ThrCys: 0.225 ± 0.142
ThrAsp: 3.977 ± 0.534
ThrGlu: 3.677 ± 0.53
ThrPhe: 3.302 ± 0.526
ThrGly: 6.079 ± 0.705
ThrHis: 0.901 ± 0.301
ThrIle: 2.402 ± 0.415
ThrLys: 1.876 ± 0.356
ThrLeu: 5.779 ± 0.697
ThrMet: 1.351 ± 0.337
ThrAsn: 1.501 ± 0.317
ThrPro: 3.602 ± 0.48
ThrGln: 2.402 ± 0.421
ThrArg: 3.902 ± 0.494
ThrSer: 3.977 ± 0.544
ThrThr: 4.203 ± 0.572
ThrVal: 5.253 ± 0.701
ThrTrp: 1.426 ± 0.281
ThrTyr: 2.326 ± 0.571
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.655 ± 0.726
ValCys: 0.3 ± 0.138
ValAsp: 4.653 ± 0.69
ValGlu: 4.728 ± 0.622
ValPhe: 1.801 ± 0.348
ValGly: 6.004 ± 0.778
ValHis: 1.801 ± 0.346
ValIle: 3.827 ± 0.621
ValLys: 3.002 ± 0.544
ValLeu: 5.929 ± 0.553
ValMet: 1.576 ± 0.463
ValAsn: 2.251 ± 0.362
ValPro: 3.902 ± 0.548
ValGln: 2.627 ± 0.519
ValArg: 4.953 ± 0.681
ValSer: 4.578 ± 0.574
ValThr: 5.328 ± 0.66
ValVal: 5.403 ± 0.772
ValTrp: 2.176 ± 0.491
ValTyr: 2.251 ± 0.351
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.126 ± 0.294
TrpCys: 0.15 ± 0.113
TrpAsp: 2.026 ± 0.424
TrpGlu: 1.051 ± 0.282
TrpPhe: 0.901 ± 0.236
TrpGly: 1.576 ± 0.442
TrpHis: 0.6 ± 0.194
TrpIle: 1.126 ± 0.295
TrpLys: 0.675 ± 0.204
TrpLeu: 2.326 ± 0.334
TrpMet: 0.15 ± 0.113
TrpAsn: 0.6 ± 0.233
TrpPro: 0.826 ± 0.244
TrpGln: 0.75 ± 0.273
TrpArg: 1.126 ± 0.302
TrpSer: 1.426 ± 0.307
TrpThr: 1.426 ± 0.37
TrpVal: 1.576 ± 0.363
TrpTrp: 0.75 ± 0.254
TrpTyr: 0.901 ± 0.256
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.101 ± 0.373
TyrCys: 0.45 ± 0.155
TyrAsp: 2.402 ± 0.462
TyrGlu: 1.951 ± 0.388
TyrPhe: 0.75 ± 0.244
TyrGly: 3.152 ± 0.461
TyrHis: 0.6 ± 0.193
TyrIle: 1.651 ± 0.341
TyrLys: 1.351 ± 0.321
TyrLeu: 2.026 ± 0.334
TyrMet: 0.45 ± 0.217
TyrAsn: 1.201 ± 0.371
TyrPro: 1.576 ± 0.437
TyrGln: 0.826 ± 0.197
TyrArg: 1.501 ± 0.414
TyrSer: 2.101 ± 0.361
TyrThr: 1.801 ± 0.413
TyrVal: 2.702 ± 0.649
TyrTrp: 0.675 ± 0.262
TyrTyr: 1.201 ± 0.341
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13326 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
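
A protein-level bootstrap of this kind could look roughly like the Python sketch below. This is an illustration under assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation over replicates is taken as the error. The function names, the fixed seed, and the use of one-letter sequences are all hypothetical choices.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille), counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    return statistics.mean(values), statistics.stdev(values)

# Toy check: identical proteins give the same frequency in every replicate,
# so the estimated error is zero.
mean_aa, err_aa = bootstrap_error(["AAL"] * 5, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the statistic depends on which proteins happen to be in the proteome.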

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski