Amino acid dipepetide frequency for Microbacterium phage Nebulous

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.648AlaAla: 11.648 ± 1.192
0.597AlaCys: 0.597 ± 0.228
5.675AlaAsp: 5.675 ± 0.715
5.525AlaGlu: 5.525 ± 0.806
2.987AlaPhe: 2.987 ± 0.412
8.885AlaGly: 8.885 ± 0.965
1.643AlaHis: 1.643 ± 0.457
5.525AlaIle: 5.525 ± 0.876
5.749AlaLys: 5.749 ± 0.602
9.931AlaLeu: 9.931 ± 1.084
2.539AlaMet: 2.539 ± 0.455
3.36AlaAsn: 3.36 ± 0.647
4.779AlaPro: 4.779 ± 0.745
5.227AlaGln: 5.227 ± 0.69
6.421AlaArg: 6.421 ± 0.859
5.749AlaSer: 5.749 ± 0.632
7.392AlaThr: 7.392 ± 0.828
6.944AlaVal: 6.944 ± 0.812
1.867AlaTrp: 1.867 ± 0.427
2.763AlaTyr: 2.763 ± 0.447
0.0AlaXaa: 0.0 ± 0.0
Cys
0.523CysAla: 0.523 ± 0.179
0.075CysCys: 0.075 ± 0.076
0.672CysAsp: 0.672 ± 0.264
0.224CysGlu: 0.224 ± 0.136
0.149CysPhe: 0.149 ± 0.12
0.597CysGly: 0.597 ± 0.189
0.149CysHis: 0.149 ± 0.114
0.149CysIle: 0.149 ± 0.104
0.448CysLys: 0.448 ± 0.158
0.299CysLeu: 0.299 ± 0.138
0.0CysMet: 0.0 ± 0.0
0.075CysAsn: 0.075 ± 0.075
0.672CysPro: 0.672 ± 0.225
0.373CysGln: 0.373 ± 0.208
0.672CysArg: 0.672 ± 0.226
0.075CysSer: 0.075 ± 0.075
0.448CysThr: 0.448 ± 0.182
0.373CysVal: 0.373 ± 0.142
0.075CysTrp: 0.075 ± 0.071
0.224CysTyr: 0.224 ± 0.13
0.0CysXaa: 0.0 ± 0.0
Asp
6.421AspAla: 6.421 ± 0.805
0.448AspCys: 0.448 ± 0.233
5.376AspAsp: 5.376 ± 0.981
5.227AspGlu: 5.227 ± 1.497
2.464AspPhe: 2.464 ± 0.405
5.525AspGly: 5.525 ± 0.813
1.195AspHis: 1.195 ± 0.317
2.763AspIle: 2.763 ± 0.446
2.315AspLys: 2.315 ± 0.384
5.675AspLeu: 5.675 ± 0.642
1.493AspMet: 1.493 ± 0.282
1.493AspAsn: 1.493 ± 0.358
3.285AspPro: 3.285 ± 0.493
2.613AspGln: 2.613 ± 0.404
4.555AspArg: 4.555 ± 0.587
2.613AspSer: 2.613 ± 0.45
3.883AspThr: 3.883 ± 0.552
3.808AspVal: 3.808 ± 0.575
1.419AspTrp: 1.419 ± 0.338
2.24AspTyr: 2.24 ± 0.391
0.0AspXaa: 0.0 ± 0.0
Glu
7.019GluAla: 7.019 ± 0.957
0.373GluCys: 0.373 ± 0.163
5.376GluAsp: 5.376 ± 1.313
5.301GluGlu: 5.301 ± 1.078
1.792GluPhe: 1.792 ± 0.353
4.256GluGly: 4.256 ± 0.631
1.045GluHis: 1.045 ± 0.227
1.941GluIle: 1.941 ± 0.4
1.941GluLys: 1.941 ± 0.331
6.496GluLeu: 6.496 ± 0.629
1.419GluMet: 1.419 ± 0.296
1.643GluAsn: 1.643 ± 0.317
2.315GluPro: 2.315 ± 0.404
2.688GluGln: 2.688 ± 0.49
3.584GluArg: 3.584 ± 0.556
2.165GluSer: 2.165 ± 0.371
3.285GluThr: 3.285 ± 0.474
5.152GluVal: 5.152 ± 0.696
1.045GluTrp: 1.045 ± 0.31
1.867GluTyr: 1.867 ± 0.308
0.0GluXaa: 0.0 ± 0.0
Phe
2.389PheAla: 2.389 ± 0.349
0.448PheCys: 0.448 ± 0.201
1.568PheAsp: 1.568 ± 0.337
1.568PheGlu: 1.568 ± 0.365
0.373PhePhe: 0.373 ± 0.152
2.837PheGly: 2.837 ± 0.586
0.672PheHis: 0.672 ± 0.268
1.344PheIle: 1.344 ± 0.27
1.344PheLys: 1.344 ± 0.267
2.315PheLeu: 2.315 ± 0.286
0.971PheMet: 0.971 ± 0.238
1.195PheAsn: 1.195 ± 0.374
1.717PhePro: 1.717 ± 0.344
1.269PheGln: 1.269 ± 0.346
2.091PheArg: 2.091 ± 0.36
2.165PheSer: 2.165 ± 0.405
2.464PheThr: 2.464 ± 0.468
1.867PheVal: 1.867 ± 0.316
0.523PheTrp: 0.523 ± 0.199
0.448PheTyr: 0.448 ± 0.196
0.0PheXaa: 0.0 ± 0.0
Gly
6.944GlyAla: 6.944 ± 1.358
0.523GlyCys: 0.523 ± 0.21
4.405GlyAsp: 4.405 ± 0.473
3.957GlyGlu: 3.957 ± 0.527
2.688GlyPhe: 2.688 ± 0.382
6.421GlyGly: 6.421 ± 1.136
1.493GlyHis: 1.493 ± 0.34
4.331GlyIle: 4.331 ± 0.81
4.928GlyLys: 4.928 ± 0.691
5.675GlyLeu: 5.675 ± 0.745
2.24GlyMet: 2.24 ± 0.405
2.315GlyAsn: 2.315 ± 0.49
4.181GlyPro: 4.181 ± 0.923
4.331GlyGln: 4.331 ± 0.67
4.853GlyArg: 4.853 ± 0.624
4.779GlySer: 4.779 ± 0.758
6.421GlyThr: 6.421 ± 0.608
5.227GlyVal: 5.227 ± 0.78
1.12GlyTrp: 1.12 ± 0.289
3.211GlyTyr: 3.211 ± 0.619
0.0GlyXaa: 0.0 ± 0.0
His
1.269HisAla: 1.269 ± 0.407
0.075HisCys: 0.075 ± 0.085
1.12HisAsp: 1.12 ± 0.357
1.419HisGlu: 1.419 ± 0.29
0.821HisPhe: 0.821 ± 0.306
1.643HisGly: 1.643 ± 0.353
0.299HisHis: 0.299 ± 0.215
0.821HisIle: 0.821 ± 0.277
1.195HisLys: 1.195 ± 0.27
1.344HisLeu: 1.344 ± 0.456
0.299HisMet: 0.299 ± 0.145
0.373HisAsn: 0.373 ± 0.182
1.195HisPro: 1.195 ± 0.32
0.672HisGln: 0.672 ± 0.26
0.821HisArg: 0.821 ± 0.251
0.597HisSer: 0.597 ± 0.179
0.896HisThr: 0.896 ± 0.276
1.344HisVal: 1.344 ± 0.276
0.299HisTrp: 0.299 ± 0.145
1.045HisTyr: 1.045 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
5.6IleAla: 5.6 ± 0.518
0.149IleCys: 0.149 ± 0.112
2.763IleAsp: 2.763 ± 0.417
2.464IleGlu: 2.464 ± 0.423
1.12IlePhe: 1.12 ± 0.328
3.733IleGly: 3.733 ± 0.765
0.747IleHis: 0.747 ± 0.193
3.136IleIle: 3.136 ± 0.643
1.941IleLys: 1.941 ± 0.422
3.509IleLeu: 3.509 ± 0.4
0.821IleMet: 0.821 ± 0.226
2.016IleAsn: 2.016 ± 0.345
2.987IlePro: 2.987 ± 0.584
2.987IleGln: 2.987 ± 0.844
2.763IleArg: 2.763 ± 0.504
2.464IleSer: 2.464 ± 0.492
4.405IleThr: 4.405 ± 0.793
3.285IleVal: 3.285 ± 0.698
0.672IleTrp: 0.672 ± 0.213
1.269IleTyr: 1.269 ± 0.275
0.0IleXaa: 0.0 ± 0.0
Lys
5.899LysAla: 5.899 ± 0.826
0.373LysCys: 0.373 ± 0.155
2.912LysAsp: 2.912 ± 0.48
2.539LysGlu: 2.539 ± 0.493
1.12LysPhe: 1.12 ± 0.269
3.659LysGly: 3.659 ± 0.468
0.971LysHis: 0.971 ± 0.269
1.493LysIle: 1.493 ± 0.315
1.867LysLys: 1.867 ± 0.498
3.957LysLeu: 3.957 ± 0.462
1.12LysMet: 1.12 ± 0.277
1.493LysAsn: 1.493 ± 0.334
4.032LysPro: 4.032 ± 0.672
1.867LysGln: 1.867 ± 0.427
3.061LysArg: 3.061 ± 0.667
1.792LysSer: 1.792 ± 0.333
3.36LysThr: 3.36 ± 0.547
4.405LysVal: 4.405 ± 0.583
1.045LysTrp: 1.045 ± 0.281
0.747LysTyr: 0.747 ± 0.239
0.0LysXaa: 0.0 ± 0.0
Leu
10.528LeuAla: 10.528 ± 0.711
0.448LeuCys: 0.448 ± 0.161
5.749LeuAsp: 5.749 ± 0.695
4.331LeuGlu: 4.331 ± 0.607
2.389LeuPhe: 2.389 ± 0.47
6.123LeuGly: 6.123 ± 0.72
1.493LeuHis: 1.493 ± 0.326
5.077LeuIle: 5.077 ± 1.11
4.555LeuLys: 4.555 ± 0.564
6.645LeuLeu: 6.645 ± 0.555
1.195LeuMet: 1.195 ± 0.317
3.808LeuAsn: 3.808 ± 0.476
5.077LeuPro: 5.077 ± 0.73
2.464LeuGln: 2.464 ± 0.396
4.928LeuArg: 4.928 ± 0.763
3.733LeuSer: 3.733 ± 0.474
5.824LeuThr: 5.824 ± 0.628
5.749LeuVal: 5.749 ± 0.761
1.344LeuTrp: 1.344 ± 0.267
1.941LeuTyr: 1.941 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
2.389MetAla: 2.389 ± 0.355
0.299MetCys: 0.299 ± 0.132
1.643MetAsp: 1.643 ± 0.349
1.568MetGlu: 1.568 ± 0.366
0.821MetPhe: 0.821 ± 0.267
2.016MetGly: 2.016 ± 0.401
0.224MetHis: 0.224 ± 0.139
0.896MetIle: 0.896 ± 0.319
1.12MetLys: 1.12 ± 0.3
2.016MetLeu: 2.016 ± 0.451
0.747MetMet: 0.747 ± 0.223
0.821MetAsn: 0.821 ± 0.239
1.493MetPro: 1.493 ± 0.338
0.597MetGln: 0.597 ± 0.233
0.672MetArg: 0.672 ± 0.203
1.717MetSer: 1.717 ± 0.342
1.867MetThr: 1.867 ± 0.342
1.717MetVal: 1.717 ± 0.313
0.373MetTrp: 0.373 ± 0.161
0.523MetTyr: 0.523 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
3.435AsnAla: 3.435 ± 0.565
0.075AsnCys: 0.075 ± 0.092
2.091AsnAsp: 2.091 ± 0.394
1.717AsnGlu: 1.717 ± 0.441
1.344AsnPhe: 1.344 ± 0.32
3.211AsnGly: 3.211 ± 0.662
0.672AsnHis: 0.672 ± 0.21
1.419AsnIle: 1.419 ± 0.292
1.344AsnLys: 1.344 ± 0.314
2.315AsnLeu: 2.315 ± 0.371
0.672AsnMet: 0.672 ± 0.21
1.269AsnAsn: 1.269 ± 0.47
1.941AsnPro: 1.941 ± 0.437
1.195AsnGln: 1.195 ± 0.267
1.493AsnArg: 1.493 ± 0.452
2.464AsnSer: 2.464 ± 0.462
2.464AsnThr: 2.464 ± 0.55
2.165AsnVal: 2.165 ± 0.465
0.747AsnTrp: 0.747 ± 0.222
1.568AsnTyr: 1.568 ± 0.363
0.0AsnXaa: 0.0 ± 0.0
Pro
6.496ProAla: 6.496 ± 0.759
0.0ProCys: 0.0 ± 0.0
3.957ProAsp: 3.957 ± 0.654
3.285ProGlu: 3.285 ± 0.563
1.269ProPhe: 1.269 ± 0.311
5.152ProGly: 5.152 ± 0.836
0.971ProHis: 0.971 ± 0.263
2.091ProIle: 2.091 ± 0.36
3.136ProLys: 3.136 ± 0.466
3.36ProLeu: 3.36 ± 0.463
1.12ProMet: 1.12 ± 0.328
1.717ProAsn: 1.717 ± 0.354
1.643ProPro: 1.643 ± 0.373
2.539ProGln: 2.539 ± 0.658
2.464ProArg: 2.464 ± 0.424
3.285ProSer: 3.285 ± 0.495
4.181ProThr: 4.181 ± 0.406
4.704ProVal: 4.704 ± 0.574
0.821ProTrp: 0.821 ± 0.206
1.419ProTyr: 1.419 ± 0.286
0.0ProXaa: 0.0 ± 0.0
Gln
5.227GlnAla: 5.227 ± 0.56
0.224GlnCys: 0.224 ± 0.113
2.389GlnAsp: 2.389 ± 0.363
3.136GlnGlu: 3.136 ± 0.559
1.045GlnPhe: 1.045 ± 0.38
3.285GlnGly: 3.285 ± 0.494
0.672GlnHis: 0.672 ± 0.277
2.165GlnIle: 2.165 ± 0.625
1.792GlnLys: 1.792 ± 0.407
3.957GlnLeu: 3.957 ± 0.606
0.971GlnMet: 0.971 ± 0.256
1.419GlnAsn: 1.419 ± 0.396
2.389GlnPro: 2.389 ± 0.354
2.24GlnGln: 2.24 ± 0.609
2.389GlnArg: 2.389 ± 0.434
2.016GlnSer: 2.016 ± 0.48
2.016GlnThr: 2.016 ± 0.37
3.509GlnVal: 3.509 ± 0.474
1.12GlnTrp: 1.12 ± 0.286
0.971GlnTyr: 0.971 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
5.003ArgAla: 5.003 ± 0.643
0.672ArgCys: 0.672 ± 0.244
3.285ArgAsp: 3.285 ± 0.57
3.509ArgGlu: 3.509 ± 0.538
1.941ArgPhe: 1.941 ± 0.391
4.032ArgGly: 4.032 ± 0.673
0.821ArgHis: 0.821 ± 0.287
2.688ArgIle: 2.688 ± 0.43
3.061ArgLys: 3.061 ± 0.529
5.376ArgLeu: 5.376 ± 0.772
2.24ArgMet: 2.24 ± 0.339
1.867ArgAsn: 1.867 ± 0.46
3.061ArgPro: 3.061 ± 0.603
2.24ArgGln: 2.24 ± 0.41
4.032ArgArg: 4.032 ± 0.792
3.136ArgSer: 3.136 ± 0.534
3.808ArgThr: 3.808 ± 0.565
5.376ArgVal: 5.376 ± 0.596
1.493ArgTrp: 1.493 ± 0.379
0.821ArgTyr: 0.821 ± 0.251
0.0ArgXaa: 0.0 ± 0.0
Ser
6.496SerAla: 6.496 ± 0.848
0.299SerCys: 0.299 ± 0.144
3.36SerAsp: 3.36 ± 0.491
2.389SerGlu: 2.389 ± 0.429
1.269SerPhe: 1.269 ± 0.278
3.36SerGly: 3.36 ± 0.517
0.896SerHis: 0.896 ± 0.28
2.837SerIle: 2.837 ± 0.392
2.315SerLys: 2.315 ± 0.372
4.779SerLeu: 4.779 ± 0.822
1.643SerMet: 1.643 ± 0.346
1.643SerAsn: 1.643 ± 0.499
2.613SerPro: 2.613 ± 0.369
2.389SerGln: 2.389 ± 0.418
2.24SerArg: 2.24 ± 0.429
3.136SerSer: 3.136 ± 0.523
3.584SerThr: 3.584 ± 0.48
3.808SerVal: 3.808 ± 0.633
1.12SerTrp: 1.12 ± 0.355
2.389SerTyr: 2.389 ± 0.47
0.0SerXaa: 0.0 ± 0.0
Thr
6.197ThrAla: 6.197 ± 0.832
0.224ThrCys: 0.224 ± 0.111
3.061ThrAsp: 3.061 ± 0.484
4.181ThrGlu: 4.181 ± 0.604
2.837ThrPhe: 2.837 ± 0.722
6.123ThrGly: 6.123 ± 0.56
1.344ThrHis: 1.344 ± 0.356
3.061ThrIle: 3.061 ± 0.693
2.763ThrLys: 2.763 ± 0.489
5.899ThrLeu: 5.899 ± 0.55
1.493ThrMet: 1.493 ± 0.357
2.165ThrAsn: 2.165 ± 0.383
3.509ThrPro: 3.509 ± 0.507
2.389ThrGln: 2.389 ± 0.42
4.853ThrArg: 4.853 ± 0.66
4.555ThrSer: 4.555 ± 0.613
4.555ThrThr: 4.555 ± 1.065
5.899ThrVal: 5.899 ± 0.891
1.792ThrTrp: 1.792 ± 0.438
1.643ThrTyr: 1.643 ± 0.36
0.0ThrXaa: 0.0 ± 0.0
Val
7.915ValAla: 7.915 ± 0.58
0.373ValCys: 0.373 ± 0.148
5.301ValAsp: 5.301 ± 0.657
5.227ValGlu: 5.227 ± 0.736
1.792ValPhe: 1.792 ± 0.434
5.749ValGly: 5.749 ± 0.708
1.269ValHis: 1.269 ± 0.283
4.629ValIle: 4.629 ± 0.583
3.808ValLys: 3.808 ± 0.526
5.824ValLeu: 5.824 ± 0.707
1.344ValMet: 1.344 ± 0.368
2.613ValAsn: 2.613 ± 0.577
3.883ValPro: 3.883 ± 0.594
2.987ValGln: 2.987 ± 0.401
3.808ValArg: 3.808 ± 0.546
3.584ValSer: 3.584 ± 0.566
4.629ValThr: 4.629 ± 0.62
4.779ValVal: 4.779 ± 0.669
2.016ValTrp: 2.016 ± 0.419
2.389ValTyr: 2.389 ± 0.413
0.0ValXaa: 0.0 ± 0.0
Trp
1.493TrpAla: 1.493 ± 0.27
0.224TrpCys: 0.224 ± 0.126
2.016TrpAsp: 2.016 ± 0.335
1.195TrpGlu: 1.195 ± 0.316
0.597TrpPhe: 0.597 ± 0.195
0.747TrpGly: 0.747 ± 0.307
0.523TrpHis: 0.523 ± 0.191
1.269TrpIle: 1.269 ± 0.333
0.821TrpLys: 0.821 ± 0.281
1.643TrpLeu: 1.643 ± 0.317
0.299TrpMet: 0.299 ± 0.147
0.971TrpAsn: 0.971 ± 0.223
1.12TrpPro: 1.12 ± 0.284
0.971TrpGln: 0.971 ± 0.255
0.672TrpArg: 0.672 ± 0.213
0.971TrpSer: 0.971 ± 0.329
1.792TrpThr: 1.792 ± 0.469
1.568TrpVal: 1.568 ± 0.334
0.523TrpTrp: 0.523 ± 0.242
0.448TrpTyr: 0.448 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.315TyrAla: 2.315 ± 0.434
0.373TyrCys: 0.373 ± 0.155
2.091TyrAsp: 2.091 ± 0.499
1.941TyrGlu: 1.941 ± 0.454
0.747TyrPhe: 0.747 ± 0.206
2.688TyrGly: 2.688 ± 0.397
0.448TyrHis: 0.448 ± 0.16
1.195TyrIle: 1.195 ± 0.286
1.269TyrLys: 1.269 ± 0.302
2.389TyrLeu: 2.389 ± 0.379
0.821TyrMet: 0.821 ± 0.205
1.344TyrAsn: 1.344 ± 0.352
1.717TyrPro: 1.717 ± 0.415
0.896TyrGln: 0.896 ± 0.263
2.091TyrArg: 2.091 ± 0.347
1.568TyrSer: 1.568 ± 0.413
1.344TyrThr: 1.344 ± 0.398
2.24TyrVal: 2.24 ± 0.399
0.448TyrTrp: 0.448 ± 0.16
0.971TyrTyr: 0.971 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13394 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski