Amino acid dipepetide frequency for Staphylococcus virus phiSLT

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.669AlaAla: 2.669 ± 0.891
0.314AlaCys: 0.314 ± 0.173
2.512AlaAsp: 2.512 ± 0.562
4.239AlaGlu: 4.239 ± 0.483
1.334AlaPhe: 1.334 ± 0.268
3.611AlaGly: 3.611 ± 0.708
1.177AlaHis: 1.177 ± 0.272
3.925AlaIle: 3.925 ± 0.864
6.436AlaLys: 6.436 ± 1.24
5.338AlaLeu: 5.338 ± 0.806
0.942AlaMet: 0.942 ± 0.233
4.788AlaAsn: 4.788 ± 0.826
1.413AlaPro: 1.413 ± 0.284
1.491AlaGln: 1.491 ± 0.383
2.512AlaArg: 2.512 ± 0.43
4.082AlaSer: 4.082 ± 0.756
3.925AlaThr: 3.925 ± 0.588
3.061AlaVal: 3.061 ± 0.507
0.863AlaTrp: 0.863 ± 0.376
2.119AlaTyr: 2.119 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.078CysAla: 0.078 ± 0.088
0.0CysCys: 0.0 ± 0.0
0.157CysAsp: 0.157 ± 0.129
0.628CysGlu: 0.628 ± 0.263
0.471CysPhe: 0.471 ± 0.224
0.235CysGly: 0.235 ± 0.129
0.157CysHis: 0.157 ± 0.108
0.235CysIle: 0.235 ± 0.143
0.471CysLys: 0.471 ± 0.245
0.235CysLeu: 0.235 ± 0.159
0.078CysMet: 0.078 ± 0.089
0.314CysAsn: 0.314 ± 0.155
0.078CysPro: 0.078 ± 0.082
0.078CysGln: 0.078 ± 0.077
0.314CysArg: 0.314 ± 0.159
0.0CysSer: 0.0 ± 0.0
0.235CysThr: 0.235 ± 0.158
0.235CysVal: 0.235 ± 0.148
0.0CysTrp: 0.0 ± 0.0
0.471CysTyr: 0.471 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
3.14AspAla: 3.14 ± 0.595
0.235AspCys: 0.235 ± 0.146
3.768AspAsp: 3.768 ± 0.707
4.474AspGlu: 4.474 ± 0.874
3.14AspPhe: 3.14 ± 0.6
3.375AspGly: 3.375 ± 0.68
0.628AspHis: 0.628 ± 0.167
5.181AspIle: 5.181 ± 0.581
6.593AspLys: 6.593 ± 0.9
5.416AspLeu: 5.416 ± 0.542
1.884AspMet: 1.884 ± 0.405
2.983AspAsn: 2.983 ± 0.406
0.863AspPro: 0.863 ± 0.21
1.256AspGln: 1.256 ± 0.303
2.276AspArg: 2.276 ± 0.47
4.16AspSer: 4.16 ± 0.761
3.297AspThr: 3.297 ± 0.472
3.454AspVal: 3.454 ± 0.484
0.863AspTrp: 0.863 ± 0.247
2.983AspTyr: 2.983 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
3.925GluAla: 3.925 ± 0.604
0.471GluCys: 0.471 ± 0.214
4.16GluAsp: 4.16 ± 0.632
5.73GluGlu: 5.73 ± 0.979
2.983GluPhe: 2.983 ± 0.539
3.218GluGly: 3.218 ± 0.478
0.863GluHis: 0.863 ± 0.275
5.495GluIle: 5.495 ± 1.01
9.027GluLys: 9.027 ± 1.005
6.436GluLeu: 6.436 ± 0.715
2.355GluMet: 2.355 ± 0.467
5.416GluAsn: 5.416 ± 0.67
1.256GluPro: 1.256 ± 0.379
3.532GluGln: 3.532 ± 0.591
3.14GluArg: 3.14 ± 0.557
3.611GluSer: 3.611 ± 0.602
4.239GluThr: 4.239 ± 0.723
3.454GluVal: 3.454 ± 0.498
1.02GluTrp: 1.02 ± 0.224
3.454GluTyr: 3.454 ± 0.673
0.0GluXaa: 0.0 ± 0.0
Phe
1.727PheAla: 1.727 ± 0.381
0.314PheCys: 0.314 ± 0.18
2.826PheAsp: 2.826 ± 0.604
3.218PheGlu: 3.218 ± 0.619
0.863PhePhe: 0.863 ± 0.255
3.061PheGly: 3.061 ± 0.632
0.549PheHis: 0.549 ± 0.201
3.611PheIle: 3.611 ± 0.825
4.16PheLys: 4.16 ± 0.484
2.119PheLeu: 2.119 ± 0.417
0.863PheMet: 0.863 ± 0.233
4.003PheAsn: 4.003 ± 0.5
0.706PhePro: 0.706 ± 0.348
1.099PheGln: 1.099 ± 0.279
1.413PheArg: 1.413 ± 0.352
2.119PheSer: 2.119 ± 0.429
2.119PheThr: 2.119 ± 0.501
2.276PheVal: 2.276 ± 0.463
0.235PheTrp: 0.235 ± 0.147
1.491PheTyr: 1.491 ± 0.409
0.0PheXaa: 0.0 ± 0.0
Gly
3.689GlyAla: 3.689 ± 1.088
0.314GlyCys: 0.314 ± 0.158
3.689GlyAsp: 3.689 ± 0.522
3.689GlyGlu: 3.689 ± 0.498
2.669GlyPhe: 2.669 ± 0.365
4.71GlyGly: 4.71 ± 1.294
1.648GlyHis: 1.648 ± 0.415
3.768GlyIle: 3.768 ± 0.537
5.338GlyLys: 5.338 ± 0.616
5.338GlyLeu: 5.338 ± 0.998
1.413GlyMet: 1.413 ± 0.425
3.611GlyAsn: 3.611 ± 0.691
1.099GlyPro: 1.099 ± 0.269
1.962GlyGln: 1.962 ± 0.532
2.041GlyArg: 2.041 ± 0.434
4.631GlySer: 4.631 ± 0.594
3.689GlyThr: 3.689 ± 0.661
4.71GlyVal: 4.71 ± 0.69
1.177GlyTrp: 1.177 ± 0.291
3.218GlyTyr: 3.218 ± 0.623
0.0GlyXaa: 0.0 ± 0.0
His
1.177HisAla: 1.177 ± 0.332
0.0HisCys: 0.0 ± 0.0
0.628HisAsp: 0.628 ± 0.185
1.099HisGlu: 1.099 ± 0.267
0.628HisPhe: 0.628 ± 0.205
1.334HisGly: 1.334 ± 0.386
0.392HisHis: 0.392 ± 0.15
1.491HisIle: 1.491 ± 0.433
1.334HisLys: 1.334 ± 0.319
1.57HisLeu: 1.57 ± 0.282
0.314HisMet: 0.314 ± 0.162
1.413HisAsn: 1.413 ± 0.317
0.785HisPro: 0.785 ± 0.181
0.549HisGln: 0.549 ± 0.191
0.785HisArg: 0.785 ± 0.198
0.785HisSer: 0.785 ± 0.171
1.57HisThr: 1.57 ± 0.376
0.942HisVal: 0.942 ± 0.312
0.235HisTrp: 0.235 ± 0.141
1.02HisTyr: 1.02 ± 0.3
0.0HisXaa: 0.0 ± 0.0
Ile
4.16IleAla: 4.16 ± 0.695
0.235IleCys: 0.235 ± 0.187
4.396IleAsp: 4.396 ± 0.79
5.573IleGlu: 5.573 ± 0.694
2.59IlePhe: 2.59 ± 0.656
4.16IleGly: 4.16 ± 0.585
1.727IleHis: 1.727 ± 0.377
4.003IleIle: 4.003 ± 0.79
8.242IleLys: 8.242 ± 0.93
4.867IleLeu: 4.867 ± 0.829
1.727IleMet: 1.727 ± 0.404
5.102IleAsn: 5.102 ± 0.566
1.962IlePro: 1.962 ± 0.273
2.276IleGln: 2.276 ± 0.378
3.218IleArg: 3.218 ± 0.401
4.631IleSer: 4.631 ± 0.612
4.788IleThr: 4.788 ± 0.628
4.16IleVal: 4.16 ± 0.822
0.471IleTrp: 0.471 ± 0.197
2.669IleTyr: 2.669 ± 0.463
0.0IleXaa: 0.0 ± 0.0
Lys
7.614LysAla: 7.614 ± 1.399
0.235LysCys: 0.235 ± 0.183
5.73LysAsp: 5.73 ± 0.757
7.849LysGlu: 7.849 ± 0.678
2.512LysPhe: 2.512 ± 0.447
6.044LysGly: 6.044 ± 0.963
1.648LysHis: 1.648 ± 0.39
5.965LysIle: 5.965 ± 0.866
7.849LysLys: 7.849 ± 1.174
8.085LysLeu: 8.085 ± 0.996
3.689LysMet: 3.689 ± 0.588
6.593LysAsn: 6.593 ± 0.629
2.669LysPro: 2.669 ± 0.428
5.102LysGln: 5.102 ± 0.6
4.082LysArg: 4.082 ± 0.737
6.593LysSer: 6.593 ± 1.415
5.651LysThr: 5.651 ± 0.804
5.181LysVal: 5.181 ± 0.802
1.57LysTrp: 1.57 ± 0.377
4.631LysTyr: 4.631 ± 0.631
0.0LysXaa: 0.0 ± 0.0
Leu
4.474LeuAla: 4.474 ± 0.82
0.392LeuCys: 0.392 ± 0.194
4.945LeuAsp: 4.945 ± 1.072
6.358LeuGlu: 6.358 ± 0.697
3.532LeuPhe: 3.532 ± 0.586
4.553LeuGly: 4.553 ± 0.909
1.256LeuHis: 1.256 ± 0.372
5.338LeuIle: 5.338 ± 0.721
8.085LeuLys: 8.085 ± 1.423
6.75LeuLeu: 6.75 ± 0.865
1.648LeuMet: 1.648 ± 0.326
5.73LeuAsn: 5.73 ± 0.666
2.59LeuPro: 2.59 ± 0.525
2.59LeuGln: 2.59 ± 0.445
2.983LeuArg: 2.983 ± 0.483
5.338LeuSer: 5.338 ± 0.794
5.181LeuThr: 5.181 ± 0.784
3.689LeuVal: 3.689 ± 0.43
0.235LeuTrp: 0.235 ± 0.14
3.532LeuTyr: 3.532 ± 0.863
0.0LeuXaa: 0.0 ± 0.0
Met
1.099MetAla: 1.099 ± 0.227
0.235MetCys: 0.235 ± 0.153
1.491MetAsp: 1.491 ± 0.335
1.02MetGlu: 1.02 ± 0.352
0.863MetPhe: 0.863 ± 0.292
1.57MetGly: 1.57 ± 0.527
0.471MetHis: 0.471 ± 0.206
1.177MetIle: 1.177 ± 0.26
2.826MetLys: 2.826 ± 0.505
1.805MetLeu: 1.805 ± 0.469
0.392MetMet: 0.392 ± 0.222
2.276MetAsn: 2.276 ± 0.454
0.863MetPro: 0.863 ± 0.236
1.805MetGln: 1.805 ± 0.548
0.785MetArg: 0.785 ± 0.206
1.884MetSer: 1.884 ± 0.347
2.198MetThr: 2.198 ± 0.435
1.099MetVal: 1.099 ± 0.296
0.314MetTrp: 0.314 ± 0.141
0.863MetTyr: 0.863 ± 0.273
0.0MetXaa: 0.0 ± 0.0
Asn
3.846AsnAla: 3.846 ± 0.55
0.078AsnCys: 0.078 ± 0.078
4.631AsnAsp: 4.631 ± 0.613
5.024AsnGlu: 5.024 ± 0.747
2.119AsnPhe: 2.119 ± 0.529
5.259AsnGly: 5.259 ± 0.65
0.785AsnHis: 0.785 ± 0.256
5.024AsnIle: 5.024 ± 0.485
6.75AsnLys: 6.75 ± 0.869
4.788AsnLeu: 4.788 ± 0.52
1.334AsnMet: 1.334 ± 0.304
5.965AsnAsn: 5.965 ± 1.075
2.512AsnPro: 2.512 ± 0.404
3.454AsnGln: 3.454 ± 0.538
2.59AsnArg: 2.59 ± 0.497
4.867AsnSer: 4.867 ± 0.61
4.396AsnThr: 4.396 ± 0.531
3.375AsnVal: 3.375 ± 0.6
1.334AsnTrp: 1.334 ± 0.367
3.218AsnTyr: 3.218 ± 0.567
0.0AsnXaa: 0.0 ± 0.0
Pro
0.942ProAla: 0.942 ± 0.269
0.235ProCys: 0.235 ± 0.156
1.177ProAsp: 1.177 ± 0.32
2.198ProGlu: 2.198 ± 0.487
1.413ProPhe: 1.413 ± 0.403
1.491ProGly: 1.491 ± 0.38
0.235ProHis: 0.235 ± 0.129
1.884ProIle: 1.884 ± 0.388
2.433ProLys: 2.433 ± 0.497
2.198ProLeu: 2.198 ± 0.568
0.863ProMet: 0.863 ± 0.304
2.041ProAsn: 2.041 ± 0.433
0.471ProPro: 0.471 ± 0.19
1.099ProGln: 1.099 ± 0.222
1.099ProArg: 1.099 ± 0.253
2.198ProSer: 2.198 ± 0.334
1.413ProThr: 1.413 ± 0.359
1.02ProVal: 1.02 ± 0.333
0.235ProTrp: 0.235 ± 0.141
1.256ProTyr: 1.256 ± 0.348
0.0ProXaa: 0.0 ± 0.0
Gln
2.826GlnAla: 2.826 ± 0.409
0.157GlnCys: 0.157 ± 0.117
2.276GlnAsp: 2.276 ± 0.476
2.669GlnGlu: 2.669 ± 0.42
1.491GlnPhe: 1.491 ± 0.267
2.041GlnGly: 2.041 ± 0.401
0.942GlnHis: 0.942 ± 0.216
3.297GlnIle: 3.297 ± 0.638
3.218GlnLys: 3.218 ± 0.472
3.061GlnLeu: 3.061 ± 0.567
0.785GlnMet: 0.785 ± 0.236
2.669GlnAsn: 2.669 ± 0.588
1.256GlnPro: 1.256 ± 0.331
1.727GlnGln: 1.727 ± 0.465
2.276GlnArg: 2.276 ± 0.451
2.826GlnSer: 2.826 ± 0.455
1.491GlnThr: 1.491 ± 0.415
2.041GlnVal: 2.041 ± 0.384
0.157GlnTrp: 0.157 ± 0.108
1.648GlnTyr: 1.648 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
2.041ArgAla: 2.041 ± 0.559
0.235ArgCys: 0.235 ± 0.13
2.59ArgAsp: 2.59 ± 0.374
2.747ArgGlu: 2.747 ± 0.465
1.727ArgPhe: 1.727 ± 0.328
2.119ArgGly: 2.119 ± 0.389
1.02ArgHis: 1.02 ± 0.304
3.768ArgIle: 3.768 ± 0.582
4.082ArgLys: 4.082 ± 0.519
2.826ArgLeu: 2.826 ± 0.534
1.02ArgMet: 1.02 ± 0.25
2.433ArgAsn: 2.433 ± 0.506
1.02ArgPro: 1.02 ± 0.425
1.256ArgGln: 1.256 ± 0.305
1.57ArgArg: 1.57 ± 0.325
1.962ArgSer: 1.962 ± 0.393
2.198ArgThr: 2.198 ± 0.46
1.727ArgVal: 1.727 ± 0.452
0.471ArgTrp: 0.471 ± 0.169
2.355ArgTyr: 2.355 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
3.689SerAla: 3.689 ± 0.839
0.235SerCys: 0.235 ± 0.144
5.024SerAsp: 5.024 ± 0.563
4.474SerGlu: 4.474 ± 0.49
2.669SerPhe: 2.669 ± 0.594
5.259SerGly: 5.259 ± 0.806
0.863SerHis: 0.863 ± 0.226
4.003SerIle: 4.003 ± 0.561
7.064SerLys: 7.064 ± 1.203
4.082SerLeu: 4.082 ± 0.533
1.884SerMet: 1.884 ± 0.397
5.102SerAsn: 5.102 ± 0.684
1.177SerPro: 1.177 ± 0.415
3.218SerGln: 3.218 ± 0.488
2.198SerArg: 2.198 ± 0.414
4.317SerSer: 4.317 ± 0.741
2.826SerThr: 2.826 ± 0.456
3.611SerVal: 3.611 ± 0.646
0.785SerTrp: 0.785 ± 0.227
2.669SerTyr: 2.669 ± 0.508
0.0SerXaa: 0.0 ± 0.0
Thr
4.003ThrAla: 4.003 ± 0.574
0.235ThrCys: 0.235 ± 0.143
3.611ThrAsp: 3.611 ± 0.543
3.611ThrGlu: 3.611 ± 0.579
2.669ThrPhe: 2.669 ± 0.425
4.16ThrGly: 4.16 ± 0.682
1.727ThrHis: 1.727 ± 0.431
4.553ThrIle: 4.553 ± 0.745
5.024ThrLys: 5.024 ± 0.797
4.239ThrLeu: 4.239 ± 0.503
0.785ThrMet: 0.785 ± 0.237
3.846ThrAsn: 3.846 ± 0.631
2.669ThrPro: 2.669 ± 0.416
1.805ThrGln: 1.805 ± 0.371
1.727ThrArg: 1.727 ± 0.301
3.375ThrSer: 3.375 ± 0.575
3.454ThrThr: 3.454 ± 0.545
5.181ThrVal: 5.181 ± 0.697
0.392ThrTrp: 0.392 ± 0.176
2.355ThrTyr: 2.355 ± 0.514
0.0ThrXaa: 0.0 ± 0.0
Val
3.14ValAla: 3.14 ± 0.493
0.235ValCys: 0.235 ± 0.146
3.297ValAsp: 3.297 ± 0.548
5.181ValGlu: 5.181 ± 0.514
2.512ValPhe: 2.512 ± 0.523
3.061ValGly: 3.061 ± 0.582
0.785ValHis: 0.785 ± 0.2
4.16ValIle: 4.16 ± 0.603
5.259ValLys: 5.259 ± 0.716
4.553ValLeu: 4.553 ± 0.625
1.256ValMet: 1.256 ± 0.235
3.768ValAsn: 3.768 ± 0.461
1.648ValPro: 1.648 ± 0.446
2.119ValGln: 2.119 ± 0.37
1.805ValArg: 1.805 ± 0.517
3.768ValSer: 3.768 ± 0.625
3.689ValThr: 3.689 ± 0.811
3.061ValVal: 3.061 ± 0.449
0.471ValTrp: 0.471 ± 0.219
2.041ValTyr: 2.041 ± 0.425
0.0ValXaa: 0.0 ± 0.0
Trp
0.235TrpAla: 0.235 ± 0.134
0.0TrpCys: 0.0 ± 0.0
0.392TrpAsp: 0.392 ± 0.22
0.785TrpGlu: 0.785 ± 0.324
1.099TrpPhe: 1.099 ± 0.334
0.471TrpGly: 0.471 ± 0.261
0.0TrpHis: 0.0 ± 0.0
1.256TrpIle: 1.256 ± 0.377
0.706TrpLys: 0.706 ± 0.218
1.02TrpLeu: 1.02 ± 0.309
0.549TrpMet: 0.549 ± 0.22
0.863TrpAsn: 0.863 ± 0.221
0.078TrpPro: 0.078 ± 0.066
0.628TrpGln: 0.628 ± 0.201
0.471TrpArg: 0.471 ± 0.257
0.863TrpSer: 0.863 ± 0.373
0.628TrpThr: 0.628 ± 0.183
0.863TrpVal: 0.863 ± 0.257
0.157TrpTrp: 0.157 ± 0.128
0.471TrpTyr: 0.471 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.276TyrAla: 2.276 ± 0.337
0.314TyrCys: 0.314 ± 0.163
2.669TyrAsp: 2.669 ± 0.692
3.218TyrGlu: 3.218 ± 0.622
1.491TyrPhe: 1.491 ± 0.467
2.669TyrGly: 2.669 ± 0.604
1.177TyrHis: 1.177 ± 0.359
2.826TyrIle: 2.826 ± 0.661
4.396TyrLys: 4.396 ± 0.579
4.474TyrLeu: 4.474 ± 0.768
1.177TyrMet: 1.177 ± 0.267
2.433TyrAsn: 2.433 ± 0.536
0.785TyrPro: 0.785 ± 0.206
1.805TyrGln: 1.805 ± 0.384
1.884TyrArg: 1.884 ± 0.476
3.14TyrSer: 3.14 ± 0.477
2.433TyrThr: 2.433 ± 0.503
2.669TyrVal: 2.669 ± 0.489
0.549TyrTrp: 0.549 ± 0.179
1.727TyrTyr: 1.727 ± 0.592
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (12741 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski