Amino acid dipepetide frequency for Streptococcus virus 858

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.218AlaAla: 6.218 ± 2.128
0.541AlaCys: 0.541 ± 0.276
4.776AlaAsp: 4.776 ± 0.978
5.677AlaGlu: 5.677 ± 0.842
2.794AlaPhe: 2.794 ± 1.097
5.767AlaGly: 5.767 ± 1.287
0.631AlaHis: 0.631 ± 0.199
5.948AlaIle: 5.948 ± 1.706
4.416AlaLys: 4.416 ± 0.567
6.668AlaLeu: 6.668 ± 1.559
2.343AlaMet: 2.343 ± 1.028
4.506AlaAsn: 4.506 ± 0.855
2.163AlaPro: 2.163 ± 0.437
3.064AlaGln: 3.064 ± 1.146
2.974AlaArg: 2.974 ± 0.595
6.218AlaSer: 6.218 ± 1.54
5.407AlaThr: 5.407 ± 1.129
4.596AlaVal: 4.596 ± 1.089
0.721AlaTrp: 0.721 ± 0.222
2.523AlaTyr: 2.523 ± 0.522
0.0AlaXaa: 0.0 ± 0.0
Cys
0.18CysAla: 0.18 ± 0.133
0.09CysCys: 0.09 ± 0.092
0.451CysAsp: 0.451 ± 0.217
0.451CysGlu: 0.451 ± 0.254
0.09CysPhe: 0.09 ± 0.098
0.451CysGly: 0.451 ± 0.247
0.09CysHis: 0.09 ± 0.092
0.18CysIle: 0.18 ± 0.112
0.901CysLys: 0.901 ± 0.312
0.18CysLeu: 0.18 ± 0.124
0.09CysMet: 0.09 ± 0.099
0.541CysAsn: 0.541 ± 0.222
0.09CysPro: 0.09 ± 0.098
0.0CysGln: 0.0 ± 0.0
0.541CysArg: 0.541 ± 0.31
0.541CysSer: 0.541 ± 0.301
0.27CysThr: 0.27 ± 0.143
0.36CysVal: 0.36 ± 0.166
0.09CysTrp: 0.09 ± 0.093
0.36CysTyr: 0.36 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
3.064AspAla: 3.064 ± 0.367
0.36AspCys: 0.36 ± 0.191
3.605AspAsp: 3.605 ± 0.712
3.875AspGlu: 3.875 ± 0.848
3.064AspPhe: 3.064 ± 0.425
6.308AspGly: 6.308 ± 1.561
0.36AspHis: 0.36 ± 0.251
3.514AspIle: 3.514 ± 0.767
4.776AspLys: 4.776 ± 0.818
4.416AspLeu: 4.416 ± 0.78
1.171AspMet: 1.171 ± 0.294
3.154AspAsn: 3.154 ± 0.572
0.901AspPro: 0.901 ± 0.29
1.532AspGln: 1.532 ± 0.367
2.343AspArg: 2.343 ± 0.34
4.506AspSer: 4.506 ± 0.742
4.325AspThr: 4.325 ± 0.697
3.514AspVal: 3.514 ± 0.535
0.901AspTrp: 0.901 ± 0.398
3.244AspTyr: 3.244 ± 0.663
0.0AspXaa: 0.0 ± 0.0
Glu
4.686GluAla: 4.686 ± 0.881
0.18GluCys: 0.18 ± 0.112
2.703GluAsp: 2.703 ± 0.525
3.424GluGlu: 3.424 ± 0.776
2.523GluPhe: 2.523 ± 0.535
4.055GluGly: 4.055 ± 0.646
1.442GluHis: 1.442 ± 0.378
4.776GluIle: 4.776 ± 0.681
4.416GluLys: 4.416 ± 0.936
6.578GluLeu: 6.578 ± 1.145
2.253GluMet: 2.253 ± 0.575
3.514GluAsn: 3.514 ± 0.538
1.442GluPro: 1.442 ± 0.429
2.974GluGln: 2.974 ± 0.571
4.145GluArg: 4.145 ± 0.819
2.794GluSer: 2.794 ± 0.762
3.154GluThr: 3.154 ± 0.586
4.866GluVal: 4.866 ± 0.676
0.991GluTrp: 0.991 ± 0.295
3.064GluTyr: 3.064 ± 0.671
0.0GluXaa: 0.0 ± 0.0
Phe
2.253PheAla: 2.253 ± 0.483
0.36PheCys: 0.36 ± 0.237
2.433PheAsp: 2.433 ± 0.492
3.965PheGlu: 3.965 ± 0.618
1.171PhePhe: 1.171 ± 0.433
3.244PheGly: 3.244 ± 0.667
0.541PheHis: 0.541 ± 0.165
2.613PheIle: 2.613 ± 0.347
3.875PheLys: 3.875 ± 0.452
1.983PheLeu: 1.983 ± 0.607
0.631PheMet: 0.631 ± 0.247
2.613PheAsn: 2.613 ± 0.337
0.451PhePro: 0.451 ± 0.227
1.352PheGln: 1.352 ± 0.278
1.352PheArg: 1.352 ± 0.364
3.875PheSer: 3.875 ± 0.735
3.154PheThr: 3.154 ± 0.531
2.523PheVal: 2.523 ± 0.681
0.541PheTrp: 0.541 ± 0.247
1.171PheTyr: 1.171 ± 0.395
0.0PheXaa: 0.0 ± 0.0
Gly
5.497GlyAla: 5.497 ± 1.118
0.451GlyCys: 0.451 ± 0.217
3.605GlyAsp: 3.605 ± 0.486
3.424GlyGlu: 3.424 ± 0.423
3.154GlyPhe: 3.154 ± 0.585
2.884GlyGly: 2.884 ± 0.5
0.541GlyHis: 0.541 ± 0.221
7.209GlyIle: 7.209 ± 1.761
6.849GlyLys: 6.849 ± 0.857
5.767GlyLeu: 5.767 ± 0.784
2.163GlyMet: 2.163 ± 0.731
3.695GlyAsn: 3.695 ± 0.58
0.991GlyPro: 0.991 ± 0.434
2.884GlyGln: 2.884 ± 0.479
3.064GlyArg: 3.064 ± 0.56
4.866GlySer: 4.866 ± 0.803
5.767GlyThr: 5.767 ± 0.729
4.506GlyVal: 4.506 ± 0.566
1.081GlyTrp: 1.081 ± 0.324
2.703GlyTyr: 2.703 ± 0.581
0.0GlyXaa: 0.0 ± 0.0
His
0.901HisAla: 0.901 ± 0.265
0.09HisCys: 0.09 ± 0.066
0.991HisAsp: 0.991 ± 0.251
0.541HisGlu: 0.541 ± 0.258
0.541HisPhe: 0.541 ± 0.208
0.631HisGly: 0.631 ± 0.249
0.451HisHis: 0.451 ± 0.192
1.262HisIle: 1.262 ± 0.319
1.081HisLys: 1.081 ± 0.27
0.631HisLeu: 0.631 ± 0.204
0.27HisMet: 0.27 ± 0.17
0.991HisAsn: 0.991 ± 0.313
0.27HisPro: 0.27 ± 0.152
0.27HisGln: 0.27 ± 0.142
0.541HisArg: 0.541 ± 0.217
0.811HisSer: 0.811 ± 0.323
0.541HisThr: 0.541 ± 0.187
0.991HisVal: 0.991 ± 0.394
0.18HisTrp: 0.18 ± 0.127
0.541HisTyr: 0.541 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
5.767IleAla: 5.767 ± 1.062
0.451IleCys: 0.451 ± 0.253
5.046IleAsp: 5.046 ± 0.819
3.605IleGlu: 3.605 ± 0.651
1.802IlePhe: 1.802 ± 0.405
5.407IleGly: 5.407 ± 1.116
1.171IleHis: 1.171 ± 0.25
2.884IleIle: 2.884 ± 0.787
4.866IleLys: 4.866 ± 0.577
3.785IleLeu: 3.785 ± 0.653
2.073IleMet: 2.073 ± 0.374
3.695IleAsn: 3.695 ± 0.59
2.794IlePro: 2.794 ± 0.651
2.884IleGln: 2.884 ± 0.507
2.974IleArg: 2.974 ± 0.528
6.128IleSer: 6.128 ± 1.555
3.965IleThr: 3.965 ± 0.684
4.416IleVal: 4.416 ± 0.859
0.631IleTrp: 0.631 ± 0.272
2.794IleTyr: 2.794 ± 0.74
0.0IleXaa: 0.0 ± 0.0
Lys
6.849LysAla: 6.849 ± 1.003
0.18LysCys: 0.18 ± 0.123
3.605LysAsp: 3.605 ± 0.546
6.668LysGlu: 6.668 ± 1.034
1.983LysPhe: 1.983 ± 0.345
5.137LysGly: 5.137 ± 0.731
1.081LysHis: 1.081 ± 0.348
4.776LysIle: 4.776 ± 0.657
5.767LysLys: 5.767 ± 1.243
5.587LysLeu: 5.587 ± 0.847
1.712LysMet: 1.712 ± 0.408
4.325LysAsn: 4.325 ± 0.743
2.613LysPro: 2.613 ± 0.488
2.253LysGln: 2.253 ± 0.495
4.325LysArg: 4.325 ± 0.856
4.866LysSer: 4.866 ± 0.639
6.668LysThr: 6.668 ± 0.937
3.695LysVal: 3.695 ± 0.598
0.901LysTrp: 0.901 ± 0.251
3.695LysTyr: 3.695 ± 0.951
0.0LysXaa: 0.0 ± 0.0
Leu
6.038LeuAla: 6.038 ± 0.885
0.09LeuCys: 0.09 ± 0.098
5.046LeuAsp: 5.046 ± 0.913
5.137LeuGlu: 5.137 ± 0.957
2.253LeuPhe: 2.253 ± 0.444
5.317LeuGly: 5.317 ± 1.076
0.721LeuHis: 0.721 ± 0.303
4.055LeuIle: 4.055 ± 0.515
5.407LeuLys: 5.407 ± 0.798
4.596LeuLeu: 4.596 ± 0.552
1.532LeuMet: 1.532 ± 0.4
5.046LeuAsn: 5.046 ± 0.606
2.703LeuPro: 2.703 ± 0.517
2.703LeuGln: 2.703 ± 0.442
3.064LeuArg: 3.064 ± 0.525
5.497LeuSer: 5.497 ± 0.617
6.038LeuThr: 6.038 ± 0.878
4.776LeuVal: 4.776 ± 0.595
0.631LeuTrp: 0.631 ± 0.23
3.064LeuTyr: 3.064 ± 0.512
0.0LeuXaa: 0.0 ± 0.0
Met
2.703MetAla: 2.703 ± 0.825
0.0MetCys: 0.0 ± 0.0
0.721MetAsp: 0.721 ± 0.217
1.352MetGlu: 1.352 ± 0.36
0.991MetPhe: 0.991 ± 0.226
1.262MetGly: 1.262 ± 0.409
0.18MetHis: 0.18 ± 0.126
1.171MetIle: 1.171 ± 0.346
1.983MetLys: 1.983 ± 0.507
1.171MetLeu: 1.171 ± 0.232
1.262MetMet: 1.262 ± 0.528
1.262MetAsn: 1.262 ± 0.278
0.811MetPro: 0.811 ± 0.288
1.622MetGln: 1.622 ± 0.5
1.171MetArg: 1.171 ± 0.323
2.253MetSer: 2.253 ± 0.537
0.901MetThr: 0.901 ± 0.316
2.433MetVal: 2.433 ± 0.511
0.0MetTrp: 0.0 ± 0.0
1.081MetTyr: 1.081 ± 0.371
0.0MetXaa: 0.0 ± 0.0
Asn
4.145AsnAla: 4.145 ± 0.609
0.451AsnCys: 0.451 ± 0.173
3.695AsnAsp: 3.695 ± 0.749
4.325AsnGlu: 4.325 ± 0.763
2.253AsnPhe: 2.253 ± 0.481
6.038AsnGly: 6.038 ± 0.718
1.081AsnHis: 1.081 ± 0.348
2.884AsnIle: 2.884 ± 0.448
3.785AsnLys: 3.785 ± 0.629
3.334AsnLeu: 3.334 ± 0.52
0.991AsnMet: 0.991 ± 0.248
3.154AsnAsn: 3.154 ± 0.757
2.794AsnPro: 2.794 ± 0.67
1.892AsnGln: 1.892 ± 0.39
2.523AsnArg: 2.523 ± 0.522
3.334AsnSer: 3.334 ± 0.5
3.605AsnThr: 3.605 ± 0.837
2.794AsnVal: 2.794 ± 0.443
1.171AsnTrp: 1.171 ± 0.275
2.433AsnTyr: 2.433 ± 0.569
0.0AsnXaa: 0.0 ± 0.0
Pro
1.352ProAla: 1.352 ± 0.368
0.18ProCys: 0.18 ± 0.183
1.532ProAsp: 1.532 ± 0.468
1.532ProGlu: 1.532 ± 0.424
0.991ProPhe: 0.991 ± 0.285
1.532ProGly: 1.532 ± 0.432
0.18ProHis: 0.18 ± 0.113
1.532ProIle: 1.532 ± 0.358
3.064ProLys: 3.064 ± 0.465
1.802ProLeu: 1.802 ± 0.331
0.0ProMet: 0.0 ± 0.0
2.433ProAsn: 2.433 ± 0.578
0.811ProPro: 0.811 ± 0.239
1.262ProGln: 1.262 ± 0.308
1.802ProArg: 1.802 ± 0.498
2.974ProSer: 2.974 ± 0.59
1.892ProThr: 1.892 ± 0.644
1.712ProVal: 1.712 ± 0.428
0.36ProTrp: 0.36 ± 0.167
1.262ProTyr: 1.262 ± 0.29
0.0ProXaa: 0.0 ± 0.0
Gln
3.605GlnAla: 3.605 ± 1.14
0.18GlnCys: 0.18 ± 0.138
1.712GlnAsp: 1.712 ± 0.353
2.343GlnGlu: 2.343 ± 0.609
2.073GlnPhe: 2.073 ± 0.387
2.523GlnGly: 2.523 ± 0.836
0.36GlnHis: 0.36 ± 0.173
2.253GlnIle: 2.253 ± 0.538
2.253GlnLys: 2.253 ± 0.49
3.785GlnLeu: 3.785 ± 0.458
1.081GlnMet: 1.081 ± 0.379
1.442GlnAsn: 1.442 ± 0.292
0.991GlnPro: 0.991 ± 0.303
1.081GlnGln: 1.081 ± 0.33
1.262GlnArg: 1.262 ± 0.292
3.244GlnSer: 3.244 ± 0.643
2.523GlnThr: 2.523 ± 0.436
2.884GlnVal: 2.884 ± 0.415
0.541GlnTrp: 0.541 ± 0.232
1.532GlnTyr: 1.532 ± 0.469
0.0GlnXaa: 0.0 ± 0.0
Arg
3.605ArgAla: 3.605 ± 0.436
0.721ArgCys: 0.721 ± 0.306
2.433ArgAsp: 2.433 ± 0.485
2.794ArgGlu: 2.794 ± 0.641
1.892ArgPhe: 1.892 ± 0.609
3.064ArgGly: 3.064 ± 0.489
0.451ArgHis: 0.451 ± 0.26
2.703ArgIle: 2.703 ± 0.614
3.965ArgLys: 3.965 ± 0.727
4.235ArgLeu: 4.235 ± 0.688
1.262ArgMet: 1.262 ± 0.4
1.892ArgAsn: 1.892 ± 0.47
0.901ArgPro: 0.901 ± 0.226
1.171ArgGln: 1.171 ± 0.331
1.892ArgArg: 1.892 ± 0.566
2.343ArgSer: 2.343 ± 0.425
2.073ArgThr: 2.073 ± 0.514
2.523ArgVal: 2.523 ± 0.53
0.901ArgTrp: 0.901 ± 0.29
2.343ArgTyr: 2.343 ± 0.541
0.0ArgXaa: 0.0 ± 0.0
Ser
7.209SerAla: 7.209 ± 2.783
0.451SerCys: 0.451 ± 0.172
4.506SerAsp: 4.506 ± 0.805
4.506SerGlu: 4.506 ± 0.891
3.965SerPhe: 3.965 ± 0.619
5.046SerGly: 5.046 ± 0.601
0.811SerHis: 0.811 ± 0.297
5.137SerIle: 5.137 ± 0.922
5.227SerLys: 5.227 ± 0.714
5.046SerLeu: 5.046 ± 0.704
2.073SerMet: 2.073 ± 0.39
3.334SerAsn: 3.334 ± 0.478
1.622SerPro: 1.622 ± 0.51
3.064SerGln: 3.064 ± 0.911
1.532SerArg: 1.532 ± 0.326
4.325SerSer: 4.325 ± 0.911
5.407SerThr: 5.407 ± 0.749
6.218SerVal: 6.218 ± 0.745
1.352SerTrp: 1.352 ± 0.358
1.892SerTyr: 1.892 ± 0.465
0.0SerXaa: 0.0 ± 0.0
Thr
5.407ThrAla: 5.407 ± 1.749
0.27ThrCys: 0.27 ± 0.169
3.875ThrAsp: 3.875 ± 0.706
2.703ThrGlu: 2.703 ± 0.474
3.965ThrPhe: 3.965 ± 0.676
4.866ThrGly: 4.866 ± 0.755
1.171ThrHis: 1.171 ± 0.341
6.128ThrIle: 6.128 ± 1.157
5.227ThrLys: 5.227 ± 0.843
5.587ThrLeu: 5.587 ± 0.734
1.352ThrMet: 1.352 ± 0.625
3.244ThrAsn: 3.244 ± 0.53
2.163ThrPro: 2.163 ± 0.41
2.794ThrGln: 2.794 ± 0.58
1.983ThrArg: 1.983 ± 0.366
4.506ThrSer: 4.506 ± 0.825
4.325ThrThr: 4.325 ± 0.644
5.948ThrVal: 5.948 ± 0.652
0.541ThrTrp: 0.541 ± 0.332
2.974ThrTyr: 2.974 ± 0.779
0.0ThrXaa: 0.0 ± 0.0
Val
5.137ValAla: 5.137 ± 0.855
0.18ValCys: 0.18 ± 0.143
5.046ValAsp: 5.046 ± 0.684
4.686ValGlu: 4.686 ± 0.858
2.703ValPhe: 2.703 ± 0.563
4.325ValGly: 4.325 ± 0.68
0.451ValHis: 0.451 ± 0.166
4.596ValIle: 4.596 ± 0.596
5.857ValLys: 5.857 ± 0.637
4.325ValLeu: 4.325 ± 0.459
1.081ValMet: 1.081 ± 0.338
4.506ValAsn: 4.506 ± 0.878
2.253ValPro: 2.253 ± 0.373
2.433ValGln: 2.433 ± 0.655
2.163ValArg: 2.163 ± 0.562
6.038ValSer: 6.038 ± 0.815
4.416ValThr: 4.416 ± 0.644
4.325ValVal: 4.325 ± 0.734
0.811ValTrp: 0.811 ± 0.239
1.622ValTyr: 1.622 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
0.541TrpAla: 0.541 ± 0.14
0.09TrpCys: 0.09 ± 0.09
0.541TrpAsp: 0.541 ± 0.196
1.081TrpGlu: 1.081 ± 0.335
0.451TrpPhe: 0.451 ± 0.231
0.811TrpGly: 0.811 ± 0.285
0.18TrpHis: 0.18 ± 0.133
0.631TrpIle: 0.631 ± 0.196
0.721TrpLys: 0.721 ± 0.17
1.081TrpLeu: 1.081 ± 0.358
0.27TrpMet: 0.27 ± 0.129
0.991TrpAsn: 0.991 ± 0.337
0.18TrpPro: 0.18 ± 0.108
0.541TrpGln: 0.541 ± 0.217
0.721TrpArg: 0.721 ± 0.245
1.081TrpSer: 1.081 ± 0.501
1.352TrpThr: 1.352 ± 0.447
0.991TrpVal: 0.991 ± 0.25
0.36TrpTrp: 0.36 ± 0.212
0.27TrpTyr: 0.27 ± 0.169
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.794TyrAla: 2.794 ± 0.502
0.631TyrCys: 0.631 ± 0.243
2.613TyrAsp: 2.613 ± 0.755
1.892TyrGlu: 1.892 ± 0.523
1.442TyrPhe: 1.442 ± 0.389
2.433TyrGly: 2.433 ± 0.503
0.631TyrHis: 0.631 ± 0.201
2.884TyrIle: 2.884 ± 0.665
2.253TyrLys: 2.253 ± 0.465
3.244TyrLeu: 3.244 ± 0.623
0.631TyrMet: 0.631 ± 0.211
2.343TyrAsn: 2.343 ± 0.556
1.262TyrPro: 1.262 ± 0.312
1.892TyrGln: 1.892 ± 0.51
2.794TyrArg: 2.794 ± 0.716
2.523TyrSer: 2.523 ± 0.483
3.244TyrThr: 3.244 ± 0.916
2.884TyrVal: 2.884 ± 0.504
0.18TyrTrp: 0.18 ± 0.137
1.712TyrTyr: 1.712 ± 0.549
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11098 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski