Amino acid dipepetide frequency for Streptococcus virus 9874

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.733AlaAla: 3.733 ± 0.687
0.415AlaCys: 0.415 ± 0.185
3.94AlaAsp: 3.94 ± 0.653
4.148AlaGlu: 4.148 ± 0.679
2.696AlaPhe: 2.696 ± 0.56
3.214AlaGly: 3.214 ± 0.804
0.622AlaHis: 0.622 ± 0.249
5.496AlaIle: 5.496 ± 1.011
5.703AlaLys: 5.703 ± 0.834
5.496AlaLeu: 5.496 ± 0.752
1.659AlaMet: 1.659 ± 0.391
5.496AlaAsn: 5.496 ± 0.889
2.178AlaPro: 2.178 ± 0.478
1.866AlaGln: 1.866 ± 0.488
2.385AlaArg: 2.385 ± 0.423
3.629AlaSer: 3.629 ± 0.629
3.733AlaThr: 3.733 ± 0.691
3.629AlaVal: 3.629 ± 0.787
2.074AlaTrp: 2.074 ± 0.548
2.281AlaTyr: 2.281 ± 0.385
0.0AlaXaa: 0.0 ± 0.0
Cys
0.207CysAla: 0.207 ± 0.219
0.104CysCys: 0.104 ± 0.11
0.933CysAsp: 0.933 ± 0.377
0.83CysGlu: 0.83 ± 0.282
0.0CysPhe: 0.0 ± 0.0
0.622CysGly: 0.622 ± 0.223
0.311CysHis: 0.311 ± 0.198
0.415CysIle: 0.415 ± 0.204
0.726CysLys: 0.726 ± 0.211
0.518CysLeu: 0.518 ± 0.204
0.0CysMet: 0.0 ± 0.0
0.415CysAsn: 0.415 ± 0.22
0.0CysPro: 0.0 ± 0.0
0.104CysGln: 0.104 ± 0.1
0.726CysArg: 0.726 ± 0.291
1.244CysSer: 1.244 ± 0.363
0.104CysThr: 0.104 ± 0.1
0.622CysVal: 0.622 ± 0.293
0.104CysTrp: 0.104 ± 0.109
0.207CysTyr: 0.207 ± 0.151
0.0CysXaa: 0.0 ± 0.0
Asp
3.214AspAla: 3.214 ± 0.579
0.83AspCys: 0.83 ± 0.327
5.703AspAsp: 5.703 ± 0.889
4.666AspGlu: 4.666 ± 0.847
3.526AspPhe: 3.526 ± 0.693
4.77AspGly: 4.77 ± 0.828
0.622AspHis: 0.622 ± 0.228
4.251AspIle: 4.251 ± 0.578
4.873AspLys: 4.873 ± 0.798
5.599AspLeu: 5.599 ± 0.805
1.244AspMet: 1.244 ± 0.355
3.526AspAsn: 3.526 ± 0.636
1.348AspPro: 1.348 ± 0.457
0.933AspGln: 0.933 ± 0.384
2.074AspArg: 2.074 ± 0.35
2.8AspSer: 2.8 ± 0.521
3.733AspThr: 3.733 ± 0.526
4.355AspVal: 4.355 ± 0.614
1.037AspTrp: 1.037 ± 0.381
2.592AspTyr: 2.592 ± 0.434
0.0AspXaa: 0.0 ± 0.0
Glu
4.148GluAla: 4.148 ± 0.644
0.518GluCys: 0.518 ± 0.306
3.214GluAsp: 3.214 ± 0.581
5.081GluGlu: 5.081 ± 0.86
2.178GluPhe: 2.178 ± 0.379
3.214GluGly: 3.214 ± 0.537
1.348GluHis: 1.348 ± 0.335
4.044GluIle: 4.044 ± 0.614
6.429GluLys: 6.429 ± 1.192
7.051GluLeu: 7.051 ± 0.879
1.555GluMet: 1.555 ± 0.407
3.422GluAsn: 3.422 ± 0.67
1.866GluPro: 1.866 ± 0.501
3.318GluGln: 3.318 ± 0.636
3.422GluArg: 3.422 ± 0.487
3.733GluSer: 3.733 ± 0.796
4.459GluThr: 4.459 ± 0.634
5.081GluVal: 5.081 ± 0.792
1.244GluTrp: 1.244 ± 0.333
3.526GluTyr: 3.526 ± 0.614
0.0GluXaa: 0.0 ± 0.0
Phe
3.214PheAla: 3.214 ± 0.669
0.622PheCys: 0.622 ± 0.294
3.629PheAsp: 3.629 ± 0.509
3.94PheGlu: 3.94 ± 0.735
1.348PhePhe: 1.348 ± 0.354
2.696PheGly: 2.696 ± 0.596
0.415PheHis: 0.415 ± 0.231
2.385PheIle: 2.385 ± 0.49
5.185PheLys: 5.185 ± 0.855
2.489PheLeu: 2.489 ± 0.435
1.659PheMet: 1.659 ± 0.337
3.111PheAsn: 3.111 ± 0.638
0.622PhePro: 0.622 ± 0.297
1.763PheGln: 1.763 ± 0.399
1.037PheArg: 1.037 ± 0.424
3.733PheSer: 3.733 ± 0.553
3.007PheThr: 3.007 ± 0.696
2.074PheVal: 2.074 ± 0.461
0.622PheTrp: 0.622 ± 0.29
1.452PheTyr: 1.452 ± 0.445
0.0PheXaa: 0.0 ± 0.0
Gly
4.148GlyAla: 4.148 ± 0.855
0.415GlyCys: 0.415 ± 0.174
3.422GlyAsp: 3.422 ± 0.564
1.659GlyGlu: 1.659 ± 0.391
2.592GlyPhe: 2.592 ± 0.496
2.903GlyGly: 2.903 ± 0.751
0.518GlyHis: 0.518 ± 0.223
6.533GlyIle: 6.533 ± 1.089
6.844GlyLys: 6.844 ± 1.003
5.599GlyLeu: 5.599 ± 1.227
1.141GlyMet: 1.141 ± 0.409
3.007GlyAsn: 3.007 ± 0.662
1.141GlyPro: 1.141 ± 0.348
3.007GlyGln: 3.007 ± 0.594
2.489GlyArg: 2.489 ± 0.633
3.422GlySer: 3.422 ± 0.68
5.081GlyThr: 5.081 ± 0.56
3.837GlyVal: 3.837 ± 0.725
0.415GlyTrp: 0.415 ± 0.203
3.422GlyTyr: 3.422 ± 0.644
0.0GlyXaa: 0.0 ± 0.0
His
0.726HisAla: 0.726 ± 0.28
0.0HisCys: 0.0 ± 0.0
0.83HisAsp: 0.83 ± 0.271
0.622HisGlu: 0.622 ± 0.296
0.104HisPhe: 0.104 ± 0.1
1.141HisGly: 1.141 ± 0.282
0.311HisHis: 0.311 ± 0.225
0.933HisIle: 0.933 ± 0.317
1.244HisLys: 1.244 ± 0.353
0.83HisLeu: 0.83 ± 0.265
0.104HisMet: 0.104 ± 0.112
0.622HisAsn: 0.622 ± 0.219
0.207HisPro: 0.207 ± 0.144
0.518HisGln: 0.518 ± 0.26
0.518HisArg: 0.518 ± 0.249
0.726HisSer: 0.726 ± 0.271
0.726HisThr: 0.726 ± 0.285
0.933HisVal: 0.933 ± 0.276
0.207HisTrp: 0.207 ± 0.159
0.622HisTyr: 0.622 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
3.837IleAla: 3.837 ± 0.779
0.726IleCys: 0.726 ± 0.317
3.422IleAsp: 3.422 ± 0.544
5.496IleGlu: 5.496 ± 0.867
2.592IlePhe: 2.592 ± 0.616
4.977IleGly: 4.977 ± 0.825
0.83IleHis: 0.83 ± 0.363
5.081IleIle: 5.081 ± 0.834
6.118IleLys: 6.118 ± 0.677
4.148IleLeu: 4.148 ± 0.526
1.659IleMet: 1.659 ± 0.403
6.533IleAsn: 6.533 ± 0.822
1.97IlePro: 1.97 ± 0.378
3.214IleGln: 3.214 ± 0.596
2.281IleArg: 2.281 ± 0.343
4.251IleSer: 4.251 ± 0.777
4.666IleThr: 4.666 ± 0.616
4.044IleVal: 4.044 ± 0.607
0.726IleTrp: 0.726 ± 0.231
1.763IleTyr: 1.763 ± 0.406
0.0IleXaa: 0.0 ± 0.0
Lys
7.155LysAla: 7.155 ± 1.215
0.311LysCys: 0.311 ± 0.232
4.873LysAsp: 4.873 ± 0.656
6.947LysGlu: 6.947 ± 1.037
3.111LysPhe: 3.111 ± 0.78
4.666LysGly: 4.666 ± 0.73
1.348LysHis: 1.348 ± 0.408
5.599LysIle: 5.599 ± 0.855
7.673LysLys: 7.673 ± 1.392
7.155LysLeu: 7.155 ± 1.153
2.8LysMet: 2.8 ± 0.562
5.91LysAsn: 5.91 ± 0.798
3.007LysPro: 3.007 ± 0.635
4.251LysGln: 4.251 ± 0.677
3.733LysArg: 3.733 ± 0.933
4.977LysSer: 4.977 ± 0.869
4.77LysThr: 4.77 ± 0.753
4.251LysVal: 4.251 ± 0.669
1.037LysTrp: 1.037 ± 0.302
4.459LysTyr: 4.459 ± 1.007
0.0LysXaa: 0.0 ± 0.0
Leu
4.562LeuAla: 4.562 ± 0.73
0.518LeuCys: 0.518 ± 0.289
5.392LeuAsp: 5.392 ± 0.58
5.496LeuGlu: 5.496 ± 1.138
3.837LeuPhe: 3.837 ± 0.636
4.148LeuGly: 4.148 ± 0.768
0.622LeuHis: 0.622 ± 0.228
4.355LeuIle: 4.355 ± 0.685
8.295LeuLys: 8.295 ± 1.234
6.429LeuLeu: 6.429 ± 0.962
1.97LeuMet: 1.97 ± 0.378
4.459LeuAsn: 4.459 ± 0.546
2.903LeuPro: 2.903 ± 0.4
2.696LeuGln: 2.696 ± 0.44
2.903LeuArg: 2.903 ± 0.58
7.051LeuSer: 7.051 ± 0.822
4.77LeuThr: 4.77 ± 0.813
3.629LeuVal: 3.629 ± 0.552
1.452LeuTrp: 1.452 ± 0.657
3.111LeuTyr: 3.111 ± 0.525
0.0LeuXaa: 0.0 ± 0.0
Met
1.452MetAla: 1.452 ± 0.426
0.0MetCys: 0.0 ± 0.0
1.141MetAsp: 1.141 ± 0.328
1.763MetGlu: 1.763 ± 0.413
0.726MetPhe: 0.726 ± 0.234
1.555MetGly: 1.555 ± 0.468
0.518MetHis: 0.518 ± 0.231
1.037MetIle: 1.037 ± 0.294
2.281MetLys: 2.281 ± 0.543
2.178MetLeu: 2.178 ± 0.422
0.83MetMet: 0.83 ± 0.323
1.763MetAsn: 1.763 ± 0.461
0.622MetPro: 0.622 ± 0.27
0.726MetGln: 0.726 ± 0.258
1.555MetArg: 1.555 ± 0.404
2.385MetSer: 2.385 ± 0.563
2.903MetThr: 2.903 ± 0.501
1.452MetVal: 1.452 ± 0.295
0.104MetTrp: 0.104 ± 0.102
1.037MetTyr: 1.037 ± 0.413
0.0MetXaa: 0.0 ± 0.0
Asn
5.185AsnAla: 5.185 ± 0.955
0.83AsnCys: 0.83 ± 0.29
2.903AsnAsp: 2.903 ± 0.426
3.318AsnGlu: 3.318 ± 0.759
3.111AsnPhe: 3.111 ± 0.657
6.533AsnGly: 6.533 ± 1.09
0.415AsnHis: 0.415 ± 0.181
4.459AsnIle: 4.459 ± 0.673
4.873AsnLys: 4.873 ± 0.673
5.392AsnLeu: 5.392 ± 0.635
1.866AsnMet: 1.866 ± 0.537
4.562AsnAsn: 4.562 ± 0.626
1.555AsnPro: 1.555 ± 0.338
2.592AsnGln: 2.592 ± 0.569
2.592AsnArg: 2.592 ± 0.402
4.459AsnSer: 4.459 ± 0.786
3.629AsnThr: 3.629 ± 0.509
4.355AsnVal: 4.355 ± 0.888
0.933AsnTrp: 0.933 ± 0.316
3.111AsnTyr: 3.111 ± 0.609
0.0AsnXaa: 0.0 ± 0.0
Pro
1.348ProAla: 1.348 ± 0.379
0.311ProCys: 0.311 ± 0.232
2.281ProAsp: 2.281 ± 0.55
1.452ProGlu: 1.452 ± 0.362
1.452ProPhe: 1.452 ± 0.344
0.415ProGly: 0.415 ± 0.234
0.518ProHis: 0.518 ± 0.228
1.555ProIle: 1.555 ± 0.417
2.8ProLys: 2.8 ± 0.545
1.97ProLeu: 1.97 ± 0.483
0.415ProMet: 0.415 ± 0.211
1.659ProAsn: 1.659 ± 0.33
0.83ProPro: 0.83 ± 0.237
0.83ProGln: 0.83 ± 0.322
1.141ProArg: 1.141 ± 0.329
1.348ProSer: 1.348 ± 0.303
1.763ProThr: 1.763 ± 0.432
1.452ProVal: 1.452 ± 0.348
0.415ProTrp: 0.415 ± 0.208
1.348ProTyr: 1.348 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
4.044GlnAla: 4.044 ± 0.752
0.311GlnCys: 0.311 ± 0.178
1.97GlnAsp: 1.97 ± 0.594
3.422GlnGlu: 3.422 ± 0.568
1.763GlnPhe: 1.763 ± 0.499
2.8GlnGly: 2.8 ± 0.55
0.207GlnHis: 0.207 ± 0.12
1.97GlnIle: 1.97 ± 0.392
2.592GlnLys: 2.592 ± 0.519
2.489GlnLeu: 2.489 ± 0.512
1.763GlnMet: 1.763 ± 0.338
1.452GlnAsn: 1.452 ± 0.324
0.933GlnPro: 0.933 ± 0.352
2.489GlnGln: 2.489 ± 0.749
1.348GlnArg: 1.348 ± 0.324
1.97GlnSer: 1.97 ± 0.375
2.489GlnThr: 2.489 ± 0.477
2.592GlnVal: 2.592 ± 0.563
0.933GlnTrp: 0.933 ± 0.321
2.074GlnTyr: 2.074 ± 0.414
0.0GlnXaa: 0.0 ± 0.0
Arg
2.178ArgAla: 2.178 ± 0.441
0.518ArgCys: 0.518 ± 0.21
1.866ArgAsp: 1.866 ± 0.481
2.8ArgGlu: 2.8 ± 0.541
2.489ArgPhe: 2.489 ± 0.534
1.659ArgGly: 1.659 ± 0.365
0.311ArgHis: 0.311 ± 0.182
2.489ArgIle: 2.489 ± 0.531
3.733ArgLys: 3.733 ± 0.557
4.666ArgLeu: 4.666 ± 0.882
1.452ArgMet: 1.452 ± 0.477
3.111ArgAsn: 3.111 ± 0.44
0.622ArgPro: 0.622 ± 0.317
1.452ArgGln: 1.452 ± 0.435
1.037ArgArg: 1.037 ± 0.357
2.592ArgSer: 2.592 ± 0.429
1.97ArgThr: 1.97 ± 0.419
2.074ArgVal: 2.074 ± 0.359
0.415ArgTrp: 0.415 ± 0.199
1.555ArgTyr: 1.555 ± 0.537
0.0ArgXaa: 0.0 ± 0.0
Ser
3.94SerAla: 3.94 ± 0.765
0.207SerCys: 0.207 ± 0.15
5.392SerAsp: 5.392 ± 0.59
4.666SerGlu: 4.666 ± 0.7
3.837SerPhe: 3.837 ± 0.709
4.148SerGly: 4.148 ± 0.964
0.726SerHis: 0.726 ± 0.29
5.807SerIle: 5.807 ± 0.836
3.94SerLys: 3.94 ± 0.83
4.459SerLeu: 4.459 ± 0.64
1.141SerMet: 1.141 ± 0.381
5.392SerAsn: 5.392 ± 1.087
1.037SerPro: 1.037 ± 0.281
3.111SerGln: 3.111 ± 0.463
2.281SerArg: 2.281 ± 0.533
3.318SerSer: 3.318 ± 0.658
3.526SerThr: 3.526 ± 0.629
4.044SerVal: 4.044 ± 0.682
0.83SerTrp: 0.83 ± 0.312
2.696SerTyr: 2.696 ± 0.656
0.0SerXaa: 0.0 ± 0.0
Thr
4.666ThrAla: 4.666 ± 0.693
0.311ThrCys: 0.311 ± 0.184
3.94ThrAsp: 3.94 ± 0.639
4.355ThrGlu: 4.355 ± 0.677
3.318ThrPhe: 3.318 ± 0.565
4.666ThrGly: 4.666 ± 0.7
0.518ThrHis: 0.518 ± 0.225
4.251ThrIle: 4.251 ± 0.717
5.288ThrLys: 5.288 ± 0.809
4.977ThrLeu: 4.977 ± 0.668
1.348ThrMet: 1.348 ± 0.434
3.837ThrAsn: 3.837 ± 0.631
0.726ThrPro: 0.726 ± 0.316
1.555ThrGln: 1.555 ± 0.39
2.489ThrArg: 2.489 ± 0.416
3.94ThrSer: 3.94 ± 0.603
4.251ThrThr: 4.251 ± 0.627
5.496ThrVal: 5.496 ± 0.892
0.726ThrTrp: 0.726 ± 0.35
2.592ThrTyr: 2.592 ± 0.473
0.0ThrXaa: 0.0 ± 0.0
Val
3.837ValAla: 3.837 ± 0.616
0.311ValCys: 0.311 ± 0.177
3.837ValAsp: 3.837 ± 0.9
4.666ValGlu: 4.666 ± 0.718
3.007ValPhe: 3.007 ± 0.686
3.111ValGly: 3.111 ± 0.662
0.83ValHis: 0.83 ± 0.469
3.526ValIle: 3.526 ± 0.53
5.91ValLys: 5.91 ± 0.893
3.94ValLeu: 3.94 ± 0.575
1.348ValMet: 1.348 ± 0.447
4.562ValAsn: 4.562 ± 0.726
1.97ValPro: 1.97 ± 0.475
2.385ValGln: 2.385 ± 0.57
1.866ValArg: 1.866 ± 0.416
4.562ValSer: 4.562 ± 0.656
4.977ValThr: 4.977 ± 0.937
4.044ValVal: 4.044 ± 0.77
0.933ValTrp: 0.933 ± 0.25
2.178ValTyr: 2.178 ± 0.444
0.0ValXaa: 0.0 ± 0.0
Trp
1.244TrpAla: 1.244 ± 0.368
0.0TrpCys: 0.0 ± 0.0
0.83TrpAsp: 0.83 ± 0.268
1.037TrpGlu: 1.037 ± 0.365
0.622TrpPhe: 0.622 ± 0.224
1.037TrpGly: 1.037 ± 0.366
0.104TrpHis: 0.104 ± 0.091
0.933TrpIle: 0.933 ± 0.275
0.933TrpLys: 0.933 ± 0.3
0.726TrpLeu: 0.726 ± 0.32
0.415TrpMet: 0.415 ± 0.204
1.555TrpAsn: 1.555 ± 0.596
0.207TrpPro: 0.207 ± 0.144
0.83TrpGln: 0.83 ± 0.398
0.933TrpArg: 0.933 ± 0.239
0.622TrpSer: 0.622 ± 0.201
0.933TrpThr: 0.933 ± 0.361
1.348TrpVal: 1.348 ± 0.396
0.104TrpTrp: 0.104 ± 0.1
0.415TrpTyr: 0.415 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.555TyrAla: 1.555 ± 0.314
0.933TyrCys: 0.933 ± 0.339
2.178TyrAsp: 2.178 ± 0.564
2.592TyrGlu: 2.592 ± 0.555
2.696TyrPhe: 2.696 ± 0.489
3.318TyrGly: 3.318 ± 0.603
0.83TyrHis: 0.83 ± 0.324
3.214TyrIle: 3.214 ± 0.78
2.592TyrLys: 2.592 ± 0.709
2.489TyrLeu: 2.489 ± 0.64
1.555TyrMet: 1.555 ± 0.538
2.281TyrAsn: 2.281 ± 0.579
1.555TyrPro: 1.555 ± 0.301
1.97TyrGln: 1.97 ± 0.502
2.178TyrArg: 2.178 ± 0.532
3.733TyrSer: 3.733 ± 0.469
1.763TyrThr: 1.763 ± 0.363
2.489TyrVal: 2.489 ± 0.49
0.518TyrTrp: 0.518 ± 0.18
1.97TyrTyr: 1.97 ± 0.663
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (9645 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski