Amino acid dipeptide frequency for Severe fever with thrombocytopenia syndrome virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
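The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) are assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_frequencies(sequences):
    """Return the frequency of all 441 dipeptides in per mille, pooled over sequences.

    Assumption: overlapping pairs are counted within each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa).
        clean = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the proteome above.
freqs = dipeptide_frequencies(["MKKA", "AXA"])
print(len(freqs))   # number of dipeptide classes, including Xaa
print(freqs["KK"])  # one KK among five pairs
```

With real data, the input would be the protein sequences of the proteome, and the returned values would be comparable to the per mille values in the table only if the pooling assumption matches the one used by the database.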

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
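As a small worked example of the unit conversion and of the comparison against the uniform expectation, consider the sketch below. It assumes that "expected" means the uniform value of 1/400 (0.25%, or 2.5‰); the source does not say whether the colouring uses exactly this threshold, and the function names are hypothetical.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 2.5 per mille = 0.25 %


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille / 1000.0


def representation(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Classify a dipeptide relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"      # would be marked in red
    if value_per_mille < expected:
        return "under"     # would be marked in blue
    return "expected"


# Values taken from the table: SerLeu and CysHis.
print(per_mille_to_fraction(10.287), representation(10.287))
print(per_mille_to_fraction(0.271), representation(0.271))
```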

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.768 ± 3.703
AlaCys: 1.354 ± 0.58
AlaAsp: 2.436 ± 0.667
AlaGlu: 2.436 ± 0.641
AlaPhe: 1.895 ± 0.726
AlaGly: 4.061 ± 1.042
AlaHis: 1.354 ± 0.861
AlaIle: 4.602 ± 0.631
AlaLys: 3.249 ± 0.502
AlaLeu: 5.956 ± 1.755
AlaMet: 1.895 ± 0.779
AlaAsn: 2.707 ± 0.542
AlaPro: 1.895 ± 0.747
AlaGln: 1.895 ± 0.919
AlaArg: 2.978 ± 0.183
AlaSer: 4.602 ± 1.529
AlaThr: 2.166 ± 0.389
AlaVal: 4.331 ± 1.375
AlaTrp: 1.354 ± 0.344
AlaTyr: 2.707 ± 1.388
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.624 ± 0.598
CysCys: 0.0 ± 0.0
CysAsp: 1.624 ± 0.516
CysGlu: 1.354 ± 0.58
CysPhe: 0.541 ± 0.172
CysGly: 1.354 ± 0.945
CysHis: 0.271 ± 0.268
CysIle: 0.812 ± 0.418
CysLys: 2.166 ± 0.996
CysLeu: 3.249 ± 0.833
CysMet: 1.895 ± 0.306
CysAsn: 0.271 ± 0.268
CysPro: 0.812 ± 0.446
CysGln: 1.083 ± 1.07
CysArg: 1.895 ± 1.096
CysSer: 3.519 ± 1.923
CysThr: 1.624 ± 0.835
CysVal: 1.354 ± 0.344
CysTrp: 0.541 ± 0.455
CysTyr: 1.624 ± 1.418
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.061 ± 0.809
AspCys: 1.354 ± 0.433
AspAsp: 2.436 ± 0.444
AspGlu: 3.519 ± 0.993
AspPhe: 1.895 ± 0.494
AspGly: 5.143 ± 0.567
AspHis: 0.541 ± 0.172
AspIle: 3.79 ± 1.31
AspLys: 1.354 ± 0.836
AspLeu: 4.331 ± 1.231
AspMet: 1.895 ± 1.59
AspAsn: 1.354 ± 0.836
AspPro: 2.436 ± 1.154
AspGln: 1.083 ± 0.668
AspArg: 2.436 ± 0.667
AspSer: 3.519 ± 0.45
AspThr: 2.166 ± 0.69
AspVal: 4.331 ± 0.182
AspTrp: 1.895 ± 0.764
AspTyr: 0.541 ± 0.334
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.873 ± 0.575
GluCys: 0.812 ± 0.732
GluAsp: 3.249 ± 1.386
GluGlu: 5.685 ± 1.059
GluPhe: 3.249 ± 0.639
GluGly: 5.414 ± 0.698
GluHis: 0.812 ± 0.418
GluIle: 4.061 ± 0.559
GluLys: 4.331 ± 1.181
GluLeu: 7.038 ± 0.611
GluMet: 1.895 ± 0.306
GluAsn: 2.707 ± 0.296
GluPro: 1.895 ± 1.338
GluGln: 2.166 ± 0.386
GluArg: 3.519 ± 1.111
GluSer: 5.143 ± 0.842
GluThr: 2.978 ± 0.706
GluVal: 5.143 ± 0.394
GluTrp: 1.083 ± 0.522
GluTyr: 1.624 ± 0.695
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.354 ± 0.759
PheCys: 1.354 ± 0.58
PheAsp: 1.354 ± 0.782
PheGlu: 1.354 ± 0.782
PhePhe: 2.707 ± 0.783
PheGly: 3.249 ± 0.986
PheHis: 2.436 ± 0.649
PheIle: 2.166 ± 0.524
PheLys: 1.895 ± 0.534
PheLeu: 4.602 ± 0.978
PheMet: 0.812 ± 0.501
PheAsn: 1.354 ± 0.344
PhePro: 1.895 ± 0.306
PheGln: 0.541 ± 0.172
PheArg: 2.436 ± 0.841
PheSer: 6.226 ± 1.137
PheThr: 1.895 ± 0.764
PheVal: 1.895 ± 0.502
PheTrp: 0.812 ± 0.208
PheTyr: 0.541 ± 0.334
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.978 ± 0.479
GlyCys: 3.79 ± 2.57
GlyAsp: 4.061 ± 0.66
GlyGlu: 4.331 ± 0.906
GlyPhe: 2.707 ± 0.977
GlyGly: 5.685 ± 1.064
GlyHis: 1.354 ± 0.344
GlyIle: 2.436 ± 0.594
GlyLys: 4.873 ± 1.28
GlyLeu: 8.663 ± 1.027
GlyMet: 2.166 ± 0.408
GlyAsn: 2.436 ± 0.79
GlyPro: 2.978 ± 0.7
GlyGln: 1.354 ± 0.782
GlyArg: 3.79 ± 0.642
GlySer: 6.768 ± 1.844
GlyThr: 3.519 ± 0.943
GlyVal: 7.58 ± 0.995
GlyTrp: 0.541 ± 0.172
GlyTyr: 2.166 ± 0.719
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.624 ± 0.314
HisCys: 0.812 ± 0.501
HisAsp: 0.812 ± 0.208
HisGlu: 1.354 ± 0.344
HisPhe: 1.354 ± 0.58
HisGly: 2.436 ± 0.822
HisHis: 0.271 ± 0.167
HisIle: 0.271 ± 0.167
HisLys: 0.812 ± 0.803
HisLeu: 2.166 ± 0.289
HisMet: 0.271 ± 0.519
HisAsn: 0.541 ± 0.455
HisPro: 0.541 ± 0.172
HisGln: 0.541 ± 0.455
HisArg: 1.083 ± 0.5
HisSer: 3.249 ± 1.326
HisThr: 0.271 ± 0.167
HisVal: 2.166 ± 0.542
HisTrp: 0.271 ± 0.268
HisTyr: 1.083 ± 0.5
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.707 ± 0.998
IleCys: 1.624 ± 0.835
IleAsp: 2.978 ± 1.136
IleGlu: 4.602 ± 0.861
IlePhe: 1.624 ± 0.417
IleGly: 3.249 ± 0.968
IleHis: 1.624 ± 0.835
IleIle: 3.249 ± 0.863
IleLys: 3.249 ± 0.188
IleLeu: 4.873 ± 0.869
IleMet: 2.166 ± 0.542
IleAsn: 2.436 ± 0.841
IlePro: 2.707 ± 1.014
IleGln: 2.166 ± 0.542
IleArg: 2.707 ± 0.863
IleSer: 4.602 ± 2.115
IleThr: 3.519 ± 0.692
IleVal: 2.436 ± 0.678
IleTrp: 1.354 ± 0.782
IleTyr: 1.354 ± 0.836
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.249 ± 0.846
LysCys: 1.624 ± 1.017
LysAsp: 1.354 ± 0.433
LysGlu: 4.331 ± 1.054
LysPhe: 1.354 ± 0.338
LysGly: 3.79 ± 0.786
LysHis: 1.083 ± 0.668
LysIle: 3.249 ± 0.977
LysLys: 6.226 ± 0.535
LysLeu: 6.497 ± 1.265
LysMet: 2.436 ± 0.854
LysAsn: 2.436 ± 0.649
LysPro: 3.249 ± 1.067
LysGln: 1.624 ± 0.417
LysArg: 1.895 ± 1.047
LysSer: 5.685 ± 0.943
LysThr: 5.685 ± 1.001
LysVal: 2.436 ± 1.746
LysTrp: 1.624 ± 0.598
LysTyr: 1.624 ± 0.695
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.873 ± 0.347
LeuCys: 2.166 ± 0.673
LeuAsp: 6.226 ± 2.104
LeuGlu: 7.58 ± 1.685
LeuPhe: 4.331 ± 1.401
LeuGly: 7.309 ± 0.458
LeuHis: 2.707 ± 0.354
LeuIle: 5.143 ± 1.124
LeuLys: 7.038 ± 1.077
LeuLeu: 8.663 ± 1.586
LeuMet: 3.79 ± 1.084
LeuAsn: 4.061 ± 1.155
LeuPro: 3.519 ± 0.979
LeuGln: 2.436 ± 1.473
LeuArg: 5.956 ± 2.187
LeuSer: 9.204 ± 2.059
LeuThr: 6.226 ± 0.901
LeuVal: 4.331 ± 0.976
LeuTrp: 1.895 ± 0.547
LeuTyr: 1.083 ± 0.668
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.436 ± 0.803
MetCys: 0.541 ± 0.414
MetAsp: 2.707 ± 0.737
MetGlu: 2.166 ± 0.524
MetPhe: 1.083 ± 0.578
MetGly: 2.166 ± 0.289
MetHis: 0.0 ± 0.0
MetIle: 1.083 ± 0.35
MetLys: 1.354 ± 0.58
MetLeu: 2.436 ± 0.822
MetMet: 1.624 ± 0.314
MetAsn: 1.354 ± 0.662
MetPro: 0.812 ± 0.208
MetGln: 1.083 ± 0.344
MetArg: 2.166 ± 0.602
MetSer: 1.895 ± 1.521
MetThr: 1.624 ± 0.312
MetVal: 2.436 ± 0.761
MetTrp: 0.541 ± 0.334
MetTyr: 1.083 ± 0.35
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.895 ± 0.485
AsnCys: 0.271 ± 0.268
AsnAsp: 1.083 ± 0.522
AsnGlu: 0.541 ± 0.334
AsnPhe: 0.812 ± 0.501
AsnGly: 1.083 ± 0.344
AsnHis: 1.083 ± 0.336
AsnIle: 2.166 ± 0.975
AsnLys: 1.624 ± 0.314
AsnLeu: 4.602 ± 1.26
AsnMet: 0.812 ± 0.208
AsnAsn: 0.541 ± 0.334
AsnPro: 3.519 ± 0.54
AsnGln: 2.166 ± 0.542
AsnArg: 2.436 ± 0.761
AsnSer: 3.519 ± 1.071
AsnThr: 1.083 ± 1.478
AsnVal: 2.166 ± 0.386
AsnTrp: 0.812 ± 0.448
AsnTyr: 1.354 ± 0.777
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.519 ± 0.572
ProCys: 1.624 ± 0.835
ProAsp: 2.707 ± 0.513
ProGlu: 2.436 ± 0.331
ProPhe: 1.895 ± 0.478
ProGly: 3.519 ± 0.54
ProHis: 0.812 ± 0.418
ProIle: 2.436 ± 0.741
ProLys: 2.436 ± 0.649
ProLeu: 2.436 ± 0.888
ProMet: 0.812 ± 0.418
ProAsn: 1.083 ± 0.522
ProPro: 1.083 ± 0.325
ProGln: 1.624 ± 0.417
ProArg: 1.895 ± 0.811
ProSer: 4.331 ± 1.697
ProThr: 2.707 ± 1.395
ProVal: 1.895 ± 0.699
ProTrp: 0.541 ± 0.334
ProTyr: 0.541 ± 0.455
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.624 ± 0.371
GlnCys: 2.166 ± 1.36
GlnAsp: 1.354 ± 0.338
GlnGlu: 3.519 ± 1.111
GlnPhe: 1.895 ± 0.502
GlnGly: 2.707 ± 0.298
GlnHis: 0.541 ± 0.334
GlnIle: 2.166 ± 0.613
GlnLys: 1.624 ± 0.719
GlnLeu: 1.895 ± 0.982
GlnMet: 0.271 ± 0.167
GlnAsn: 0.541 ± 0.334
GlnPro: 1.083 ± 0.344
GlnGln: 0.271 ± 0.452
GlnArg: 1.354 ± 0.433
GlnSer: 1.354 ± 0.4
GlnThr: 0.812 ± 0.208
GlnVal: 2.436 ± 0.915
GlnTrp: 0.541 ± 0.334
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.79 ± 1.094
ArgCys: 2.707 ± 0.866
ArgAsp: 2.436 ± 1.322
ArgGlu: 4.873 ± 0.864
ArgPhe: 2.707 ± 0.915
ArgGly: 4.602 ± 0.788
ArgHis: 1.083 ± 0.336
ArgIle: 2.978 ± 0.183
ArgLys: 3.249 ± 1.067
ArgLeu: 5.956 ± 1.607
ArgMet: 1.624 ± 0.371
ArgAsn: 1.354 ± 0.338
ArgPro: 2.436 ± 0.625
ArgGln: 1.083 ± 0.336
ArgArg: 2.978 ± 0.672
ArgSer: 4.602 ± 1.271
ArgThr: 2.978 ± 0.401
ArgVal: 4.602 ± 0.872
ArgTrp: 1.354 ± 0.782
ArgTyr: 0.541 ± 0.334
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.873 ± 0.747
SerCys: 2.707 ± 1.229
SerAsp: 4.602 ± 1.158
SerGlu: 7.58 ± 1.244
SerPhe: 3.249 ± 0.502
SerGly: 7.851 ± 3.302
SerHis: 2.978 ± 0.745
SerIle: 4.873 ± 1.91
SerLys: 4.873 ± 0.362
SerLeu: 10.287 ± 2.005
SerMet: 1.083 ± 0.5
SerAsn: 3.79 ± 0.455
SerPro: 3.519 ± 0.856
SerGln: 2.436 ± 0.97
SerArg: 6.226 ± 1.145
SerSer: 9.475 ± 1.464
SerThr: 2.707 ± 0.977
SerVal: 4.873 ± 1.952
SerTrp: 1.895 ± 0.25
SerTyr: 2.978 ± 1.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.166 ± 1.743
ThrCys: 1.083 ± 0.527
ThrAsp: 2.978 ± 0.695
ThrGlu: 3.519 ± 0.688
ThrPhe: 3.249 ± 0.843
ThrGly: 4.061 ± 1.374
ThrHis: 0.812 ± 0.501
ThrIle: 3.519 ± 0.305
ThrLys: 4.602 ± 1.001
ThrLeu: 5.143 ± 0.996
ThrMet: 1.354 ± 0.489
ThrAsn: 1.354 ± 0.433
ThrPro: 1.354 ± 0.338
ThrGln: 1.895 ± 0.811
ThrArg: 1.624 ± 0.881
ThrSer: 3.519 ± 0.856
ThrThr: 2.436 ± 0.5
ThrVal: 2.707 ± 0.639
ThrTrp: 0.271 ± 0.452
ThrTyr: 1.354 ± 0.344
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.79 ± 0.989
ValCys: 1.354 ± 0.58
ValAsp: 4.061 ± 0.559
ValGlu: 5.685 ± 1.997
ValPhe: 2.436 ± 0.92
ValGly: 3.249 ± 1.736
ValHis: 1.083 ± 0.68
ValIle: 4.331 ± 1.376
ValLys: 2.978 ± 1.437
ValLeu: 4.331 ± 0.872
ValMet: 1.895 ± 0.745
ValAsn: 2.166 ± 0.706
ValPro: 2.436 ± 0.955
ValGln: 2.166 ± 0.975
ValArg: 6.768 ± 0.645
ValSer: 6.226 ± 1.7
ValThr: 2.707 ± 0.687
ValVal: 4.602 ± 1.271
ValTrp: 1.354 ± 0.861
ValTyr: 1.354 ± 0.638
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.624 ± 0.312
TrpCys: 0.0 ± 0.0
TrpAsp: 0.541 ± 0.455
TrpGlu: 0.541 ± 0.334
TrpPhe: 1.083 ± 0.35
TrpGly: 1.895 ± 0.25
TrpHis: 0.0 ± 0.0
TrpIle: 1.083 ± 0.344
TrpLys: 1.083 ± 0.336
TrpLeu: 2.707 ± 0.623
TrpMet: 0.812 ± 0.418
TrpAsn: 0.541 ± 0.172
TrpPro: 0.812 ± 0.962
TrpGln: 0.271 ± 0.452
TrpArg: 1.083 ± 0.35
TrpSer: 1.624 ± 0.423
TrpThr: 0.541 ± 0.414
TrpVal: 2.166 ± 0.706
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.541 ± 0.334
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.812 ± 0.208
TyrCys: 0.271 ± 0.167
TyrAsp: 1.083 ± 0.668
TyrGlu: 0.271 ± 0.452
TyrPhe: 0.812 ± 0.208
TyrGly: 1.354 ± 0.4
TyrHis: 1.083 ± 0.35
TyrIle: 0.812 ± 0.501
TyrLys: 2.436 ± 0.401
TyrLeu: 2.978 ± 1.322
TyrMet: 0.812 ± 0.208
TyrAsn: 0.541 ± 0.414
TyrPro: 1.354 ± 1.35
TyrGln: 0.271 ± 0.167
TyrArg: 2.707 ± 0.998
TyrSer: 3.79 ± 1.16
TyrThr: 1.354 ± 0.489
TyrVal: 1.083 ± 0.344
TyrTrp: 0.271 ± 0.167
TyrTyr: 1.354 ± 0.319
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3695 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
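One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and recompute the frequency each time. The sketch below illustrates that reading only; the function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error are all assumptions, since the source does not describe the procedure in detail.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Frequency of one dipeptide in per mille, pooled over the given sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteins, not taken from the proteome above.
proteins = ["AAAA", "ACAC", "CCCC"]
print(pair_frequency(proteins, "AA"), bootstrap_error(proteins, "AA"))
```

Under this reading, a dipeptide that never occurs in any protein gets an error of exactly zero, which is consistent with the 0.0 ± 0.0 entries in the table.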

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski