Amino acid dipeptide frequency for Wenling crustacean virus 15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
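The counting described above can be sketched as follows. This is a minimal illustration, not the database's own code: it assumes one-letter amino acid codes, counts overlapping pairs within each protein only (whether the database counts pairs across protein boundaries is not stated), and uses made-up toy sequences rather than the viral proteins. The function name `dipeptide_per_mille` is hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the page itself uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Count overlapping dipeptides and return their frequencies in per mille."""
    counts = Counter()
    for seq in sequences:
        # Assumption: pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Report all 400 standard dipeptides, including those never observed.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD, repeat=2)}


toy = ["MALLAV", "ALAG"]  # made-up sequences for illustration only
freqs = dipeptide_per_mille(toy)
uniform = 1000.0 / len(freqs)  # 1/400 expressed in per mille

print(f"number of dipeptides: {len(freqs)}")
print(f"uniform expectation: {uniform} per mille")
print(f"AL: {freqs['AL']:.1f} per mille")
```

In the toy data the pair AL occurs twice among eight dipeptides, far above the uniform expectation, which is the kind of deviation the table highlights.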

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
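As a small worked example of the unit conversion, the sketch below turns a few per mille values from the table into fractions and compares them with the uniform expectation of 2.5‰. The helper names are hypothetical, and using the uniform expectation as the threshold is an assumption: the page does not state exactly how its red and blue marking is computed.

```python
# Uniform expectation for 400 dipeptides, in per mille (an assumed threshold).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def label(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille value relative to the expected value."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table below.
examples = {"LeuLeu": 12.515, "AlaAla": 10.684, "CysCys": 0.611}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), label(value))
```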

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.684 ± 2.995
AlaCys: 1.221 ± 0.637
AlaAsp: 5.495 ± 0.739
AlaGlu: 5.495 ± 0.754
AlaPhe: 3.663 ± 0.459
AlaGly: 6.105 ± 1.681
AlaHis: 2.442 ± 0.714
AlaIle: 4.274 ± 0.905
AlaLys: 2.747 ± 0.92
AlaLeu: 11.6 ± 2.147
AlaMet: 2.137 ± 0.949
AlaAsn: 1.526 ± 0.46
AlaPro: 6.105 ± 0.4
AlaGln: 4.579 ± 0.06
AlaArg: 6.716 ± 1.502
AlaSer: 5.495 ± 0.679
AlaThr: 7.937 ± 2.368
AlaVal: 9.463 ± 0.441
AlaTrp: 1.832 ± 0.088
AlaTyr: 4.274 ± 0.207
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.137 ± 0.177
CysCys: 0.611 ± 0.318
CysAsp: 0.916 ± 0.477
CysGlu: 1.832 ± 0.44
CysPhe: 0.611 ± 0.318
CysGly: 0.916 ± 0.694
CysHis: 2.137 ± 1.114
CysIle: 0.611 ± 0.318
CysLys: 0.611 ± 0.318
CysLeu: 1.526 ± 0.187
CysMet: 0.916 ± 0.477
CysAsn: 0.305 ± 0.412
CysPro: 1.832 ± 0.496
CysGln: 0.305 ± 0.159
CysArg: 1.832 ± 0.496
CysSer: 2.442 ± 1.157
CysThr: 0.305 ± 0.159
CysVal: 0.916 ± 0.492
CysTrp: 0.305 ± 0.159
CysTyr: 0.611 ± 0.289
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.8 ± 0.812
AspCys: 0.611 ± 0.289
AspAsp: 2.442 ± 0.714
AspGlu: 3.053 ± 0.375
AspPhe: 1.832 ± 0.955
AspGly: 2.137 ± 0.177
AspHis: 0.916 ± 0.307
AspIle: 2.747 ± 1.339
AspLys: 0.611 ± 0.334
AspLeu: 4.274 ± 0.706
AspMet: 0.916 ± 0.22
AspAsn: 0.611 ± 0.649
AspPro: 4.274 ± 1.286
AspGln: 2.137 ± 0.643
AspArg: 4.884 ± 0.568
AspSer: 3.053 ± 0.72
AspThr: 3.663 ± 0.953
AspVal: 2.747 ± 0.647
AspTrp: 0.916 ± 0.22
AspTyr: 1.526 ± 0.187
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.884 ± 0.568
GluCys: 1.221 ± 0.637
GluAsp: 3.663 ± 0.459
GluGlu: 2.747 ± 0.394
GluPhe: 2.747 ± 0.601
GluGly: 3.358 ± 0.309
GluHis: 3.053 ± 0.751
GluIle: 1.832 ± 0.496
GluLys: 0.305 ± 0.159
GluLeu: 5.495 ± 0.679
GluMet: 1.832 ± 0.604
GluAsn: 0.916 ± 0.307
GluPro: 1.832 ± 1.148
GluGln: 0.916 ± 0.477
GluArg: 3.358 ± 0.309
GluSer: 4.274 ± 0.891
GluThr: 2.442 ± 0.917
GluVal: 6.105 ± 0.546
GluTrp: 0.916 ± 0.22
GluTyr: 1.832 ± 0.868
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.053 ± 0.432
PheCys: 0.611 ± 0.318
PheAsp: 0.305 ± 0.412
PheGlu: 2.442 ± 1.273
PhePhe: 0.305 ± 0.159
PheGly: 1.221 ± 0.637
PheHis: 0.0 ± 0.0
PheIle: 0.916 ± 0.477
PheLys: 1.221 ± 0.336
PheLeu: 5.189 ± 0.605
PheMet: 0.305 ± 0.159
PheAsn: 0.305 ± 0.159
PhePro: 1.832 ± 0.496
PheGln: 1.221 ± 0.252
PheArg: 2.442 ± 0.876
PheSer: 2.442 ± 0.324
PheThr: 0.916 ± 0.477
PheVal: 2.442 ± 0.505
PheTrp: 0.0 ± 0.0
PheTyr: 0.611 ± 0.334
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.274 ± 0.989
GlyCys: 1.526 ± 0.36
GlyAsp: 2.442 ± 0.786
GlyGlu: 3.358 ± 1.163
GlyPhe: 1.526 ± 0.796
GlyGly: 3.358 ± 0.929
GlyHis: 1.526 ± 0.796
GlyIle: 2.442 ± 1.339
GlyLys: 1.832 ± 1.499
GlyLeu: 7.326 ± 0.917
GlyMet: 1.526 ± 0.187
GlyAsn: 2.137 ± 0.932
GlyPro: 4.274 ± 0.769
GlyGln: 1.526 ± 0.489
GlyArg: 4.884 ± 2.449
GlySer: 3.358 ± 0.415
GlyThr: 3.663 ± 0.757
GlyVal: 5.189 ± 0.85
GlyTrp: 1.221 ± 0.336
GlyTyr: 3.053 ± 0.52
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.358 ± 0.795
HisCys: 0.916 ± 0.477
HisAsp: 1.526 ± 0.187
HisGlu: 0.611 ± 0.318
HisPhe: 0.916 ± 0.477
HisGly: 3.358 ± 1.751
HisHis: 2.747 ± 0.135
HisIle: 2.442 ± 0.418
HisLys: 0.305 ± 0.159
HisLeu: 3.663 ± 0.177
HisMet: 0.0 ± 0.0
HisAsn: 0.916 ± 0.22
HisPro: 3.663 ± 0.459
HisGln: 0.916 ± 0.477
HisArg: 3.053 ± 0.637
HisSer: 2.442 ± 0.418
HisThr: 1.832 ± 0.496
HisVal: 1.832 ± 1.081
HisTrp: 0.0 ± 0.0
HisTyr: 1.526 ± 0.36
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.579 ± 0.562
IleCys: 0.611 ± 0.334
IleAsp: 1.526 ± 0.36
IleGlu: 2.442 ± 0.876
IlePhe: 1.221 ± 0.357
IleGly: 1.526 ± 0.46
IleHis: 1.832 ± 0.088
IleIle: 1.526 ± 0.98
IleLys: 2.137 ± 0.643
IleLeu: 1.832 ± 0.496
IleMet: 0.305 ± 0.751
IleAsn: 0.611 ± 0.318
IlePro: 2.442 ± 0.505
IleGln: 1.526 ± 1.14
IleArg: 3.358 ± 0.309
IleSer: 3.968 ± 0.358
IleThr: 1.526 ± 0.796
IleVal: 2.747 ± 1.0
IleTrp: 0.305 ± 0.412
IleTyr: 0.611 ± 0.289
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.663 ± 0.354
LysCys: 0.611 ± 0.318
LysAsp: 0.916 ± 0.477
LysGlu: 0.916 ± 0.22
LysPhe: 0.611 ± 0.318
LysGly: 0.305 ± 0.159
LysHis: 0.611 ± 0.289
LysIle: 1.832 ± 0.955
LysLys: 0.611 ± 0.318
LysLeu: 2.747 ± 1.339
LysMet: 0.611 ± 0.334
LysAsn: 0.0 ± 0.0
LysPro: 0.611 ± 0.649
LysGln: 0.611 ± 0.318
LysArg: 1.832 ± 0.588
LysSer: 1.832 ± 0.955
LysThr: 1.526 ± 0.187
LysVal: 1.221 ± 0.252
LysTrp: 0.611 ± 0.318
LysTyr: 1.526 ± 1.14
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.21 ± 0.543
LeuCys: 3.968 ± 1.187
LeuAsp: 4.274 ± 1.286
LeuGlu: 5.495 ± 1.202
LeuPhe: 2.442 ± 0.235
LeuGly: 6.41 ± 1.705
LeuHis: 3.968 ± 0.765
LeuIle: 2.137 ± 0.177
LeuLys: 2.137 ± 0.643
LeuLeu: 12.515 ± 2.66
LeuMet: 1.526 ± 1.588
LeuAsn: 1.221 ± 0.336
LeuPro: 7.937 ± 2.753
LeuGln: 6.41 ± 1.634
LeuArg: 7.631 ± 1.071
LeuSer: 8.547 ± 0.706
LeuThr: 7.326 ± 1.807
LeuVal: 9.768 ± 0.889
LeuTrp: 1.221 ± 0.252
LeuTyr: 1.832 ± 0.44
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.221 ± 1.298
MetCys: 0.0 ± 0.0
MetAsp: 1.526 ± 1.075
MetGlu: 0.916 ± 0.746
MetPhe: 0.305 ± 0.159
MetGly: 1.221 ± 0.252
MetHis: 0.305 ± 0.423
MetIle: 0.305 ± 0.159
MetLys: 0.611 ± 0.318
MetLeu: 0.916 ± 0.746
MetMet: 0.305 ± 0.159
MetAsn: 0.916 ± 0.307
MetPro: 1.526 ± 0.732
MetGln: 0.611 ± 0.334
MetArg: 1.221 ± 0.357
MetSer: 1.221 ± 1.166
MetThr: 0.611 ± 0.318
MetVal: 2.442 ± 0.714
MetTrp: 0.916 ± 0.22
MetTyr: 0.916 ± 0.307
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.526 ± 0.687
AsnCys: 0.916 ± 0.22
AsnAsp: 0.305 ± 0.159
AsnGlu: 0.611 ± 0.318
AsnPhe: 0.305 ± 0.423
AsnGly: 1.832 ± 0.44
AsnHis: 0.611 ± 0.318
AsnIle: 0.916 ± 0.694
AsnLys: 0.305 ± 0.159
AsnLeu: 3.053 ± 1.18
AsnMet: 0.611 ± 0.571
AsnAsn: 0.305 ± 0.159
AsnPro: 1.221 ± 0.843
AsnGln: 0.611 ± 0.334
AsnArg: 1.221 ± 0.336
AsnSer: 0.611 ± 0.318
AsnThr: 1.221 ± 0.357
AsnVal: 1.221 ± 0.336
AsnTrp: 0.305 ± 0.412
AsnTyr: 0.916 ± 0.307
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.8 ± 1.229
ProCys: 0.916 ± 0.22
ProAsp: 3.358 ± 1.334
ProGlu: 4.274 ± 0.353
ProPhe: 0.916 ± 0.694
ProGly: 4.884 ± 0.649
ProHis: 3.358 ± 0.852
ProIle: 1.221 ± 0.637
ProLys: 0.916 ± 0.694
ProLeu: 7.021 ± 1.23
ProMet: 1.526 ± 0.257
ProAsn: 1.526 ± 0.46
ProPro: 2.137 ± 0.495
ProGln: 1.221 ± 1.166
ProArg: 2.747 ± 0.66
ProSer: 7.326 ± 2.124
ProThr: 4.884 ± 0.665
ProVal: 4.274 ± 0.621
ProTrp: 0.916 ± 0.694
ProTyr: 1.526 ± 0.36
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.579 ± 1.411
GlnCys: 0.611 ± 0.318
GlnAsp: 1.832 ± 0.088
GlnGlu: 1.526 ± 0.796
GlnPhe: 1.221 ± 0.637
GlnGly: 1.526 ± 0.687
GlnHis: 1.526 ± 0.36
GlnIle: 0.916 ± 0.492
GlnLys: 0.305 ± 0.159
GlnLeu: 3.968 ± 0.703
GlnMet: 0.916 ± 0.477
GlnAsn: 1.221 ± 0.357
GlnPro: 3.358 ± 0.795
GlnGln: 1.832 ± 0.496
GlnArg: 3.053 ± 0.432
GlnSer: 2.442 ± 0.876
GlnThr: 2.747 ± 0.647
GlnVal: 2.747 ± 1.058
GlnTrp: 0.611 ± 0.847
GlnTyr: 1.221 ± 0.579
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.495 ± 0.436
ArgCys: 1.832 ± 0.531
ArgAsp: 3.968 ± 0.765
ArgGlu: 4.884 ± 0.902
ArgPhe: 2.747 ± 1.0
ArgGly: 3.968 ± 0.856
ArgHis: 4.274 ± 1.286
ArgIle: 3.053 ± 0.751
ArgLys: 2.442 ± 0.876
ArgLeu: 6.41 ± 1.57
ArgMet: 1.221 ± 1.166
ArgAsn: 2.137 ± 1.268
ArgPro: 3.358 ± 0.916
ArgGln: 3.053 ± 0.751
ArgArg: 5.495 ± 0.951
ArgSer: 6.105 ± 1.273
ArgThr: 4.884 ± 0.649
ArgVal: 3.663 ± 0.879
ArgTrp: 2.747 ± 0.949
ArgTyr: 1.221 ± 0.357
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.631 ± 0.937
SerCys: 1.832 ± 0.44
SerAsp: 4.884 ± 0.471
SerGlu: 3.053 ± 0.178
SerPhe: 1.526 ± 0.46
SerGly: 5.189 ± 0.688
SerHis: 1.832 ± 0.955
SerIle: 2.442 ± 0.795
SerLys: 2.137 ± 0.643
SerLeu: 9.158 ± 1.672
SerMet: 1.221 ± 1.166
SerAsn: 1.526 ± 0.187
SerPro: 4.579 ± 1.006
SerGln: 3.663 ± 1.071
SerArg: 3.358 ± 0.916
SerSer: 6.716 ± 2.542
SerThr: 5.495 ± 0.679
SerVal: 4.579 ± 0.06
SerTrp: 1.221 ± 0.252
SerTyr: 3.053 ± 0.919
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.716 ± 1.037
ThrCys: 0.916 ± 0.22
ThrAsp: 3.358 ± 0.682
ThrGlu: 4.274 ± 1.31
ThrPhe: 1.526 ± 0.796
ThrGly: 3.663 ± 0.757
ThrHis: 1.832 ± 0.531
ThrIle: 2.442 ± 0.786
ThrLys: 0.611 ± 0.318
ThrLeu: 9.768 ± 1.298
ThrMet: 0.916 ± 0.477
ThrAsn: 0.611 ± 0.318
ThrPro: 2.442 ± 0.7
ThrGln: 2.747 ± 0.647
ThrArg: 4.579 ± 0.61
ThrSer: 4.884 ± 0.428
ThrThr: 3.968 ± 0.364
ThrVal: 5.189 ± 0.85
ThrTrp: 2.137 ± 0.379
ThrTyr: 1.526 ± 0.187
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.158 ± 1.124
ValCys: 1.832 ± 0.588
ValAsp: 4.274 ± 0.207
ValGlu: 3.053 ± 0.751
ValPhe: 1.526 ± 0.36
ValGly: 5.189 ± 3.216
ValHis: 1.526 ± 0.489
ValIle: 2.137 ± 0.446
ValLys: 2.137 ± 0.495
ValLeu: 7.631 ± 1.6
ValMet: 0.916 ± 0.477
ValAsn: 0.611 ± 0.318
ValPro: 4.579 ± 0.992
ValGln: 3.968 ± 1.75
ValArg: 6.716 ± 0.479
ValSer: 5.8 ± 2.793
ValThr: 5.495 ± 1.169
ValVal: 8.242 ± 0.658
ValTrp: 1.526 ± 0.36
ValTyr: 2.442 ± 0.714
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.747 ± 0.48
TrpCys: 0.305 ± 0.159
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.916 ± 0.694
TrpPhe: 1.221 ± 0.637
TrpGly: 1.221 ± 0.252
TrpHis: 0.0 ± 0.0
TrpIle: 0.916 ± 0.492
TrpLys: 0.611 ± 0.289
TrpLeu: 0.916 ± 0.477
TrpMet: 0.0 ± 0.0
TrpAsn: 0.611 ± 0.318
TrpPro: 0.305 ± 0.159
TrpGln: 0.0 ± 0.0
TrpArg: 2.442 ± 0.235
TrpSer: 0.916 ± 0.477
TrpThr: 2.137 ± 0.827
TrpVal: 1.832 ± 0.44
TrpTrp: 0.611 ± 0.824
TrpTyr: 1.526 ± 0.98
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.274 ± 0.353
TyrCys: 0.611 ± 0.289
TyrAsp: 2.137 ± 0.379
TyrGlu: 1.832 ± 0.955
TyrPhe: 0.611 ± 0.318
TyrGly: 3.053 ± 2.16
TyrHis: 1.526 ± 0.489
TyrIle: 1.832 ± 0.496
TyrLys: 0.611 ± 0.289
TyrLeu: 3.968 ± 1.324
TyrMet: 0.0 ± 0.0
TyrAsn: 0.611 ± 0.649
TyrPro: 2.442 ± 0.917
TyrGln: 0.305 ± 0.159
TyrArg: 2.137 ± 0.729
TyrSer: 1.221 ± 0.579
TyrThr: 1.526 ± 0.687
TyrVal: 2.137 ± 1.027
TyrTrp: 0.916 ± 0.477
TyrTyr: 1.221 ± 0.336
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3277 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
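One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an illustrative assumption, not the database's documented procedure; the function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(per_mille(resample, dipeptide))
    return statistics.pstdev(estimates)


toy = ["MALLAVALA", "GGSALKK", "PLLAAL"]  # made-up sequences, not the viral proteins
print(per_mille(toy, "AL"), bootstrap_error(toy, "AL"))
```

With only three proteins, as in this proteome, each resample can differ strongly from the original set, which may explain why some errors in the table are as large as the values themselves.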

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski