Amino acid dipeptide frequency for Piscine myocarditis-like virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
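
As a rough illustration of such a calculation, the sketch below counts overlapping adjacent residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes (with `X` standing for Xaa) and the choice to normalize by the total number of pairs are assumptions made for this example, and the toy sequences are not the real proteome.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (hypothetical), just to show the call.
freqs = dipeptide_per_mille(["MKKLA", "AKKB"])
```

Here the two toy sequences yield seven pairs in total, two of which are `KK`, and the `B` in the second sequence is counted as `X`.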

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
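
The arithmetic behind the expected value can be sketched as follows. The helper names `expected_per_mille` and `classify` are hypothetical, and the plain greater-than / less-than comparison is an assumption: the page does not state the exact rule (or the alphabet size, 20 or 21) used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected=None):
    """Label a table value relative to the uniform expectation (assumed rule)."""
    if expected is None:
        expected = expected_per_mille()
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


ala_ala = classify(10.256)  # AlaAla value from the table
ala_cys = classify(0.513)   # AlaCys value from the table
```

With 20 amino acids the expectation is 1000 / 400 = 2.5‰; with Xaa included it would be 1000 / 441, roughly 2.27‰.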

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.256 ± 3.119
AlaCys: 0.513 ± 0.641
AlaAsp: 3.077 ± 1.628
AlaGlu: 5.128 ± 1.176
AlaPhe: 0.513 ± 0.377
AlaGly: 5.128 ± 1.175
AlaHis: 0.513 ± 0.377
AlaIle: 4.615 ± 1.161
AlaLys: 2.564 ± 1.096
AlaLeu: 7.692 ± 1.115
AlaMet: 3.59 ± 1.387
AlaAsn: 3.077 ± 0.338
AlaPro: 3.077 ± 1.628
AlaGln: 5.128 ± 1.915
AlaArg: 3.077 ± 1.054
AlaSer: 4.103 ± 0.941
AlaThr: 4.615 ± 2.139
AlaVal: 5.641 ± 3.541
AlaTrp: 1.026 ± 0.754
AlaTyr: 0.513 ± 0.377
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.538 ± 0.749
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.513 ± 0.342
CysPhe: 0.0 ± 0.0
CysGly: 0.513 ± 0.342
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 1.026 ± 0.235
CysLeu: 0.0 ± 0.0
CysMet: 0.513 ± 0.377
CysAsn: 0.513 ± 0.377
CysPro: 0.513 ± 0.641
CysGln: 0.0 ± 0.0
CysArg: 1.026 ± 0.66
CysSer: 1.538 ± 0.833
CysThr: 1.026 ± 0.586
CysVal: 1.026 ± 1.281
CysTrp: 0.0 ± 0.0
CysTyr: 1.026 ± 0.235
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.538 ± 1.131
AspCys: 0.513 ± 0.641
AspAsp: 2.051 ± 1.37
AspGlu: 2.051 ± 1.37
AspPhe: 2.051 ± 0.886
AspGly: 2.051 ± 0.765
AspHis: 0.513 ± 0.641
AspIle: 4.103 ± 0.941
AspLys: 2.564 ± 1.384
AspLeu: 2.564 ± 0.632
AspMet: 1.538 ± 0.527
AspAsn: 3.077 ± 0.338
AspPro: 1.538 ± 1.027
AspGln: 1.026 ± 0.754
AspArg: 4.615 ± 1.161
AspSer: 3.59 ± 1.208
AspThr: 3.59 ± 0.708
AspVal: 2.564 ± 0.632
AspTrp: 2.564 ± 0.713
AspTyr: 1.026 ± 0.685
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.103 ± 1.754
GluCys: 0.0 ± 0.0
GluAsp: 3.077 ± 1.709
GluGlu: 3.59 ± 1.772
GluPhe: 2.051 ± 0.765
GluGly: 3.59 ± 1.408
GluHis: 0.513 ± 0.377
GluIle: 4.615 ± 2.765
GluLys: 2.564 ± 1.712
GluLeu: 2.051 ± 0.765
GluMet: 2.051 ± 0.471
GluAsn: 4.615 ± 0.686
GluPro: 3.077 ± 1.054
GluGln: 1.026 ± 0.235
GluArg: 1.538 ± 0.749
GluSer: 3.077 ± 0.901
GluThr: 2.051 ± 0.468
GluVal: 5.641 ± 2.678
GluTrp: 0.0 ± 0.0
GluTyr: 4.103 ± 1.072
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.051 ± 1.508
PheCys: 0.513 ± 0.342
PheAsp: 2.564 ± 1.096
PheGlu: 3.077 ± 1.016
PhePhe: 1.026 ± 0.235
PheGly: 2.051 ± 0.422
PheHis: 0.0 ± 0.0
PheIle: 1.026 ± 0.66
PheLys: 1.538 ± 0.833
PheLeu: 2.564 ± 0.948
PheMet: 1.026 ± 0.235
PheAsn: 2.051 ± 1.046
PhePro: 2.051 ± 0.886
PheGln: 1.026 ± 0.685
PheArg: 3.077 ± 0.901
PheSer: 1.026 ± 0.685
PheThr: 3.077 ± 1.433
PheVal: 2.051 ± 0.886
PheTrp: 1.026 ± 0.235
PheTyr: 1.538 ± 0.527
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.154 ± 2.057
GlyCys: 0.513 ± 0.377
GlyAsp: 3.077 ± 0.351
GlyGlu: 2.564 ± 0.632
GlyPhe: 5.128 ± 1.175
GlyGly: 9.231 ± 2.933
GlyHis: 1.026 ± 1.281
GlyIle: 5.641 ± 1.711
GlyLys: 5.641 ± 1.369
GlyLeu: 7.179 ± 1.877
GlyMet: 0.0 ± 0.0
GlyAsn: 2.564 ± 0.724
GlyPro: 3.077 ± 2.262
GlyGln: 1.538 ± 1.131
GlyArg: 4.615 ± 1.321
GlySer: 3.59 ± 1.208
GlyThr: 4.103 ± 0.844
GlyVal: 9.231 ± 1.559
GlyTrp: 3.59 ± 0.807
GlyTyr: 4.615 ± 1.352
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.513 ± 0.377
HisCys: 0.0 ± 0.0
HisAsp: 0.513 ± 0.641
HisGlu: 0.513 ± 0.641
HisPhe: 0.0 ± 0.0
HisGly: 1.538 ± 1.255
HisHis: 0.513 ± 0.377
HisIle: 0.513 ± 0.342
HisLys: 0.0 ± 0.0
HisLeu: 1.538 ± 0.833
HisMet: 0.513 ± 0.377
HisAsn: 0.513 ± 0.342
HisPro: 1.026 ± 0.235
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 0.513 ± 0.641
HisThr: 1.538 ± 0.527
HisVal: 2.051 ± 0.886
HisTrp: 0.0 ± 0.0
HisTyr: 0.513 ± 0.377
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.077 ± 1.433
IleCys: 1.538 ± 0.406
IleAsp: 3.59 ± 0.841
IleGlu: 1.538 ± 0.527
IlePhe: 1.026 ± 0.235
IleGly: 4.615 ± 1.08
IleHis: 1.538 ± 1.027
IleIle: 1.026 ± 1.281
IleLys: 4.615 ± 2.335
IleLeu: 6.667 ± 1.581
IleMet: 2.051 ± 0.934
IleAsn: 2.564 ± 1.096
IlePro: 2.051 ± 0.765
IleGln: 1.026 ± 0.754
IleArg: 5.128 ± 1.655
IleSer: 5.128 ± 1.674
IleThr: 2.564 ± 1.384
IleVal: 7.179 ± 2.26
IleTrp: 1.026 ± 0.586
IleTyr: 2.051 ± 1.173
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.026 ± 0.685
LysCys: 0.513 ± 0.641
LysAsp: 2.564 ± 1.096
LysGlu: 2.564 ± 1.096
LysPhe: 1.538 ± 0.833
LysGly: 4.615 ± 1.932
LysHis: 0.513 ± 0.377
LysIle: 4.103 ± 1.533
LysLys: 22.051 ± 24.736
LysLeu: 6.154 ± 3.684
LysMet: 2.051 ± 0.23
LysAsn: 2.051 ± 0.468
LysPro: 1.026 ± 0.685
LysGln: 2.564 ± 1.463
LysArg: 3.077 ± 1.12
LysSer: 2.564 ± 1.384
LysThr: 5.641 ± 3.317
LysVal: 6.154 ± 1.833
LysTrp: 1.538 ± 1.027
LysTyr: 0.0 ± 0.0
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.179 ± 2.775
LeuCys: 1.026 ± 0.685
LeuAsp: 6.667 ± 2.623
LeuGlu: 4.615 ± 0.301
LeuPhe: 2.564 ± 0.69
LeuGly: 8.718 ± 1.22
LeuHis: 1.538 ± 1.169
LeuIle: 7.179 ± 4.559
LeuLys: 4.615 ± 1.803
LeuLeu: 6.154 ± 1.353
LeuMet: 3.59 ± 1.298
LeuAsn: 1.538 ± 0.527
LeuPro: 2.564 ± 0.724
LeuGln: 2.051 ± 1.319
LeuArg: 4.615 ± 1.161
LeuSer: 5.128 ± 2.094
LeuThr: 4.615 ± 0.798
LeuVal: 4.103 ± 0.347
LeuTrp: 3.59 ± 0.659
LeuTyr: 1.026 ± 0.235
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.077 ± 0.338
MetCys: 1.026 ± 0.754
MetAsp: 0.0 ± 0.0
MetGlu: 2.051 ± 0.422
MetPhe: 1.538 ± 1.027
MetGly: 2.051 ± 0.886
MetHis: 0.0 ± 0.0
MetIle: 0.513 ± 0.377
MetLys: 3.077 ± 1.12
MetLeu: 3.077 ± 1.029
MetMet: 0.513 ± 0.342
MetAsn: 1.026 ± 0.754
MetPro: 0.513 ± 0.342
MetGln: 0.0 ± 0.0
MetArg: 3.59 ± 1.408
MetSer: 1.538 ± 0.406
MetThr: 0.0 ± 0.0
MetVal: 2.051 ± 0.471
MetTrp: 0.0 ± 0.0
MetTyr: 1.538 ± 0.749
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.077 ± 1.054
AsnCys: 1.026 ± 0.586
AsnAsp: 2.564 ± 1.096
AsnGlu: 1.538 ± 0.406
AsnPhe: 1.538 ± 0.406
AsnGly: 2.051 ± 0.422
AsnHis: 1.026 ± 0.754
AsnIle: 3.077 ± 0.901
AsnLys: 2.564 ± 1.028
AsnLeu: 4.615 ± 0.596
AsnMet: 1.026 ± 0.235
AsnAsn: 2.051 ± 0.765
AsnPro: 3.077 ± 1.628
AsnGln: 2.564 ± 0.171
AsnArg: 3.59 ± 0.841
AsnSer: 2.051 ± 0.765
AsnThr: 2.564 ± 1.36
AsnVal: 4.103 ± 0.844
AsnTrp: 2.564 ± 1.096
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.59 ± 0.659
ProCys: 0.513 ± 0.641
ProAsp: 1.538 ± 0.406
ProGlu: 1.538 ± 1.131
ProPhe: 3.077 ± 0.706
ProGly: 4.103 ± 1.244
ProHis: 0.0 ± 0.0
ProIle: 1.026 ± 0.235
ProLys: 1.026 ± 0.685
ProLeu: 5.641 ± 1.04
ProMet: 0.513 ± 0.377
ProAsn: 1.538 ± 0.527
ProPro: 4.103 ± 2.827
ProGln: 1.538 ± 0.451
ProArg: 1.538 ± 0.451
ProSer: 2.051 ± 0.886
ProThr: 3.59 ± 1.167
ProVal: 5.128 ± 1.933
ProTrp: 1.026 ± 0.754
ProTyr: 1.026 ± 0.754
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.564 ± 0.69
GlnCys: 0.513 ± 0.377
GlnAsp: 1.026 ± 0.235
GlnGlu: 1.026 ± 0.235
GlnPhe: 0.513 ± 0.641
GlnGly: 2.564 ± 0.171
GlnHis: 0.0 ± 0.0
GlnIle: 0.513 ± 0.377
GlnLys: 1.538 ± 1.922
GlnLeu: 2.564 ± 0.632
GlnMet: 1.026 ± 0.605
GlnAsn: 3.077 ± 0.351
GlnPro: 2.564 ± 1.028
GlnGln: 1.538 ± 0.527
GlnArg: 1.538 ± 1.131
GlnSer: 2.564 ± 1.255
GlnThr: 2.564 ± 0.948
GlnVal: 3.59 ± 1.408
GlnTrp: 0.513 ± 0.342
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.179 ± 3.394
ArgCys: 0.0 ± 0.0
ArgAsp: 2.564 ± 1.712
ArgGlu: 5.128 ± 1.175
ArgPhe: 3.59 ± 1.208
ArgGly: 4.615 ± 1.352
ArgHis: 1.026 ± 0.235
ArgIle: 3.59 ± 0.067
ArgLys: 5.128 ± 0.783
ArgLeu: 4.615 ± 1.157
ArgMet: 1.538 ± 0.527
ArgAsn: 1.538 ± 0.527
ArgPro: 3.077 ± 0.706
ArgGln: 1.026 ± 0.754
ArgArg: 4.103 ± 2.111
ArgSer: 3.077 ± 0.901
ArgThr: 2.564 ± 0.632
ArgVal: 6.154 ± 1.306
ArgTrp: 1.538 ± 0.527
ArgTyr: 3.077 ± 1.016
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.59 ± 1.408
SerCys: 1.538 ± 0.451
SerAsp: 2.051 ± 0.765
SerGlu: 3.59 ± 2.019
SerPhe: 2.051 ± 0.765
SerGly: 6.154 ± 1.66
SerHis: 0.0 ± 0.0
SerIle: 4.103 ± 0.347
SerLys: 3.077 ± 1.433
SerLeu: 5.641 ± 1.255
SerMet: 2.051 ± 0.422
SerAsn: 1.538 ± 0.833
SerPro: 0.513 ± 0.377
SerGln: 3.077 ± 0.338
SerArg: 6.667 ± 2.104
SerSer: 5.641 ± 2.015
SerThr: 4.103 ± 1.53
SerVal: 2.051 ± 0.471
SerTrp: 1.026 ± 0.685
SerTyr: 0.513 ± 0.641
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.615 ± 0.301
ThrCys: 0.513 ± 0.342
ThrAsp: 1.538 ± 0.749
ThrGlu: 2.564 ± 0.171
ThrPhe: 2.564 ± 1.028
ThrGly: 3.59 ± 1.338
ThrHis: 1.026 ± 0.235
ThrIle: 6.154 ± 0.681
ThrLys: 4.103 ± 2.638
ThrLeu: 3.59 ± 2.279
ThrMet: 1.026 ± 0.586
ThrAsn: 5.128 ± 2.496
ThrPro: 3.077 ± 1.054
ThrGln: 1.538 ± 1.131
ThrArg: 4.103 ± 1.771
ThrSer: 3.077 ± 1.054
ThrThr: 4.615 ± 2.139
ThrVal: 5.128 ± 0.739
ThrTrp: 1.026 ± 0.586
ThrTyr: 1.538 ± 0.749
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.615 ± 2.753
ValCys: 0.0 ± 0.0
ValAsp: 5.128 ± 0.587
ValGlu: 7.179 ± 1.464
ValPhe: 2.564 ± 1.096
ValGly: 9.744 ± 3.948
ValHis: 2.051 ± 0.468
ValIle: 4.103 ± 1.072
ValLys: 3.077 ± 1.12
ValLeu: 7.179 ± 2.095
ValMet: 1.026 ± 0.235
ValAsn: 6.667 ± 0.31
ValPro: 5.641 ± 1.731
ValGln: 3.077 ± 0.811
ValArg: 7.179 ± 1.78
ValSer: 3.077 ± 0.901
ValThr: 4.615 ± 1.08
ValVal: 8.205 ± 1.078
ValTrp: 0.513 ± 0.377
ValTyr: 1.026 ± 0.685
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.564 ± 0.632
TrpCys: 0.0 ± 0.0
TrpAsp: 0.513 ± 0.377
TrpGlu: 2.564 ± 1.384
TrpPhe: 0.0 ± 0.0
TrpGly: 3.59 ± 0.067
TrpHis: 0.0 ± 0.0
TrpIle: 2.564 ± 1.384
TrpLys: 1.026 ± 0.66
TrpLeu: 0.513 ± 0.342
TrpMet: 0.513 ± 0.342
TrpAsn: 1.538 ± 0.406
TrpPro: 0.0 ± 0.0
TrpGln: 1.026 ± 0.754
TrpArg: 1.538 ± 0.451
TrpSer: 1.538 ± 0.749
TrpThr: 1.538 ± 1.131
TrpVal: 2.564 ± 0.632
TrpTrp: 0.513 ± 0.641
TrpTyr: 0.513 ± 0.377
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.538 ± 0.527
TyrCys: 0.513 ± 0.641
TyrAsp: 1.026 ± 0.685
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.026 ± 0.66
TyrGly: 2.564 ± 0.724
TyrHis: 0.513 ± 0.342
TyrIle: 1.538 ± 0.406
TyrLys: 1.026 ± 0.685
TyrLeu: 3.077 ± 1.016
TyrMet: 0.513 ± 0.377
TyrAsn: 0.0 ± 0.0
TyrPro: 1.538 ± 1.131
TyrGln: 1.026 ± 0.235
TyrArg: 0.513 ± 0.377
TyrSer: 4.103 ± 0.347
TyrThr: 1.538 ± 0.749
TyrVal: 2.051 ± 0.471
TyrTrp: 1.538 ± 0.527
TyrTyr: 0.513 ± 0.377
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1951 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
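
One way such a protein-level bootstrap could look is sketched below. This is an assumption-laden illustration, not the procedure actually used by the database: the helper names `pair_counts` and `bootstrap_error`, the fixed seed, and the use of the population standard deviation as the error measure are all choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the per-mille frequency of `pair` over protein-level resamples."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumed error measure: population standard deviation of the replicates.
    return statistics.pstdev(estimates)


# Toy sequences (hypothetical), not the real proteome.
err = bootstrap_error(["MKKLA", "AKKA", "GGGG"], "KK")
```

Because whole proteins are resampled, a dipeptide concentrated in a single protein of a very small proteome can receive an error larger than its mean, which is consistent with an entry such as LysLys: 22.051 ± 24.736 above.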

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski