Amino acid dipeptide frequency for Beihai razor shell virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
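As a rough illustration of how such a table could be computed, the sketch below counts overlapping residue pairs and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet, the mapping of unknown residues to `X`, and the choice not to count pairs across protein boundaries are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# 20 standard residues (one-letter code); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; no pair spans two proteins
        # (an assumption of this sketch).
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the proteome described on this page).
freqs = dipeptide_per_mille(["MKVLAAGLLA", "AAXGL"])
print(round(freqs["AA"], 3))
```

Returning all 441 keys, including pairs that never occur, mirrors the layout of the table, where absent dipeptides appear as 0.0.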

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
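The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not state the exact threshold used for the red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (= 0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla and CysPhe.
print(per_mille_to_fraction(7.524), classify(7.524))
print(per_mille_to_fraction(0.627), classify(0.627))
```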

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.524 ± 1.374
AlaCys: 4.389 ± 1.235
AlaAsp: 5.016 ± 0.916
AlaGlu: 4.389 ± 0.33
AlaPhe: 2.508 ± 0.447
AlaGly: 5.016 ± 0.916
AlaHis: 1.254 ± 0.224
AlaIle: 6.27 ± 0.213
AlaLys: 3.135 ± 1.704
AlaLeu: 8.777 ± 0.66
AlaMet: 1.254 ± 0.224
AlaAsn: 3.135 ± 1.704
AlaPro: 1.881 ± 1.022
AlaGln: 5.016 ± 1.821
AlaArg: 5.643 ± 0.352
AlaSer: 8.777 ± 2.961
AlaThr: 6.27 ± 2.023
AlaVal: 7.524 ± 0.469
AlaTrp: 1.254 ± 0.682
AlaTyr: 1.254 ± 0.224
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.254 ± 0.224
CysCys: 1.881 ± 0.788
CysAsp: 1.254 ± 0.224
CysGlu: 1.881 ± 0.788
CysPhe: 0.627 ± 0.341
CysGly: 0.627 ± 0.564
CysHis: 0.627 ± 0.564
CysIle: 0.627 ± 0.341
CysLys: 1.254 ± 1.129
CysLeu: 1.881 ± 0.117
CysMet: 1.254 ± 0.682
CysAsn: 0.0 ± 0.0
CysPro: 1.254 ± 0.682
CysGln: 3.135 ± 1.917
CysArg: 0.627 ± 0.564
CysSer: 2.508 ± 0.447
CysThr: 0.627 ± 0.341
CysVal: 0.627 ± 0.341
CysTrp: 0.627 ± 0.564
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.016 ± 0.894
AspCys: 0.627 ± 0.564
AspAsp: 3.135 ± 1.012
AspGlu: 5.643 ± 1.459
AspPhe: 0.627 ± 0.341
AspGly: 3.135 ± 1.012
AspHis: 1.254 ± 0.224
AspIle: 1.881 ± 0.117
AspLys: 1.881 ± 1.022
AspLeu: 2.508 ± 0.458
AspMet: 0.0 ± 0.0
AspAsn: 1.881 ± 0.788
AspPro: 3.135 ± 1.704
AspGln: 2.508 ± 0.447
AspArg: 5.016 ± 0.011
AspSer: 3.135 ± 0.799
AspThr: 1.254 ± 1.129
AspVal: 2.508 ± 0.447
AspTrp: 1.881 ± 0.117
AspTyr: 3.135 ± 0.106
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.27 ± 1.598
GluCys: 1.254 ± 1.129
GluAsp: 1.254 ± 1.129
GluGlu: 5.016 ± 0.916
GluPhe: 2.508 ± 0.458
GluGly: 3.762 ± 1.576
GluHis: 1.254 ± 0.224
GluIle: 5.016 ± 0.011
GluLys: 1.881 ± 0.788
GluLeu: 6.27 ± 1.598
GluMet: 3.135 ± 1.012
GluAsn: 1.254 ± 0.224
GluPro: 1.881 ± 0.788
GluGln: 3.762 ± 1.14
GluArg: 5.016 ± 1.8
GluSer: 3.762 ± 2.045
GluThr: 4.389 ± 0.33
GluVal: 3.762 ± 0.234
GluTrp: 1.254 ± 1.129
GluTyr: 1.881 ± 0.117
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.508 ± 1.353
PheCys: 1.254 ± 0.224
PheAsp: 3.762 ± 2.481
PheGlu: 3.135 ± 0.799
PhePhe: 1.881 ± 0.788
PheGly: 5.643 ± 4.175
PheHis: 0.627 ± 0.341
PheIle: 0.0 ± 0.0
PheLys: 1.254 ± 1.129
PheLeu: 3.135 ± 1.012
PheMet: 0.0 ± 0.0
PheAsn: 3.135 ± 0.106
PhePro: 0.627 ± 0.341
PheGln: 1.254 ± 0.682
PheArg: 1.881 ± 0.117
PheSer: 3.762 ± 1.576
PheThr: 3.135 ± 1.012
PheVal: 2.508 ± 1.363
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.389 ± 2.141
GlyCys: 1.881 ± 0.788
GlyAsp: 1.881 ± 0.117
GlyGlu: 4.389 ± 1.235
GlyPhe: 5.016 ± 0.894
GlyGly: 2.508 ± 0.447
GlyHis: 0.627 ± 0.341
GlyIle: 0.0 ± 0.0
GlyLys: 3.135 ± 0.106
GlyLeu: 5.643 ± 2.162
GlyMet: 1.881 ± 1.022
GlyAsn: 1.881 ± 0.117
GlyPro: 1.881 ± 0.788
GlyGln: 3.135 ± 0.106
GlyArg: 4.389 ± 3.046
GlySer: 7.524 ± 0.436
GlyThr: 3.762 ± 1.14
GlyVal: 8.15 ± 3.717
GlyTrp: 3.135 ± 1.012
GlyTyr: 1.254 ± 0.224
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.881 ± 0.117
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.627 ± 0.564
HisPhe: 0.627 ± 0.564
HisGly: 1.254 ± 0.224
HisHis: 1.254 ± 0.224
HisIle: 0.627 ± 0.564
HisLys: 0.627 ± 0.564
HisLeu: 4.389 ± 2.141
HisMet: 0.627 ± 0.564
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.254 ± 0.224
HisArg: 0.0 ± 0.0
HisSer: 0.0 ± 0.0
HisThr: 1.881 ± 0.117
HisVal: 3.762 ± 1.14
HisTrp: 0.0 ± 0.0
HisTyr: 1.254 ± 0.682
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.27 ± 0.692
IleCys: 1.881 ± 0.117
IleAsp: 3.135 ± 0.106
IleGlu: 2.508 ± 0.447
IlePhe: 1.881 ± 1.693
IleGly: 3.135 ± 1.012
IleHis: 0.627 ± 0.564
IleIle: 1.881 ± 0.117
IleLys: 1.881 ± 1.022
IleLeu: 1.254 ± 1.129
IleMet: 1.254 ± 0.682
IleAsn: 2.508 ± 0.458
IlePro: 1.254 ± 0.682
IleGln: 1.254 ± 0.682
IleArg: 3.762 ± 1.14
IleSer: 1.254 ± 1.129
IleThr: 1.254 ± 0.682
IleVal: 3.135 ± 0.106
IleTrp: 0.627 ± 0.341
IleTyr: 1.881 ± 1.022
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.389 ± 1.48
LysCys: 0.0 ± 0.0
LysAsp: 1.254 ± 0.224
LysGlu: 1.254 ± 0.682
LysPhe: 0.627 ± 0.564
LysGly: 4.389 ± 0.575
LysHis: 1.254 ± 0.224
LysIle: 1.254 ± 0.224
LysLys: 0.0 ± 0.0
LysLeu: 3.762 ± 0.234
LysMet: 1.254 ± 0.559
LysAsn: 0.0 ± 0.0
LysPro: 3.135 ± 0.106
LysGln: 1.254 ± 0.224
LysArg: 1.881 ± 1.022
LysSer: 4.389 ± 1.48
LysThr: 1.254 ± 1.129
LysVal: 4.389 ± 0.33
LysTrp: 0.627 ± 0.341
LysTyr: 0.627 ± 0.564
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.897 ± 1.033
LeuCys: 3.135 ± 0.106
LeuAsp: 5.643 ± 1.459
LeuGlu: 3.762 ± 0.234
LeuPhe: 3.135 ± 1.012
LeuGly: 6.27 ± 1.118
LeuHis: 1.254 ± 0.682
LeuIle: 3.762 ± 0.671
LeuLys: 3.762 ± 0.234
LeuLeu: 8.15 ± 1.906
LeuMet: 1.254 ± 0.224
LeuAsn: 5.016 ± 0.011
LeuPro: 5.643 ± 0.554
LeuGln: 2.508 ± 0.447
LeuArg: 6.27 ± 0.213
LeuSer: 10.031 ± 0.884
LeuThr: 3.135 ± 0.799
LeuVal: 10.031 ± 3.643
LeuTrp: 0.627 ± 0.341
LeuTyr: 2.508 ± 0.447
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.881 ± 1.022
MetCys: 0.0 ± 0.0
MetAsp: 1.881 ± 1.022
MetGlu: 1.254 ± 0.682
MetPhe: 0.627 ± 0.564
MetGly: 3.762 ± 0.671
MetHis: 0.627 ± 0.564
MetIle: 1.254 ± 0.682
MetLys: 1.254 ± 0.682
MetLeu: 3.762 ± 0.671
MetMet: 1.254 ± 0.224
MetAsn: 0.0 ± 0.0
MetPro: 3.135 ± 0.106
MetGln: 0.0 ± 0.0
MetArg: 2.508 ± 1.363
MetSer: 1.881 ± 0.117
MetThr: 1.881 ± 1.022
MetVal: 1.881 ± 0.117
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.389 ± 1.48
AsnCys: 0.0 ± 0.0
AsnAsp: 1.254 ± 0.682
AsnGlu: 1.254 ± 0.224
AsnPhe: 0.627 ± 0.564
AsnGly: 3.762 ± 0.671
AsnHis: 0.627 ± 0.564
AsnIle: 1.254 ± 0.682
AsnLys: 0.627 ± 0.341
AsnLeu: 5.016 ± 0.011
AsnMet: 1.254 ± 0.682
AsnAsn: 0.627 ± 0.341
AsnPro: 1.254 ± 0.224
AsnGln: 2.508 ± 0.447
AsnArg: 3.135 ± 0.799
AsnSer: 3.135 ± 0.106
AsnThr: 1.254 ± 0.682
AsnVal: 2.508 ± 0.458
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.627 ± 0.341
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.389 ± 1.48
ProCys: 0.627 ± 0.564
ProAsp: 0.627 ± 0.564
ProGlu: 4.389 ± 0.33
ProPhe: 1.254 ± 0.224
ProGly: 0.627 ± 0.564
ProHis: 1.254 ± 1.129
ProIle: 0.627 ± 0.564
ProLys: 3.762 ± 0.234
ProLeu: 6.27 ± 0.692
ProMet: 0.0 ± 0.0
ProAsn: 2.508 ± 1.363
ProPro: 0.627 ± 0.564
ProGln: 1.254 ± 0.224
ProArg: 1.254 ± 0.682
ProSer: 1.254 ± 0.682
ProThr: 3.135 ± 0.106
ProVal: 5.016 ± 0.916
ProTrp: 0.627 ± 0.341
ProTyr: 0.627 ± 0.341
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.508 ± 1.363
GlnCys: 2.508 ± 0.458
GlnAsp: 0.627 ± 0.341
GlnGlu: 2.508 ± 0.447
GlnPhe: 1.254 ± 0.224
GlnGly: 1.881 ± 0.788
GlnHis: 2.508 ± 0.447
GlnIle: 2.508 ± 0.458
GlnLys: 1.881 ± 0.117
GlnLeu: 3.762 ± 2.045
GlnMet: 0.627 ± 0.341
GlnAsn: 0.627 ± 0.564
GlnPro: 1.254 ± 0.224
GlnGln: 3.135 ± 1.012
GlnArg: 5.643 ± 1.459
GlnSer: 5.016 ± 0.894
GlnThr: 2.508 ± 0.458
GlnVal: 1.881 ± 0.117
GlnTrp: 0.627 ± 0.564
GlnTyr: 0.627 ± 0.341
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.389 ± 0.575
ArgCys: 0.627 ± 0.564
ArgAsp: 5.016 ± 0.894
ArgGlu: 3.762 ± 1.14
ArgPhe: 3.762 ± 3.387
ArgGly: 3.135 ± 0.799
ArgHis: 0.0 ± 0.0
ArgIle: 3.135 ± 1.012
ArgLys: 4.389 ± 0.33
ArgLeu: 7.524 ± 0.469
ArgMet: 0.0 ± 0.0
ArgAsn: 5.643 ± 2.162
ArgPro: 2.508 ± 0.458
ArgGln: 1.881 ± 1.022
ArgArg: 5.643 ± 1.257
ArgSer: 4.389 ± 1.235
ArgThr: 1.881 ± 1.022
ArgVal: 5.643 ± 0.554
ArgTrp: 1.254 ± 1.129
ArgTyr: 1.881 ± 0.117
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 10.031 ± 0.927
SerCys: 1.254 ± 0.682
SerAsp: 3.135 ± 0.106
SerGlu: 5.643 ± 0.554
SerPhe: 3.762 ± 0.671
SerGly: 8.777 ± 0.66
SerHis: 0.0 ± 0.0
SerIle: 3.135 ± 0.106
SerLys: 3.135 ± 1.704
SerLeu: 6.27 ± 2.023
SerMet: 4.389 ± 0.575
SerAsn: 0.0 ± 0.0
SerPro: 4.389 ± 0.575
SerGln: 5.016 ± 0.916
SerArg: 3.762 ± 0.234
SerSer: 10.658 ± 3.078
SerThr: 3.135 ± 0.799
SerVal: 6.27 ± 1.598
SerTrp: 1.254 ± 1.129
SerTyr: 1.881 ± 0.788
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.762 ± 0.671
ThrCys: 0.627 ± 0.341
ThrAsp: 2.508 ± 0.458
ThrGlu: 4.389 ± 1.235
ThrPhe: 3.762 ± 0.671
ThrGly: 3.135 ± 0.106
ThrHis: 0.627 ± 0.341
ThrIle: 1.254 ± 0.224
ThrLys: 0.627 ± 0.341
ThrLeu: 3.762 ± 2.045
ThrMet: 3.762 ± 2.045
ThrAsn: 1.881 ± 1.022
ThrPro: 1.881 ± 0.788
ThrGln: 1.881 ± 0.117
ThrArg: 1.254 ± 0.224
ThrSer: 3.762 ± 0.234
ThrThr: 3.135 ± 1.704
ThrVal: 5.016 ± 0.011
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.135 ± 0.106
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.404 ± 0.586
ValCys: 0.0 ± 0.0
ValAsp: 5.643 ± 2.162
ValGlu: 6.27 ± 0.692
ValPhe: 3.135 ± 0.106
ValGly: 4.389 ± 1.48
ValHis: 2.508 ± 0.458
ValIle: 6.27 ± 1.598
ValLys: 0.627 ± 0.341
ValLeu: 6.897 ± 1.683
ValMet: 3.135 ± 0.449
ValAsn: 1.881 ± 0.788
ValPro: 3.135 ± 0.799
ValGln: 2.508 ± 1.353
ValArg: 5.016 ± 0.916
ValSer: 7.524 ± 0.436
ValThr: 5.643 ± 2.162
ValVal: 6.27 ± 0.692
ValTrp: 0.627 ± 0.341
ValTyr: 1.254 ± 0.224
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.254 ± 0.224
TrpCys: 0.0 ± 0.0
TrpAsp: 1.881 ± 0.788
TrpGlu: 1.254 ± 0.224
TrpPhe: 0.0 ± 0.0
TrpGly: 0.627 ± 0.564
TrpHis: 0.627 ± 0.564
TrpIle: 0.0 ± 0.0
TrpLys: 1.254 ± 0.224
TrpLeu: 0.627 ± 0.564
TrpMet: 0.627 ± 0.341
TrpAsn: 1.254 ± 0.224
TrpPro: 0.0 ± 0.0
TrpGln: 0.627 ± 0.564
TrpArg: 1.881 ± 0.788
TrpSer: 0.627 ± 0.341
TrpThr: 0.627 ± 0.564
TrpVal: 0.627 ± 0.341
TrpTrp: 0.627 ± 0.341
TrpTyr: 1.254 ± 0.682
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.881 ± 0.117
TyrCys: 0.627 ± 0.564
TyrAsp: 1.254 ± 0.224
TyrGlu: 1.254 ± 0.682
TyrPhe: 1.881 ± 1.022
TyrGly: 0.627 ± 0.341
TyrHis: 1.254 ± 0.224
TyrIle: 1.881 ± 1.022
TyrLys: 0.627 ± 0.564
TyrLeu: 3.135 ± 0.106
TyrMet: 1.254 ± 0.224
TyrAsn: 1.881 ± 0.788
TyrPro: 1.254 ± 0.224
TyrGln: 0.0 ± 0.0
TyrArg: 1.881 ± 0.117
TyrSer: 2.508 ± 0.458
TyrThr: 0.0 ± 0.0
TyrVal: 1.254 ± 0.682
TyrTrp: 0.627 ± 0.564
TyrTyr: 0.627 ± 0.341
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1596 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
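A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under stated assumptions. The function names, the fixed seed, and the use of the sample standard deviation as the error are choices made here, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(sequences) proteins with replacement (assumption:
    this is what "at the protein level" means) and recomputes the frequency.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the proteome described on this page).
mean, err = bootstrap_error(["MKVLAAGLLA", "AAAGLAA"], "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

With only two proteins, as in this proteome, a protein-level resample can take just three distinct compositions, so error estimates of this kind are necessarily coarse.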

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski