Amino acid dipeptide frequency for Beihai sesarmid crab virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
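As an illustration, dipeptide frequencies can be computed by sliding a window of length two over each protein. The sketch below is a minimal example, not the code used to produce this page: it assumes overlapping pairs counted within each protein (never across protein boundaries), uses one-letter codes instead of the three-letter codes in the table, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return the fraction of each overlapping dipeptide across all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy sequences (hypothetical, not taken from the virus proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
print(freqs["KK"])  # 2 of the 6 dipeptides are "KK"
```

Under this assumption a protein of length n contributes n - 1 dipeptides, so the fractions sum to 1 over all observed pairs.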

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
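The arithmetic behind the expected value and the unit conversion can be sketched as follows. This is only an illustration: the exact rule the page uses for red and blue marking is not stated, so the simple comparison against the uniform expectation is an assumption, and the names `classify` and `permille_to_fraction` are hypothetical. The three sample values are copied from the table.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform expectation in per mille: 1000 / 400 = 2.5 (equivalent to 0.25%).
expected_permille = 1000 / N_STANDARD**2


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_permille, expected=expected_permille):
    """Assumed rule: compare a value directly with the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Three values copied from the table (per mille).
table_values = {"AlaAla": 7.203, "AlaCys": 0.8, "SerLys": 10.004}
for pair, value in table_values.items():
    print(pair, permille_to_fraction(value), classify(value))
```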

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.203 ± 3.163
AlaCys: 0.8 ± 0.916
AlaAsp: 7.203 ± 4.434
AlaGlu: 4.002 ± 0.501
AlaPhe: 3.601 ± 2.217
AlaGly: 2.401 ± 1.063
AlaHis: 0.0 ± 0.0
AlaIle: 4.002 ± 0.769
AlaLys: 6.403 ± 0.977
AlaLeu: 8.003 ± 1.003
AlaMet: 4.002 ± 0.501
AlaAsn: 1.601 ± 0.562
AlaPro: 1.601 ± 0.562
AlaGln: 2.001 ± 0.886
AlaArg: 4.002 ± 0.769
AlaSer: 4.002 ± 0.769
AlaThr: 3.601 ± 0.946
AlaVal: 4.002 ± 0.769
AlaTrp: 0.8 ± 0.354
AlaTyr: 1.2 ± 0.532
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.8 ± 0.916
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.4 ± 0.177
CysHis: 0.8 ± 0.354
CysIle: 0.8 ± 0.354
CysLys: 1.601 ± 0.562
CysLeu: 0.4 ± 1.093
CysMet: 0.0 ± 0.0
CysAsn: 0.8 ± 0.354
CysPro: 0.0 ± 0.0
CysGln: 0.4 ± 0.177
CysArg: 0.8 ± 0.354
CysSer: 0.4 ± 1.093
CysThr: 0.8 ± 0.354
CysVal: 0.8 ± 0.354
CysTrp: 0.4 ± 0.177
CysTyr: 1.2 ± 0.739
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.002 ± 2.04
AspCys: 0.4 ± 0.177
AspAsp: 3.201 ± 1.417
AspGlu: 2.801 ± 0.03
AspPhe: 2.001 ± 0.886
AspGly: 2.001 ± 1.655
AspHis: 0.4 ± 0.177
AspIle: 3.601 ± 0.324
AspLys: 3.201 ± 1.124
AspLeu: 5.202 ± 4.049
AspMet: 2.801 ± 0.03
AspAsn: 1.601 ± 1.832
AspPro: 2.401 ± 1.063
AspGln: 1.601 ± 0.709
AspArg: 1.601 ± 0.709
AspSer: 2.001 ± 0.886
AspThr: 1.2 ± 0.532
AspVal: 4.402 ± 0.678
AspTrp: 1.601 ± 1.832
AspTyr: 2.001 ± 1.655
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.602 ± 1.331
GluCys: 0.8 ± 0.354
GluAsp: 4.002 ± 0.769
GluGlu: 5.202 ± 2.303
GluPhe: 4.402 ± 0.678
GluGly: 3.201 ± 1.417
GluHis: 0.4 ± 0.177
GluIle: 5.602 ± 1.21
GluLys: 3.601 ± 1.595
GluLeu: 8.403 ± 2.45
GluMet: 5.202 ± 0.238
GluAsn: 2.401 ± 1.063
GluPro: 2.801 ± 1.24
GluGln: 0.8 ± 0.354
GluArg: 3.601 ± 1.595
GluSer: 6.403 ± 0.294
GluThr: 4.002 ± 0.769
GluVal: 6.403 ± 0.294
GluTrp: 0.0 ± 0.0
GluTyr: 3.201 ± 1.417
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.401 ± 2.748
PheCys: 1.2 ± 2.009
PheAsp: 1.601 ± 0.709
PheGlu: 4.002 ± 0.501
PhePhe: 0.8 ± 0.354
PheGly: 1.601 ± 0.709
PheHis: 0.8 ± 0.354
PheIle: 4.402 ± 0.678
PheLys: 5.202 ± 2.303
PheLeu: 3.201 ± 0.147
PheMet: 2.401 ± 2.663
PheAsn: 2.401 ± 1.063
PhePro: 1.601 ± 0.562
PheGln: 1.2 ± 0.532
PheArg: 3.201 ± 0.147
PheSer: 2.001 ± 0.385
PheThr: 2.001 ± 0.886
PheVal: 3.601 ± 1.595
PheTrp: 0.0 ± 0.0
PheTyr: 2.001 ± 0.886
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.801 ± 1.301
GlyCys: 0.4 ± 0.177
GlyAsp: 1.2 ± 2.009
GlyGlu: 5.602 ± 0.06
GlyPhe: 1.601 ± 0.709
GlyGly: 2.401 ± 0.207
GlyHis: 0.0 ± 0.0
GlyIle: 5.602 ± 0.06
GlyLys: 4.002 ± 0.769
GlyLeu: 4.002 ± 0.769
GlyMet: 1.2 ± 0.532
GlyAsn: 0.8 ± 0.354
GlyPro: 0.8 ± 0.354
GlyGln: 1.601 ± 0.709
GlyArg: 4.402 ± 1.949
GlySer: 3.201 ± 0.147
GlyThr: 3.601 ± 4.758
GlyVal: 3.601 ± 0.324
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.801 ± 1.24
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.601 ± 1.832
HisCys: 0.4 ± 0.177
HisAsp: 0.8 ± 0.354
HisGlu: 0.8 ± 0.354
HisPhe: 0.0 ± 0.0
HisGly: 1.2 ± 0.532
HisHis: 0.0 ± 0.0
HisIle: 0.8 ± 0.916
HisLys: 2.001 ± 0.886
HisLeu: 2.801 ± 1.24
HisMet: 0.0 ± 0.0
HisAsn: 0.8 ± 0.354
HisPro: 0.4 ± 0.177
HisGln: 0.8 ± 0.354
HisArg: 1.601 ± 0.709
HisSer: 0.4 ± 0.177
HisThr: 0.8 ± 0.916
HisVal: 1.601 ± 0.709
HisTrp: 0.0 ± 0.0
HisTyr: 0.4 ± 1.093
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.602 ± 0.06
IleCys: 0.4 ± 0.177
IleAsp: 3.601 ± 0.324
IleGlu: 5.602 ± 2.601
IlePhe: 2.401 ± 0.207
IleGly: 2.401 ± 0.207
IleHis: 2.801 ± 1.24
IleIle: 5.602 ± 2.601
IleLys: 7.603 ± 0.825
IleLeu: 5.202 ± 1.033
IleMet: 3.201 ± 1.417
IleAsn: 2.801 ± 0.03
IlePro: 3.601 ± 0.946
IleGln: 2.401 ± 1.063
IleArg: 4.402 ± 1.949
IleSer: 8.003 ± 1.538
IleThr: 3.601 ± 0.324
IleVal: 3.601 ± 0.946
IleTrp: 0.4 ± 0.177
IleTyr: 2.001 ± 0.385
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.603 ± 0.825
LysCys: 0.4 ± 1.093
LysAsp: 3.601 ± 0.946
LysGlu: 6.803 ± 3.012
LysPhe: 4.002 ± 0.769
LysGly: 5.202 ± 1.033
LysHis: 1.601 ± 0.709
LysIle: 7.203 ± 1.893
LysLys: 6.002 ± 1.387
LysLeu: 4.802 ± 0.415
LysMet: 4.002 ± 1.772
LysAsn: 3.601 ± 1.595
LysPro: 3.601 ± 1.595
LysGln: 2.001 ± 0.385
LysArg: 6.403 ± 0.294
LysSer: 4.402 ± 0.592
LysThr: 2.801 ± 0.03
LysVal: 4.802 ± 0.415
LysTrp: 0.0 ± 0.0
LysTyr: 1.2 ± 0.739
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.601 ± 0.324
LeuCys: 0.8 ± 0.354
LeuAsp: 4.402 ± 0.678
LeuGlu: 6.002 ± 0.117
LeuPhe: 4.802 ± 2.126
LeuGly: 4.002 ± 0.501
LeuHis: 1.601 ± 1.832
LeuIle: 6.002 ± 2.424
LeuLys: 5.202 ± 1.508
LeuLeu: 4.402 ± 0.592
LeuMet: 2.401 ± 0.367
LeuAsn: 4.002 ± 0.769
LeuPro: 2.001 ± 0.886
LeuGln: 2.001 ± 0.886
LeuArg: 6.002 ± 0.117
LeuSer: 7.603 ± 0.445
LeuThr: 8.003 ± 3.543
LeuVal: 6.002 ± 2.424
LeuTrp: 0.0 ± 0.0
LeuTyr: 2.801 ± 1.24
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.802 ± 0.856
MetCys: 0.0 ± 0.0
MetAsp: 1.601 ± 0.709
MetGlu: 2.801 ± 1.24
MetPhe: 2.001 ± 0.886
MetGly: 0.8 ± 0.354
MetHis: 1.601 ± 0.709
MetIle: 2.801 ± 1.24
MetLys: 4.002 ± 0.769
MetLeu: 2.401 ± 1.063
MetMet: 1.2 ± 0.532
MetAsn: 2.801 ± 2.571
MetPro: 1.2 ± 0.532
MetGln: 0.4 ± 0.177
MetArg: 2.801 ± 0.03
MetSer: 5.602 ± 1.331
MetThr: 1.601 ± 0.709
MetVal: 2.001 ± 0.385
MetTrp: 0.4 ± 0.177
MetTyr: 1.2 ± 0.532
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.801 ± 0.03
AsnCys: 1.601 ± 0.562
AsnAsp: 1.601 ± 0.709
AsnGlu: 4.002 ± 0.501
AsnPhe: 2.401 ± 1.478
AsnGly: 1.2 ± 0.739
AsnHis: 0.0 ± 0.0
AsnIle: 1.601 ± 0.562
AsnLys: 2.801 ± 1.24
AsnLeu: 4.402 ± 1.862
AsnMet: 0.4 ± 0.177
AsnAsn: 1.601 ± 0.709
AsnPro: 3.201 ± 1.417
AsnGln: 1.601 ± 0.562
AsnArg: 0.8 ± 0.354
AsnSer: 1.601 ± 1.832
AsnThr: 2.401 ± 1.478
AsnVal: 4.802 ± 1.685
AsnTrp: 1.2 ± 0.532
AsnTyr: 0.4 ± 0.177
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.601 ± 0.709
ProCys: 0.0 ± 0.0
ProAsp: 1.601 ± 0.709
ProGlu: 2.401 ± 1.063
ProPhe: 1.2 ± 0.532
ProGly: 1.2 ± 0.739
ProHis: 0.8 ± 0.354
ProIle: 3.601 ± 0.324
ProLys: 3.601 ± 0.946
ProLeu: 2.401 ± 0.207
ProMet: 2.801 ± 0.03
ProAsn: 0.8 ± 0.354
ProPro: 1.2 ± 0.532
ProGln: 1.601 ± 3.103
ProArg: 2.001 ± 0.886
ProSer: 1.2 ± 0.532
ProThr: 2.401 ± 0.207
ProVal: 2.801 ± 1.24
ProTrp: 0.0 ± 0.0
ProTyr: 1.601 ± 0.709
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.601 ± 0.709
GlnCys: 0.0 ± 0.0
GlnAsp: 1.2 ± 0.532
GlnGlu: 2.001 ± 0.886
GlnPhe: 0.4 ± 0.177
GlnGly: 2.001 ± 1.655
GlnHis: 1.2 ± 0.532
GlnIle: 2.001 ± 1.655
GlnLys: 1.601 ± 0.709
GlnLeu: 0.4 ± 0.177
GlnMet: 0.8 ± 0.354
GlnAsn: 1.601 ± 0.562
GlnPro: 1.2 ± 0.532
GlnGln: 0.0 ± 0.0
GlnArg: 0.8 ± 0.354
GlnSer: 3.601 ± 0.324
GlnThr: 1.601 ± 0.562
GlnVal: 1.601 ± 0.562
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.8 ± 0.354
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.202 ± 2.779
ArgCys: 0.8 ± 0.354
ArgAsp: 0.4 ± 0.177
ArgGlu: 2.001 ± 0.886
ArgPhe: 3.601 ± 0.324
ArgGly: 4.002 ± 0.501
ArgHis: 0.8 ± 0.916
ArgIle: 5.602 ± 2.48
ArgLys: 3.201 ± 1.417
ArgLeu: 6.002 ± 1.387
ArgMet: 4.402 ± 0.678
ArgAsn: 0.4 ± 0.177
ArgPro: 2.001 ± 0.385
ArgGln: 0.4 ± 0.177
ArgArg: 4.002 ± 1.772
ArgSer: 5.602 ± 1.21
ArgThr: 5.202 ± 1.033
ArgVal: 6.002 ± 1.387
ArgTrp: 0.8 ± 0.916
ArgTyr: 1.601 ± 0.562
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.601 ± 0.324
SerCys: 1.2 ± 0.532
SerAsp: 2.801 ± 1.301
SerGlu: 7.203 ± 1.919
SerPhe: 2.801 ± 0.03
SerGly: 6.002 ± 2.424
SerHis: 2.001 ± 0.385
SerIle: 5.202 ± 1.033
SerLys: 10.004 ± 0.653
SerLeu: 3.201 ± 0.147
SerMet: 2.401 ± 0.207
SerAsn: 4.802 ± 1.685
SerPro: 2.801 ± 1.24
SerGln: 1.2 ± 0.532
SerArg: 3.601 ± 0.946
SerSer: 4.802 ± 1.685
SerThr: 4.002 ± 2.04
SerVal: 4.402 ± 0.592
SerTrp: 0.8 ± 0.354
SerTyr: 3.201 ± 0.147
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.601 ± 0.324
ThrCys: 0.4 ± 0.177
ThrAsp: 3.201 ± 1.124
ThrGlu: 5.202 ± 2.303
ThrPhe: 2.401 ± 1.063
ThrGly: 4.802 ± 1.685
ThrHis: 0.8 ± 0.354
ThrIle: 3.601 ± 0.946
ThrLys: 2.801 ± 0.03
ThrLeu: 5.202 ± 0.238
ThrMet: 0.8 ± 0.354
ThrAsn: 2.801 ± 1.301
ThrPro: 1.2 ± 0.739
ThrGln: 1.2 ± 0.739
ThrArg: 4.802 ± 1.685
ThrSer: 4.002 ± 0.769
ThrThr: 4.402 ± 0.592
ThrVal: 3.601 ± 0.324
ThrTrp: 0.4 ± 0.177
ThrTyr: 1.601 ± 0.709
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.201 ± 1.124
ValCys: 0.4 ± 0.177
ValAsp: 3.201 ± 2.394
ValGlu: 5.602 ± 1.21
ValPhe: 4.402 ± 0.678
ValGly: 3.601 ± 0.324
ValHis: 0.8 ± 0.354
ValIle: 4.402 ± 1.949
ValLys: 4.802 ± 0.856
ValLeu: 6.403 ± 0.294
ValMet: 3.201 ± 1.417
ValAsn: 2.801 ± 0.03
ValPro: 2.801 ± 3.842
ValGln: 1.601 ± 0.709
ValArg: 6.803 ± 0.471
ValSer: 6.403 ± 0.294
ValThr: 2.401 ± 0.207
ValVal: 2.001 ± 0.886
ValTrp: 0.4 ± 0.177
ValTyr: 2.401 ± 1.478
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.8 ± 0.354
TrpCys: 0.4 ± 0.177
TrpAsp: 0.8 ± 0.916
TrpGlu: 0.4 ± 0.177
TrpPhe: 0.4 ± 0.177
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.4 ± 0.177
TrpLys: 0.8 ± 0.354
TrpLeu: 1.601 ± 0.562
TrpMet: 0.0 ± 0.0
TrpAsn: 0.4 ± 1.093
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.8 ± 0.354
TrpThr: 0.4 ± 0.177
TrpVal: 0.4 ± 0.177
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.4 ± 1.093
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.001 ± 0.886
TyrCys: 0.0 ± 0.0
TyrAsp: 2.001 ± 0.886
TyrGlu: 3.601 ± 0.946
TyrPhe: 2.801 ± 1.301
TyrGly: 1.2 ± 0.739
TyrHis: 0.8 ± 0.916
TyrIle: 2.401 ± 1.063
TyrLys: 1.601 ± 0.709
TyrLeu: 3.201 ± 1.417
TyrMet: 0.4 ± 0.177
TyrAsn: 1.601 ± 0.562
TyrPro: 0.4 ± 0.177
TyrGln: 1.601 ± 1.832
TyrArg: 0.4 ± 0.177
TyrSer: 4.002 ± 0.501
TyrThr: 2.001 ± 0.886
TyrVal: 1.2 ± 0.532
TyrTrp: 0.8 ± 0.916
TyrTyr: 1.601 ± 0.709
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2500 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
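A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used by Proteome-pI is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation of the estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome of two proteins (hypothetical sequences).
proteome = ["MKKLA", "AKKKK"]
print(dipeptide_permille(proteome, "KK"), bootstrap_error(proteome, "KK"))
```

With only two proteins, each resample is one of a few possible combinations, so the error mostly reflects how differently the two proteins use a given dipeptide.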

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski