Amino acid dipepetide frequency for Beihai mollusks virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.749AlaAla: 5.749 ± 1.718
2.875AlaCys: 2.875 ± 0.311
3.285AlaAsp: 3.285 ± 1.232
4.517AlaGlu: 4.517 ± 1.157
2.875AlaPhe: 2.875 ± 0.859
6.571AlaGly: 6.571 ± 1.88
1.643AlaHis: 1.643 ± 0.846
2.875AlaIle: 2.875 ± 0.311
3.285AlaLys: 3.285 ± 0.522
5.749AlaLeu: 5.749 ± 0.548
0.821AlaMet: 0.821 ± 0.423
4.517AlaAsn: 4.517 ± 0.598
1.232AlaPro: 1.232 ± 1.12
2.875AlaGln: 2.875 ± 0.859
3.696AlaArg: 3.696 ± 0.436
3.696AlaSer: 3.696 ± 0.436
6.982AlaThr: 6.982 ± 1.668
3.696AlaVal: 3.696 ± 0.149
0.0AlaTrp: 0.0 ± 0.0
4.107AlaTyr: 4.107 ± 0.36
0.0AlaXaa: 0.0 ± 0.0
Cys
0.821CysAla: 0.821 ± 0.423
0.0CysCys: 0.0 ± 0.0
1.232CysAsp: 1.232 ± 0.05
0.821CysGlu: 0.821 ± 0.162
0.0CysPhe: 0.0 ± 0.0
0.821CysGly: 0.821 ± 0.423
0.411CysHis: 0.411 ± 0.373
0.821CysIle: 0.821 ± 0.423
0.821CysLys: 0.821 ± 0.423
1.643CysLeu: 1.643 ± 0.846
0.411CysMet: 0.411 ± 0.211
1.232CysAsn: 1.232 ± 0.634
0.821CysPro: 0.821 ± 0.162
0.821CysGln: 0.821 ± 0.423
0.821CysArg: 0.821 ± 0.747
0.821CysSer: 0.821 ± 0.423
0.411CysThr: 0.411 ± 0.211
1.232CysVal: 1.232 ± 0.634
0.0CysTrp: 0.0 ± 0.0
1.643CysTyr: 1.643 ± 0.261
0.0CysXaa: 0.0 ± 0.0
Asp
4.928AspAla: 4.928 ± 0.971
0.821AspCys: 0.821 ± 0.162
4.107AspAsp: 4.107 ± 0.945
5.749AspGlu: 5.749 ± 0.548
3.696AspPhe: 3.696 ± 0.734
2.464AspGly: 2.464 ± 0.486
0.0AspHis: 0.0 ± 0.0
5.339AspIle: 5.339 ± 1.579
4.517AspLys: 4.517 ± 1.157
5.749AspLeu: 5.749 ± 0.037
2.053AspMet: 2.053 ± 0.595
0.821AspAsn: 0.821 ± 0.423
2.053AspPro: 2.053 ± 1.282
2.053AspGln: 2.053 ± 0.112
0.411AspArg: 0.411 ± 0.211
4.107AspSer: 4.107 ± 1.394
2.053AspThr: 2.053 ± 0.697
3.696AspVal: 3.696 ± 1.606
0.411AspTrp: 0.411 ± 0.211
3.696AspTyr: 3.696 ± 0.436
0.0AspXaa: 0.0 ± 0.0
Glu
3.285GluAla: 3.285 ± 0.063
2.464GluCys: 2.464 ± 0.684
4.517GluAsp: 4.517 ± 0.013
4.107GluGlu: 4.107 ± 1.53
2.875GluPhe: 2.875 ± 0.895
2.464GluGly: 2.464 ± 0.486
2.053GluHis: 2.053 ± 0.473
2.875GluIle: 2.875 ± 0.859
4.517GluLys: 4.517 ± 1.157
6.16GluLeu: 6.16 ± 0.922
0.821GluMet: 0.821 ± 0.162
3.285GluAsn: 3.285 ± 0.522
3.285GluPro: 3.285 ± 0.063
2.464GluGln: 2.464 ± 0.099
3.285GluArg: 3.285 ± 0.522
3.285GluSer: 3.285 ± 0.522
2.053GluThr: 2.053 ± 0.112
3.285GluVal: 3.285 ± 0.648
1.232GluTrp: 1.232 ± 0.634
1.643GluTyr: 1.643 ± 0.261
0.0GluXaa: 0.0 ± 0.0
Phe
3.696PheAla: 3.696 ± 0.436
0.821PheCys: 0.821 ± 0.423
3.285PheAsp: 3.285 ± 1.107
3.696PheGlu: 3.696 ± 0.149
1.643PhePhe: 1.643 ± 0.324
4.517PheGly: 4.517 ± 0.598
1.232PheHis: 1.232 ± 0.05
2.875PheIle: 2.875 ± 0.895
2.464PheLys: 2.464 ± 0.684
2.875PheLeu: 2.875 ± 0.311
1.643PheMet: 1.643 ± 0.261
3.285PheAsn: 3.285 ± 0.522
3.285PhePro: 3.285 ± 0.522
2.464PheGln: 2.464 ± 0.486
2.464PheArg: 2.464 ± 1.655
2.053PheSer: 2.053 ± 0.112
2.053PheThr: 2.053 ± 0.112
2.053PheVal: 2.053 ± 1.057
0.821PheTrp: 0.821 ± 0.423
2.464PheTyr: 2.464 ± 0.099
0.0PheXaa: 0.0 ± 0.0
Gly
3.696GlyAla: 3.696 ± 1.021
0.0GlyCys: 0.0 ± 0.0
4.517GlyAsp: 4.517 ± 0.572
4.107GlyGlu: 4.107 ± 1.394
3.285GlyPhe: 3.285 ± 1.107
3.285GlyGly: 3.285 ± 0.063
0.411GlyHis: 0.411 ± 0.211
4.517GlyIle: 4.517 ± 0.598
3.696GlyLys: 3.696 ± 1.903
4.107GlyLeu: 4.107 ± 0.809
0.821GlyMet: 0.821 ± 0.162
6.16GlyAsn: 6.16 ± 0.922
1.643GlyPro: 1.643 ± 0.324
1.232GlyGln: 1.232 ± 0.535
3.696GlyArg: 3.696 ± 0.436
3.696GlySer: 3.696 ± 0.734
2.875GlyThr: 2.875 ± 1.444
4.517GlyVal: 4.517 ± 1.183
0.821GlyTrp: 0.821 ± 0.747
2.875GlyTyr: 2.875 ± 0.859
0.0GlyXaa: 0.0 ± 0.0
His
1.643HisAla: 1.643 ± 0.846
0.0HisCys: 0.0 ± 0.0
0.821HisAsp: 0.821 ± 0.162
0.821HisGlu: 0.821 ± 0.423
0.821HisPhe: 0.821 ± 0.423
1.643HisGly: 1.643 ± 0.846
0.411HisHis: 0.411 ± 0.211
2.053HisIle: 2.053 ± 0.473
1.643HisLys: 1.643 ± 0.909
0.821HisLeu: 0.821 ± 0.423
1.643HisMet: 1.643 ± 0.909
2.053HisAsn: 2.053 ± 0.473
0.821HisPro: 0.821 ± 0.162
0.0HisGln: 0.0 ± 0.0
0.411HisArg: 0.411 ± 0.211
1.232HisSer: 1.232 ± 0.634
2.875HisThr: 2.875 ± 0.311
0.411HisVal: 0.411 ± 0.211
0.0HisTrp: 0.0 ± 0.0
0.821HisTyr: 0.821 ± 0.162
0.0HisXaa: 0.0 ± 0.0
Ile
5.749IleAla: 5.749 ± 0.037
0.821IleCys: 0.821 ± 0.423
3.285IleAsp: 3.285 ± 0.063
3.285IleGlu: 3.285 ± 0.063
1.643IlePhe: 1.643 ± 0.261
2.875IleGly: 2.875 ± 0.311
0.821IleHis: 0.821 ± 0.423
2.464IleIle: 2.464 ± 0.099
4.928IleLys: 4.928 ± 1.368
3.696IleLeu: 3.696 ± 1.903
4.107IleMet: 4.107 ± 0.945
5.749IleAsn: 5.749 ± 0.548
2.875IlePro: 2.875 ± 0.859
0.821IleGln: 0.821 ± 0.423
4.928IleArg: 4.928 ± 0.386
5.339IleSer: 5.339 ± 0.41
4.928IleThr: 4.928 ± 0.386
2.875IleVal: 2.875 ± 0.274
0.411IleTrp: 0.411 ± 0.373
1.232IleTyr: 1.232 ± 0.634
0.0IleXaa: 0.0 ± 0.0
Lys
4.928LysAla: 4.928 ± 0.783
0.0LysCys: 0.0 ± 0.0
4.107LysAsp: 4.107 ± 2.115
4.107LysGlu: 4.107 ± 1.53
1.232LysPhe: 1.232 ± 0.05
2.464LysGly: 2.464 ± 0.099
2.875LysHis: 2.875 ± 1.48
4.107LysIle: 4.107 ± 0.36
4.517LysLys: 4.517 ± 2.326
4.517LysLeu: 4.517 ± 1.157
2.464LysMet: 2.464 ± 1.269
3.285LysAsn: 3.285 ± 0.522
4.928LysPro: 4.928 ± 0.198
1.643LysGln: 1.643 ± 0.846
2.875LysArg: 2.875 ± 0.274
3.285LysSer: 3.285 ± 0.522
2.464LysThr: 2.464 ± 0.684
5.749LysVal: 5.749 ± 1.791
0.821LysTrp: 0.821 ± 0.423
4.928LysTyr: 4.928 ± 0.198
0.0LysXaa: 0.0 ± 0.0
Leu
4.928LeuAla: 4.928 ± 0.198
2.464LeuCys: 2.464 ± 0.684
5.749LeuAsp: 5.749 ± 0.548
6.571LeuGlu: 6.571 ± 0.125
5.339LeuPhe: 5.339 ± 0.41
6.16LeuGly: 6.16 ± 0.248
2.464LeuHis: 2.464 ± 0.099
4.928LeuIle: 4.928 ± 0.783
4.928LeuLys: 4.928 ± 1.368
5.749LeuLeu: 5.749 ± 1.791
1.643LeuMet: 1.643 ± 0.846
5.339LeuAsn: 5.339 ± 0.175
2.464LeuPro: 2.464 ± 0.486
1.232LeuGln: 1.232 ± 0.634
3.285LeuArg: 3.285 ± 0.648
3.696LeuSer: 3.696 ± 1.318
6.16LeuThr: 6.16 ± 0.337
7.392LeuVal: 7.392 ± 0.287
0.821LeuTrp: 0.821 ± 0.747
2.053LeuTyr: 2.053 ± 0.112
0.0LeuXaa: 0.0 ± 0.0
Met
2.053MetAla: 2.053 ± 0.112
0.821MetCys: 0.821 ± 0.162
0.821MetAsp: 0.821 ± 0.423
0.0MetGlu: 0.0 ± 0.0
1.232MetPhe: 1.232 ± 0.05
0.411MetGly: 0.411 ± 0.211
1.232MetHis: 1.232 ± 0.05
2.464MetIle: 2.464 ± 0.684
2.875MetLys: 2.875 ± 0.311
5.339MetLeu: 5.339 ± 0.41
0.821MetMet: 0.821 ± 0.423
3.285MetAsn: 3.285 ± 1.232
0.821MetPro: 0.821 ± 0.162
0.411MetGln: 0.411 ± 0.211
1.232MetArg: 1.232 ± 0.05
1.643MetSer: 1.643 ± 0.261
0.821MetThr: 0.821 ± 0.162
0.411MetVal: 0.411 ± 0.211
0.0MetTrp: 0.0 ± 0.0
0.821MetTyr: 0.821 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
5.339AsnAla: 5.339 ± 1.579
0.821AsnCys: 0.821 ± 0.162
0.821AsnAsp: 0.821 ± 0.423
2.875AsnGlu: 2.875 ± 0.274
3.696AsnPhe: 3.696 ± 0.149
2.464AsnGly: 2.464 ± 1.07
1.232AsnHis: 1.232 ± 0.634
4.517AsnIle: 4.517 ± 1.741
3.285AsnLys: 3.285 ± 0.522
4.517AsnLeu: 4.517 ± 0.013
2.053AsnMet: 2.053 ± 0.112
4.107AsnAsn: 4.107 ± 0.36
4.928AsnPro: 4.928 ± 0.971
0.821AsnGln: 0.821 ± 0.162
1.232AsnArg: 1.232 ± 0.05
5.339AsnSer: 5.339 ± 0.76
3.696AsnThr: 3.696 ± 0.149
5.749AsnVal: 5.749 ± 2.303
0.411AsnTrp: 0.411 ± 0.373
2.464AsnTyr: 2.464 ± 0.099
0.0AsnXaa: 0.0 ± 0.0
Pro
1.643ProAla: 1.643 ± 1.493
0.411ProCys: 0.411 ± 0.211
3.696ProAsp: 3.696 ± 1.021
2.875ProGlu: 2.875 ± 0.311
2.464ProPhe: 2.464 ± 0.486
2.053ProGly: 2.053 ± 0.112
0.821ProHis: 0.821 ± 0.162
2.875ProIle: 2.875 ± 0.859
2.053ProLys: 2.053 ± 1.057
5.339ProLeu: 5.339 ± 1.345
0.411ProMet: 0.411 ± 0.234
1.643ProAsn: 1.643 ± 0.324
0.0ProPro: 0.0 ± 0.0
0.411ProGln: 0.411 ± 0.211
2.464ProArg: 2.464 ± 0.099
4.517ProSer: 4.517 ± 0.013
2.464ProThr: 2.464 ± 0.486
2.053ProVal: 2.053 ± 0.697
0.821ProTrp: 0.821 ± 0.747
2.875ProTyr: 2.875 ± 0.859
0.0ProXaa: 0.0 ± 0.0
Gln
1.643GlnAla: 1.643 ± 0.324
0.0GlnCys: 0.0 ± 0.0
0.821GlnAsp: 0.821 ± 0.747
1.643GlnGlu: 1.643 ± 0.846
1.232GlnPhe: 1.232 ± 0.05
3.696GlnGly: 3.696 ± 1.021
0.821GlnHis: 0.821 ± 0.162
1.232GlnIle: 1.232 ± 0.05
2.053GlnLys: 2.053 ± 0.473
3.696GlnLeu: 3.696 ± 0.734
0.411GlnMet: 0.411 ± 0.373
0.0GlnAsn: 0.0 ± 0.0
0.411GlnPro: 0.411 ± 0.373
0.821GlnGln: 0.821 ± 0.162
2.875GlnArg: 2.875 ± 0.895
4.517GlnSer: 4.517 ± 0.598
1.643GlnThr: 1.643 ± 0.324
1.232GlnVal: 1.232 ± 0.05
0.821GlnTrp: 0.821 ± 0.747
0.821GlnTyr: 0.821 ± 0.162
0.0GlnXaa: 0.0 ± 0.0
Arg
3.285ArgAla: 3.285 ± 0.648
1.232ArgCys: 1.232 ± 0.634
3.696ArgAsp: 3.696 ± 0.149
3.285ArgGlu: 3.285 ± 0.648
2.464ArgPhe: 2.464 ± 0.099
2.053ArgGly: 2.053 ± 0.697
1.643ArgHis: 1.643 ± 0.909
4.107ArgIle: 4.107 ± 0.945
5.339ArgLys: 5.339 ± 0.995
2.464ArgLeu: 2.464 ± 1.07
1.643ArgMet: 1.643 ± 0.324
1.232ArgAsn: 1.232 ± 0.05
2.053ArgPro: 2.053 ± 0.697
0.821ArgGln: 0.821 ± 0.162
1.232ArgArg: 1.232 ± 0.05
1.232ArgSer: 1.232 ± 0.535
1.643ArgThr: 1.643 ± 0.846
4.107ArgVal: 4.107 ± 0.225
0.0ArgTrp: 0.0 ± 0.0
2.875ArgTyr: 2.875 ± 0.274
0.0ArgXaa: 0.0 ± 0.0
Ser
4.517SerAla: 4.517 ± 0.013
0.821SerCys: 0.821 ± 0.423
3.285SerAsp: 3.285 ± 1.232
3.285SerGlu: 3.285 ± 0.522
3.696SerPhe: 3.696 ± 0.734
4.517SerGly: 4.517 ± 1.183
1.232SerHis: 1.232 ± 0.634
4.517SerIle: 4.517 ± 1.183
5.339SerLys: 5.339 ± 1.579
4.517SerLeu: 4.517 ± 0.572
0.821SerMet: 0.821 ± 0.747
3.696SerAsn: 3.696 ± 1.318
3.696SerPro: 3.696 ± 1.021
3.285SerGln: 3.285 ± 0.648
2.875SerArg: 2.875 ± 0.859
4.928SerSer: 4.928 ± 0.783
2.464SerThr: 2.464 ± 0.486
3.696SerVal: 3.696 ± 0.734
0.821SerTrp: 0.821 ± 0.747
2.464SerTyr: 2.464 ± 1.07
0.0SerXaa: 0.0 ± 0.0
Thr
4.928ThrAla: 4.928 ± 1.556
0.411ThrCys: 0.411 ± 0.211
2.875ThrAsp: 2.875 ± 1.444
2.875ThrGlu: 2.875 ± 0.274
2.875ThrPhe: 2.875 ± 0.859
3.696ThrGly: 3.696 ± 1.021
0.821ThrHis: 0.821 ± 0.423
4.517ThrIle: 4.517 ± 0.598
2.875ThrLys: 2.875 ± 0.895
5.339ThrLeu: 5.339 ± 0.175
2.053ThrMet: 2.053 ± 0.697
4.107ThrAsn: 4.107 ± 0.225
2.053ThrPro: 2.053 ± 0.112
2.875ThrGln: 2.875 ± 0.274
3.285ThrArg: 3.285 ± 0.063
3.696ThrSer: 3.696 ± 1.021
2.875ThrThr: 2.875 ± 0.859
3.696ThrVal: 3.696 ± 0.149
0.0ThrTrp: 0.0 ± 0.0
2.875ThrTyr: 2.875 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
4.517ValAla: 4.517 ± 1.768
0.411ValCys: 0.411 ± 0.211
3.696ValAsp: 3.696 ± 1.606
3.285ValGlu: 3.285 ± 0.063
4.107ValPhe: 4.107 ± 0.945
4.928ValGly: 4.928 ± 0.783
0.0ValHis: 0.0 ± 0.0
3.696ValIle: 3.696 ± 0.436
2.053ValLys: 2.053 ± 0.697
5.749ValLeu: 5.749 ± 1.791
1.232ValMet: 1.232 ± 0.535
2.053ValAsn: 2.053 ± 0.473
3.696ValPro: 3.696 ± 0.436
2.464ValGln: 2.464 ± 0.684
3.285ValArg: 3.285 ± 0.522
4.928ValSer: 4.928 ± 0.971
5.339ValThr: 5.339 ± 1.345
6.571ValVal: 6.571 ± 1.629
0.821ValTrp: 0.821 ± 0.423
3.285ValTyr: 3.285 ± 0.063
0.0ValXaa: 0.0 ± 0.0
Trp
0.411TrpAla: 0.411 ± 0.373
0.0TrpCys: 0.0 ± 0.0
1.232TrpAsp: 1.232 ± 0.05
0.411TrpGlu: 0.411 ± 0.211
0.411TrpPhe: 0.411 ± 0.373
0.411TrpGly: 0.411 ± 0.373
0.0TrpHis: 0.0 ± 0.0
0.411TrpIle: 0.411 ± 0.211
0.0TrpLys: 0.0 ± 0.0
1.232TrpLeu: 1.232 ± 0.05
0.411TrpMet: 0.411 ± 0.211
0.821TrpAsn: 0.821 ± 0.747
0.0TrpPro: 0.0 ± 0.0
0.821TrpGln: 0.821 ± 0.747
0.411TrpArg: 0.411 ± 0.211
0.411TrpSer: 0.411 ± 0.373
1.643TrpThr: 1.643 ± 0.909
0.0TrpVal: 0.0 ± 0.0
0.411TrpTrp: 0.411 ± 0.211
0.821TrpTyr: 0.821 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.285TyrAla: 3.285 ± 0.648
0.411TyrCys: 0.411 ± 0.211
3.285TyrAsp: 3.285 ± 0.522
1.643TyrGlu: 1.643 ± 0.846
4.107TyrPhe: 4.107 ± 0.225
2.464TyrGly: 2.464 ± 0.099
0.821TyrHis: 0.821 ± 0.747
1.643TyrIle: 1.643 ± 0.261
4.107TyrLys: 4.107 ± 0.945
4.107TyrLeu: 4.107 ± 0.225
0.821TyrMet: 0.821 ± 0.423
3.696TyrAsn: 3.696 ± 1.021
0.821TyrPro: 0.821 ± 0.423
2.053TyrGln: 2.053 ± 1.282
1.643TyrArg: 1.643 ± 0.261
2.053TyrSer: 2.053 ± 1.282
3.285TyrThr: 3.285 ± 0.063
3.696TyrVal: 3.696 ± 0.149
0.821TyrTrp: 0.821 ± 0.747
1.643TyrTyr: 1.643 ± 0.846
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski