Amino acid dipepetide frequency for Beihai sipunculid worm virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.105AlaAla: 9.105 ± 1.883
1.188AlaCys: 1.188 ± 0.645
2.375AlaAsp: 2.375 ± 0.075
3.563AlaGlu: 3.563 ± 0.113
3.959AlaPhe: 3.959 ± 1.947
5.542AlaGly: 5.542 ± 1.087
1.979AlaHis: 1.979 ± 0.291
3.563AlaIle: 3.563 ± 1.479
2.771AlaLys: 2.771 ± 0.14
5.938AlaLeu: 5.938 ± 0.494
1.584AlaMet: 1.584 ± 0.177
2.771AlaAsn: 2.771 ± 0.823
4.751AlaPro: 4.751 ± 1.898
2.771AlaGln: 2.771 ± 0.543
3.167AlaArg: 3.167 ± 0.328
5.542AlaSer: 5.542 ± 2.453
4.751AlaThr: 4.751 ± 3.566
7.126AlaVal: 7.126 ± 0.909
1.584AlaTrp: 1.584 ± 0.506
3.959AlaTyr: 3.959 ± 0.581
0.0AlaXaa: 0.0 ± 0.0
Cys
1.188CysAla: 1.188 ± 0.645
0.0CysCys: 0.0 ± 0.0
0.792CysAsp: 0.792 ± 0.43
0.0CysGlu: 0.0 ± 0.0
0.792CysPhe: 0.792 ± 0.253
3.167CysGly: 3.167 ± 1.721
0.396CysHis: 0.396 ± 0.215
0.792CysIle: 0.792 ± 0.43
1.188CysLys: 1.188 ± 0.038
0.396CysLeu: 0.396 ± 0.215
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.792CysPro: 0.792 ± 0.253
0.396CysGln: 0.396 ± 0.215
0.396CysArg: 0.396 ± 0.215
1.584CysSer: 1.584 ± 0.506
1.188CysThr: 1.188 ± 0.645
1.979CysVal: 1.979 ± 1.075
0.792CysTrp: 0.792 ± 0.43
0.396CysTyr: 0.396 ± 0.215
0.0CysXaa: 0.0 ± 0.0
Asp
3.563AspAla: 3.563 ± 0.796
1.188AspCys: 1.188 ± 0.645
3.959AspAsp: 3.959 ± 0.785
2.771AspGlu: 2.771 ± 0.823
3.563AspPhe: 3.563 ± 0.796
3.167AspGly: 3.167 ± 0.328
1.584AspHis: 1.584 ± 0.177
4.751AspIle: 4.751 ± 0.532
1.979AspLys: 1.979 ± 1.075
4.355AspLeu: 4.355 ± 1.0
1.188AspMet: 1.188 ± 0.645
3.563AspAsn: 3.563 ± 0.57
3.563AspPro: 3.563 ± 1.479
1.188AspGln: 1.188 ± 0.645
1.188AspArg: 1.188 ± 0.038
3.563AspSer: 3.563 ± 0.113
4.751AspThr: 4.751 ± 0.151
4.355AspVal: 4.355 ± 1.049
1.584AspTrp: 1.584 ± 0.506
0.792AspTyr: 0.792 ± 0.43
0.0AspXaa: 0.0 ± 0.0
Glu
3.959GluAla: 3.959 ± 0.785
0.396GluCys: 0.396 ± 0.215
3.563GluAsp: 3.563 ± 0.57
3.167GluGlu: 3.167 ± 1.038
3.167GluPhe: 3.167 ± 1.038
4.751GluGly: 4.751 ± 1.517
0.792GluHis: 0.792 ± 0.43
2.771GluIle: 2.771 ± 0.14
2.375GluLys: 2.375 ± 1.291
3.167GluLeu: 3.167 ± 0.328
1.979GluMet: 1.979 ± 1.77
3.167GluAsn: 3.167 ± 1.038
4.355GluPro: 4.355 ± 1.732
1.584GluGln: 1.584 ± 0.177
3.167GluArg: 3.167 ± 1.038
2.375GluSer: 2.375 ± 0.608
5.938GluThr: 5.938 ± 1.555
4.751GluVal: 4.751 ± 0.151
1.584GluTrp: 1.584 ± 0.86
1.584GluTyr: 1.584 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
6.334PheAla: 6.334 ± 0.657
1.188PheCys: 1.188 ± 0.721
4.355PheAsp: 4.355 ± 0.366
3.959PheGlu: 3.959 ± 0.581
2.771PhePhe: 2.771 ± 1.909
3.167PheGly: 3.167 ± 0.355
1.979PheHis: 1.979 ± 0.291
3.563PheIle: 3.563 ± 1.253
2.375PheLys: 2.375 ± 1.291
2.771PheLeu: 2.771 ± 0.823
1.584PheMet: 1.584 ± 0.177
2.375PheAsn: 2.375 ± 0.758
0.792PhePro: 0.792 ± 0.253
1.584PheGln: 1.584 ± 0.86
3.959PheArg: 3.959 ± 1.264
3.167PheSer: 3.167 ± 1.011
2.375PheThr: 2.375 ± 1.441
2.375PheVal: 2.375 ± 0.758
0.0PheTrp: 0.0 ± 0.0
0.792PheTyr: 0.792 ± 0.936
0.0PheXaa: 0.0 ± 0.0
Gly
3.959GlyAla: 3.959 ± 1.947
0.396GlyCys: 0.396 ± 0.215
5.146GlyAsp: 5.146 ± 0.747
4.355GlyGlu: 4.355 ± 2.415
4.355GlyPhe: 4.355 ± 0.366
3.959GlyGly: 3.959 ± 1.468
1.188GlyHis: 1.188 ± 0.038
4.751GlyIle: 4.751 ± 1.898
5.146GlyLys: 5.146 ± 1.43
5.146GlyLeu: 5.146 ± 0.619
1.188GlyMet: 1.188 ± 0.038
1.979GlyAsn: 1.979 ± 0.974
1.188GlyPro: 1.188 ± 0.645
1.979GlyGln: 1.979 ± 0.392
2.375GlyArg: 2.375 ± 0.758
4.355GlySer: 4.355 ± 0.317
5.542GlyThr: 5.542 ± 1.087
5.938GlyVal: 5.938 ± 0.189
0.396GlyTrp: 0.396 ± 0.468
0.792GlyTyr: 0.792 ± 0.43
0.0GlyXaa: 0.0 ± 0.0
His
2.771HisAla: 2.771 ± 0.14
0.792HisCys: 0.792 ± 0.43
0.792HisAsp: 0.792 ± 0.253
1.188HisGlu: 1.188 ± 0.721
1.188HisPhe: 1.188 ± 0.645
1.584HisGly: 1.584 ± 0.177
0.396HisHis: 0.396 ± 0.215
1.584HisIle: 1.584 ± 0.506
1.979HisLys: 1.979 ± 0.392
2.375HisLeu: 2.375 ± 1.291
1.188HisMet: 1.188 ± 0.038
0.396HisAsn: 0.396 ± 0.468
1.188HisPro: 1.188 ± 0.038
0.0HisGln: 0.0 ± 0.0
3.167HisArg: 3.167 ± 1.038
1.584HisSer: 1.584 ± 0.86
1.188HisThr: 1.188 ± 0.038
1.188HisVal: 1.188 ± 0.038
0.792HisTrp: 0.792 ± 0.253
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.959IleAla: 3.959 ± 0.102
1.584IleCys: 1.584 ± 0.177
1.979IleAsp: 1.979 ± 0.392
1.979IleGlu: 1.979 ± 0.291
2.375IlePhe: 2.375 ± 0.075
3.167IleGly: 3.167 ± 2.377
0.792IleHis: 0.792 ± 0.43
2.771IleIle: 2.771 ± 0.543
2.771IleLys: 2.771 ± 0.14
3.959IleLeu: 3.959 ± 1.468
1.979IleMet: 1.979 ± 1.214
1.979IleAsn: 1.979 ± 0.392
2.771IlePro: 2.771 ± 0.543
1.584IleGln: 1.584 ± 0.177
4.751IleArg: 4.751 ± 1.215
2.771IleSer: 2.771 ± 0.14
3.563IleThr: 3.563 ± 0.113
5.542IleVal: 5.542 ± 0.404
0.0IleTrp: 0.0 ± 0.0
1.584IleTyr: 1.584 ± 0.506
0.0IleXaa: 0.0 ± 0.0
Lys
4.751LysAla: 4.751 ± 0.151
0.396LysCys: 0.396 ± 0.215
2.375LysAsp: 2.375 ± 1.291
2.771LysGlu: 2.771 ± 0.14
1.584LysPhe: 1.584 ± 0.86
3.167LysGly: 3.167 ± 0.355
2.375LysHis: 2.375 ± 0.608
1.584LysIle: 1.584 ± 0.177
2.771LysLys: 2.771 ± 1.506
2.771LysLeu: 2.771 ± 0.14
0.0LysMet: 0.0 ± 0.0
3.959LysAsn: 3.959 ± 0.785
1.584LysPro: 1.584 ± 0.177
0.396LysGln: 0.396 ± 0.215
2.771LysArg: 2.771 ± 0.14
2.375LysSer: 2.375 ± 0.608
2.375LysThr: 2.375 ± 0.608
4.751LysVal: 4.751 ± 0.532
0.396LysTrp: 0.396 ± 0.215
3.563LysTyr: 3.563 ± 1.936
0.0LysXaa: 0.0 ± 0.0
Leu
7.918LeuAla: 7.918 ± 0.479
1.188LeuCys: 1.188 ± 0.645
3.563LeuAsp: 3.563 ± 0.113
2.771LeuGlu: 2.771 ± 1.226
2.771LeuPhe: 2.771 ± 0.14
5.542LeuGly: 5.542 ± 0.279
1.188LeuHis: 1.188 ± 0.645
2.375LeuIle: 2.375 ± 0.075
3.959LeuLys: 3.959 ± 0.785
3.563LeuLeu: 3.563 ± 1.253
0.396LeuMet: 0.396 ± 0.215
2.375LeuAsn: 2.375 ± 0.608
4.355LeuPro: 4.355 ± 1.0
3.167LeuGln: 3.167 ± 1.721
5.542LeuArg: 5.542 ± 0.404
6.73LeuSer: 6.73 ± 0.925
7.522LeuThr: 7.522 ± 2.06
4.751LeuVal: 4.751 ± 0.532
1.584LeuTrp: 1.584 ± 0.177
2.375LeuTyr: 2.375 ± 1.291
0.0LeuXaa: 0.0 ± 0.0
Met
2.771MetAla: 2.771 ± 0.543
0.396MetCys: 0.396 ± 0.215
0.396MetAsp: 0.396 ± 0.215
1.584MetGlu: 1.584 ± 0.177
0.396MetPhe: 0.396 ± 0.468
2.375MetGly: 2.375 ± 0.758
0.792MetHis: 0.792 ± 0.253
1.188MetIle: 1.188 ± 0.038
1.188MetLys: 1.188 ± 0.721
1.188MetLeu: 1.188 ± 0.038
0.792MetMet: 0.792 ± 0.253
0.792MetAsn: 0.792 ± 0.43
0.792MetPro: 0.792 ± 0.936
1.188MetGln: 1.188 ± 0.721
1.188MetArg: 1.188 ± 0.038
0.792MetSer: 0.792 ± 0.253
1.979MetThr: 1.979 ± 0.291
1.584MetVal: 1.584 ± 0.506
0.792MetTrp: 0.792 ± 0.43
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.771AsnAla: 2.771 ± 0.14
1.188AsnCys: 1.188 ± 0.645
1.584AsnAsp: 1.584 ± 0.506
1.979AsnGlu: 1.979 ± 0.392
1.188AsnPhe: 1.188 ± 0.645
4.355AsnGly: 4.355 ± 1.0
1.188AsnHis: 1.188 ± 0.645
1.979AsnIle: 1.979 ± 0.974
2.375AsnLys: 2.375 ± 0.608
3.563AsnLeu: 3.563 ± 0.57
1.584AsnMet: 1.584 ± 0.506
0.396AsnAsn: 0.396 ± 0.215
1.979AsnPro: 1.979 ± 0.974
1.584AsnGln: 1.584 ± 0.177
2.375AsnArg: 2.375 ± 0.758
4.751AsnSer: 4.751 ± 0.532
1.979AsnThr: 1.979 ± 0.291
3.563AsnVal: 3.563 ± 1.936
1.188AsnTrp: 1.188 ± 0.038
1.188AsnTyr: 1.188 ± 0.645
0.0AsnXaa: 0.0 ± 0.0
Pro
3.167ProAla: 3.167 ± 0.328
0.396ProCys: 0.396 ± 0.215
3.959ProAsp: 3.959 ± 0.785
3.959ProGlu: 3.959 ± 0.785
3.563ProPhe: 3.563 ± 0.57
2.375ProGly: 2.375 ± 0.608
0.396ProHis: 0.396 ± 0.468
0.792ProIle: 0.792 ± 0.43
1.188ProLys: 1.188 ± 0.645
4.355ProLeu: 4.355 ± 0.317
2.375ProMet: 2.375 ± 1.441
1.188ProAsn: 1.188 ± 0.721
2.771ProPro: 2.771 ± 0.823
3.563ProGln: 3.563 ± 0.796
1.584ProArg: 1.584 ± 0.177
3.563ProSer: 3.563 ± 0.113
3.167ProThr: 3.167 ± 0.328
7.918ProVal: 7.918 ± 1.845
0.792ProTrp: 0.792 ± 0.936
1.979ProTyr: 1.979 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
2.771GlnAla: 2.771 ± 0.543
0.0GlnCys: 0.0 ± 0.0
2.771GlnAsp: 2.771 ± 0.823
1.584GlnGlu: 1.584 ± 0.86
0.792GlnPhe: 0.792 ± 0.253
1.584GlnGly: 1.584 ± 0.177
1.584GlnHis: 1.584 ± 0.506
1.188GlnIle: 1.188 ± 0.038
1.979GlnLys: 1.979 ± 0.392
3.563GlnLeu: 3.563 ± 0.113
1.188GlnMet: 1.188 ± 0.645
0.396GlnAsn: 0.396 ± 0.215
1.584GlnPro: 1.584 ± 0.86
0.396GlnGln: 0.396 ± 0.215
2.375GlnArg: 2.375 ± 0.075
1.979GlnSer: 1.979 ± 0.392
1.584GlnThr: 1.584 ± 0.506
1.584GlnVal: 1.584 ± 0.506
0.792GlnTrp: 0.792 ± 0.43
1.188GlnTyr: 1.188 ± 0.038
0.0GlnXaa: 0.0 ± 0.0
Arg
1.584ArgAla: 1.584 ± 0.506
0.792ArgCys: 0.792 ± 0.43
1.979ArgAsp: 1.979 ± 0.291
2.771ArgGlu: 2.771 ± 0.823
2.771ArgPhe: 2.771 ± 0.543
2.771ArgGly: 2.771 ± 1.909
2.375ArgHis: 2.375 ± 0.608
3.959ArgIle: 3.959 ± 1.947
3.959ArgLys: 3.959 ± 1.468
6.334ArgLeu: 6.334 ± 0.026
1.584ArgMet: 1.584 ± 0.506
4.751ArgAsn: 4.751 ± 1.215
4.751ArgPro: 4.751 ± 0.834
1.584ArgGln: 1.584 ± 0.177
5.146ArgArg: 5.146 ± 1.43
3.959ArgSer: 3.959 ± 0.102
1.584ArgThr: 1.584 ± 0.177
2.771ArgVal: 2.771 ± 0.823
1.188ArgTrp: 1.188 ± 0.645
3.563ArgTyr: 3.563 ± 0.113
0.0ArgXaa: 0.0 ± 0.0
Ser
5.542SerAla: 5.542 ± 0.404
1.188SerCys: 1.188 ± 0.038
3.959SerAsp: 3.959 ± 1.947
3.959SerGlu: 3.959 ± 1.468
3.167SerPhe: 3.167 ± 1.011
2.771SerGly: 2.771 ± 0.823
0.0SerHis: 0.0 ± 0.0
4.751SerIle: 4.751 ± 0.151
1.979SerLys: 1.979 ± 0.392
6.334SerLeu: 6.334 ± 0.71
1.584SerMet: 1.584 ± 0.86
3.167SerAsn: 3.167 ± 1.011
4.751SerPro: 4.751 ± 0.151
1.979SerGln: 1.979 ± 0.291
3.563SerArg: 3.563 ± 0.113
5.542SerSer: 5.542 ± 0.962
4.355SerThr: 4.355 ± 0.366
5.938SerVal: 5.938 ± 0.189
1.188SerTrp: 1.188 ± 0.645
2.375SerTyr: 2.375 ± 0.075
0.0SerXaa: 0.0 ± 0.0
Thr
5.542ThrAla: 5.542 ± 2.453
0.792ThrCys: 0.792 ± 0.43
3.563ThrAsp: 3.563 ± 0.796
1.584ThrGlu: 1.584 ± 1.189
4.751ThrPhe: 4.751 ± 0.834
4.355ThrGly: 4.355 ± 0.366
1.979ThrHis: 1.979 ± 1.075
3.563ThrIle: 3.563 ± 1.479
1.584ThrLys: 1.584 ± 0.177
3.563ThrLeu: 3.563 ± 1.479
1.188ThrMet: 1.188 ± 0.721
3.563ThrAsn: 3.563 ± 0.113
5.938ThrPro: 5.938 ± 0.189
2.375ThrGln: 2.375 ± 0.075
3.959ThrArg: 3.959 ± 0.102
5.542ThrSer: 5.542 ± 2.453
5.146ThrThr: 5.146 ± 2.668
4.355ThrVal: 4.355 ± 0.366
0.792ThrTrp: 0.792 ± 0.43
2.771ThrTyr: 2.771 ± 0.543
0.0ThrXaa: 0.0 ± 0.0
Val
3.563ValAla: 3.563 ± 1.479
1.979ValCys: 1.979 ± 0.392
6.334ValAsp: 6.334 ± 0.657
10.689ValGlu: 10.689 ± 1.027
3.959ValPhe: 3.959 ± 1.264
4.355ValGly: 4.355 ± 1.049
3.167ValHis: 3.167 ± 0.328
4.355ValIle: 4.355 ± 0.317
3.959ValLys: 3.959 ± 0.581
4.751ValLeu: 4.751 ± 0.151
0.396ValMet: 0.396 ± 0.468
3.959ValAsn: 3.959 ± 1.468
3.167ValPro: 3.167 ± 0.355
1.979ValGln: 1.979 ± 0.974
4.355ValArg: 4.355 ± 0.317
4.751ValSer: 4.751 ± 1.215
4.751ValThr: 4.751 ± 0.151
5.938ValVal: 5.938 ± 1.86
0.792ValTrp: 0.792 ± 0.43
3.563ValTyr: 3.563 ± 0.113
0.0ValXaa: 0.0 ± 0.0
Trp
0.396TrpAla: 0.396 ± 0.215
0.792TrpCys: 0.792 ± 0.43
0.0TrpAsp: 0.0 ± 0.0
1.979TrpGlu: 1.979 ± 0.392
0.792TrpPhe: 0.792 ± 0.253
0.0TrpGly: 0.0 ± 0.0
0.792TrpHis: 0.792 ± 0.253
1.188TrpIle: 1.188 ± 0.645
0.0TrpLys: 0.0 ± 0.0
1.979TrpLeu: 1.979 ± 0.291
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.792TrpPro: 0.792 ± 0.43
1.188TrpGln: 1.188 ± 0.645
2.375TrpArg: 2.375 ± 1.441
1.979TrpSer: 1.979 ± 1.075
1.188TrpThr: 1.188 ± 0.038
1.188TrpVal: 1.188 ± 0.038
0.396TrpTrp: 0.396 ± 0.215
0.396TrpTyr: 0.396 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.375TyrAla: 2.375 ± 1.441
0.396TyrCys: 0.396 ± 0.215
3.167TyrAsp: 3.167 ± 1.038
1.979TyrGlu: 1.979 ± 0.392
3.563TyrPhe: 3.563 ± 0.113
1.979TyrGly: 1.979 ± 1.075
0.396TyrHis: 0.396 ± 0.215
0.792TyrIle: 0.792 ± 0.253
1.188TyrLys: 1.188 ± 0.038
3.563TyrLeu: 3.563 ± 0.57
0.0TyrMet: 0.0 ± 0.0
1.979TyrAsn: 1.979 ± 0.392
1.188TyrPro: 1.188 ± 0.721
0.396TyrGln: 0.396 ± 0.215
2.771TyrArg: 2.771 ± 0.823
1.188TyrSer: 1.188 ± 0.721
1.979TyrThr: 1.979 ± 0.291
3.167TyrVal: 3.167 ± 0.355
0.792TyrTrp: 0.792 ± 0.253
2.375TyrTyr: 2.375 ± 0.608
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2527 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski