Amino acid dipeptide frequency for Spissistilus festinus virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
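As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the use of one-letter codes, the choice not to count pairs across protein boundaries, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: pairs are counted within each protein only and the
    counts are normalised by the total number of pairs observed.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the viral proteome).
toy = ["MAAL", "ALA"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The toy input yields five pairs in total, so a pair seen twice gets 400 per mille. Real proteomes would be read from a FASTA file instead of a hard-coded list.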

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
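The arithmetic behind the expected value and the per-mille unit can be sketched as follows. The names `EXPECTED_PER_MILLE`, `per_mille_to_fraction` and `classify` are hypothetical, and the simple greater-than/less-than rule is an assumption: the page does not state the exact threshold used for the red and blue marking, nor whether 400 or 441 pairs form the baseline. The three example values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides:
# 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table, in per mille.
examples = {"AlaAla": 12.715, "AlaLeu": 13.536, "CysCys": 0.0}
labels = {pair: classify(value) for pair, value in examples.items()}
for pair, value in examples.items():
    print(pair, per_mille_to_fraction(value), labels[pair])
```

Under this rule AlaAla and AlaLeu are well above 2.5‰, while CysCys does not occur at all.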

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.715 ± 5.258
AlaCys: 0.82 ± 0.527
AlaAsp: 3.281 ± 0.941
AlaGlu: 5.332 ± 0.077
AlaPhe: 1.641 ± 1.281
AlaGly: 5.742 ± 0.77
AlaHis: 2.051 ± 0.734
AlaIle: 6.973 ± 0.774
AlaLys: 2.051 ± 0.734
AlaLeu: 13.536 ± 0.523
AlaMet: 0.41 ± 0.32
AlaAsn: 5.742 ± 1.565
AlaPro: 7.383 ± 1.095
AlaGln: 5.332 ± 1.091
AlaArg: 11.895 ± 2.388
AlaSer: 9.844 ± 0.097
AlaThr: 4.922 ± 0.34
AlaVal: 5.742 ± 2.149
AlaTrp: 2.871 ± 0.677
AlaTyr: 3.692 ± 1.204
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.231 ± 0.791
CysCys: 0.0 ± 0.0
CysAsp: 0.41 ± 0.264
CysGlu: 0.82 ± 0.527
CysPhe: 0.41 ± 0.264
CysGly: 0.41 ± 0.264
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 2.051 ± 0.734
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.41 ± 0.264
CysGln: 0.41 ± 0.32
CysArg: 0.41 ± 0.264
CysSer: 1.641 ± 0.697
CysThr: 2.461 ± 0.17
CysVal: 0.41 ± 0.264
CysTrp: 0.82 ± 0.057
CysTyr: 0.41 ± 0.32
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.512 ± 0.02
AspCys: 0.0 ± 0.0
AspAsp: 2.461 ± 1.581
AspGlu: 0.82 ± 0.641
AspPhe: 2.051 ± 1.018
AspGly: 3.692 ± 0.62
AspHis: 0.82 ± 0.057
AspIle: 3.281 ± 1.395
AspLys: 2.461 ± 0.997
AspLeu: 6.973 ± 0.977
AspMet: 0.0 ± 0.0
AspAsn: 1.641 ± 0.113
AspPro: 3.281 ± 0.811
AspGln: 0.82 ± 0.527
AspArg: 0.0 ± 0.0
AspSer: 2.461 ± 0.414
AspThr: 3.281 ± 0.227
AspVal: 3.692 ± 1.788
AspTrp: 1.641 ± 1.054
AspTyr: 1.231 ± 0.377
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.153 ± 0.718
GluCys: 0.82 ± 0.057
GluAsp: 2.051 ± 1.018
GluGlu: 2.051 ± 0.734
GluPhe: 0.0 ± 0.0
GluGly: 2.051 ± 1.018
GluHis: 0.82 ± 0.641
GluIle: 1.231 ± 0.377
GluLys: 1.231 ± 0.207
GluLeu: 2.871 ± 0.677
GluMet: 1.641 ± 0.113
GluAsn: 2.461 ± 0.414
GluPro: 4.102 ± 0.284
GluGln: 0.0 ± 0.0
GluArg: 2.461 ± 0.17
GluSer: 2.461 ± 0.754
GluThr: 0.82 ± 0.057
GluVal: 4.922 ± 0.243
GluTrp: 2.461 ± 0.754
GluTyr: 2.051 ± 0.15
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.82 ± 0.527
PheCys: 0.82 ± 0.641
PheAsp: 1.231 ± 0.207
PheGlu: 1.641 ± 0.113
PhePhe: 0.82 ± 0.641
PheGly: 2.461 ± 0.754
PheHis: 0.41 ± 0.32
PheIle: 1.231 ± 0.377
PheLys: 0.82 ± 0.527
PheLeu: 0.82 ± 0.641
PheMet: 0.41 ± 0.32
PheAsn: 0.82 ± 0.057
PhePro: 3.692 ± 0.547
PheGln: 1.231 ± 0.207
PheArg: 1.231 ± 0.377
PheSer: 1.641 ± 0.47
PheThr: 2.461 ± 0.17
PheVal: 1.231 ± 0.377
PheTrp: 0.82 ± 0.527
PheTyr: 0.82 ± 0.527
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.793 ± 1.999
GlyCys: 1.231 ± 0.791
GlyAsp: 6.973 ± 0.774
GlyGlu: 3.692 ± 1.131
GlyPhe: 2.461 ± 0.997
GlyGly: 6.973 ± 0.19
GlyHis: 1.231 ± 0.791
GlyIle: 2.871 ± 0.491
GlyLys: 2.051 ± 1.318
GlyLeu: 9.024 ± 1.711
GlyMet: 0.41 ± 0.264
GlyAsn: 1.641 ± 0.113
GlyPro: 5.332 ± 2.412
GlyGln: 3.281 ± 0.811
GlyArg: 6.973 ± 0.19
GlySer: 7.383 ± 0.073
GlyThr: 3.281 ± 0.941
GlyVal: 5.742 ± 2.149
GlyTrp: 1.641 ± 0.113
GlyTyr: 1.641 ± 0.47
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.82 ± 0.057
HisCys: 0.41 ± 0.264
HisAsp: 0.41 ± 0.32
HisGlu: 0.41 ± 0.264
HisPhe: 0.41 ± 0.264
HisGly: 1.231 ± 0.377
HisHis: 1.231 ± 0.207
HisIle: 1.231 ± 0.791
HisLys: 0.41 ± 0.32
HisLeu: 3.281 ± 0.941
HisMet: 0.0 ± 0.0
HisAsn: 0.82 ± 0.641
HisPro: 3.692 ± 0.62
HisGln: 1.641 ± 0.47
HisArg: 0.41 ± 0.32
HisSer: 1.641 ± 0.697
HisThr: 1.641 ± 0.113
HisVal: 2.461 ± 0.414
HisTrp: 0.41 ± 0.264
HisTyr: 0.82 ± 0.057
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.512 ± 1.147
IleCys: 0.41 ± 0.32
IleAsp: 0.0 ± 0.0
IleGlu: 0.41 ± 0.32
IlePhe: 1.231 ± 0.377
IleGly: 3.281 ± 0.227
IleHis: 0.82 ± 0.641
IleIle: 2.051 ± 0.734
IleLys: 0.0 ± 0.0
IleLeu: 2.461 ± 0.17
IleMet: 2.051 ± 0.434
IleAsn: 2.051 ± 0.15
IlePro: 2.871 ± 0.491
IleGln: 2.871 ± 0.491
IleArg: 3.281 ± 0.357
IleSer: 4.102 ± 0.3
IleThr: 3.692 ± 0.037
IleVal: 0.82 ± 0.641
IleTrp: 0.41 ± 0.32
IleTyr: 1.231 ± 0.377
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.641 ± 1.054
LysCys: 0.41 ± 0.264
LysAsp: 0.82 ± 0.527
LysGlu: 1.231 ± 0.791
LysPhe: 0.82 ± 0.057
LysGly: 2.871 ± 0.677
LysHis: 0.41 ± 0.264
LysIle: 0.82 ± 0.057
LysLys: 0.41 ± 0.264
LysLeu: 1.641 ± 0.113
LysMet: 0.41 ± 0.264
LysAsn: 1.641 ± 0.47
LysPro: 0.82 ± 0.057
LysGln: 1.641 ± 0.47
LysArg: 2.461 ± 1.581
LysSer: 0.82 ± 0.527
LysThr: 0.41 ± 0.264
LysVal: 2.871 ± 0.491
LysTrp: 0.82 ± 0.057
LysTyr: 0.82 ± 0.057
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 14.766 ± 1.897
LeuCys: 0.41 ± 0.32
LeuAsp: 4.102 ± 2.051
LeuGlu: 3.692 ± 1.788
LeuPhe: 2.871 ± 0.093
LeuGly: 8.203 ± 0.016
LeuHis: 1.231 ± 0.207
LeuIle: 2.871 ± 0.093
LeuLys: 3.281 ± 0.227
LeuLeu: 10.664 ± 1.597
LeuMet: 2.461 ± 0.997
LeuAsn: 4.922 ± 0.34
LeuPro: 7.793 ± 0.92
LeuGln: 4.512 ± 0.604
LeuArg: 7.383 ± 0.657
LeuSer: 4.922 ± 0.827
LeuThr: 7.383 ± 1.241
LeuVal: 4.102 ± 0.3
LeuTrp: 2.871 ± 0.093
LeuTyr: 5.332 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.692 ± 1.788
MetCys: 0.0 ± 0.0
MetAsp: 0.41 ± 0.264
MetGlu: 1.231 ± 0.961
MetPhe: 1.641 ± 0.113
MetGly: 1.231 ± 0.377
MetHis: 0.41 ± 0.264
MetIle: 0.0 ± 0.0
MetLys: 1.231 ± 0.377
MetLeu: 0.82 ± 0.057
MetMet: 0.0 ± 0.0
MetAsn: 0.41 ± 0.32
MetPro: 0.82 ± 0.527
MetGln: 0.0 ± 0.0
MetArg: 1.231 ± 0.791
MetSer: 1.641 ± 0.697
MetThr: 0.82 ± 0.057
MetVal: 0.82 ± 0.057
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.332 ± 0.661
AsnCys: 0.0 ± 0.0
AsnAsp: 2.051 ± 0.434
AsnGlu: 1.231 ± 0.377
AsnPhe: 2.051 ± 0.434
AsnGly: 1.641 ± 0.113
AsnHis: 0.41 ± 0.264
AsnIle: 0.82 ± 0.057
AsnLys: 0.0 ± 0.0
AsnLeu: 6.153 ± 1.301
AsnMet: 1.231 ± 0.377
AsnAsn: 2.051 ± 0.434
AsnPro: 4.512 ± 1.188
AsnGln: 0.41 ± 0.32
AsnArg: 1.641 ± 0.47
AsnSer: 1.231 ± 0.377
AsnThr: 1.641 ± 1.054
AsnVal: 2.461 ± 0.754
AsnTrp: 0.41 ± 0.264
AsnTyr: 1.231 ± 0.791
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.973 ± 1.942
ProCys: 0.82 ± 0.057
ProAsp: 2.871 ± 0.093
ProGlu: 4.922 ± 1.508
ProPhe: 0.41 ± 0.32
ProGly: 7.793 ± 0.92
ProHis: 2.461 ± 0.754
ProIle: 2.051 ± 1.018
ProLys: 1.641 ± 0.697
ProLeu: 7.793 ± 0.92
ProMet: 1.641 ± 0.113
ProAsn: 2.461 ± 0.17
ProPro: 13.126 ± 7.329
ProGln: 4.922 ± 3.259
ProArg: 4.102 ± 0.3
ProSer: 6.563 ± 1.622
ProThr: 6.153 ± 2.469
ProVal: 4.102 ± 0.284
ProTrp: 1.641 ± 0.113
ProTyr: 1.641 ± 0.47
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.281 ± 0.811
GlnCys: 1.641 ± 0.47
GlnAsp: 2.051 ± 0.15
GlnGlu: 1.641 ± 0.697
GlnPhe: 0.0 ± 0.0
GlnGly: 5.332 ± 0.661
GlnHis: 2.051 ± 0.434
GlnIle: 1.231 ± 0.377
GlnLys: 0.41 ± 0.264
GlnLeu: 4.512 ± 1.731
GlnMet: 1.231 ± 0.637
GlnAsn: 0.82 ± 0.527
GlnPro: 4.102 ± 2.305
GlnGln: 2.461 ± 0.414
GlnArg: 2.051 ± 0.734
GlnSer: 3.692 ± 0.037
GlnThr: 2.871 ± 0.491
GlnVal: 2.871 ± 0.093
GlnTrp: 1.231 ± 0.207
GlnTyr: 0.41 ± 0.264
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.434 ± 2.558
ArgCys: 0.41 ± 0.264
ArgAsp: 4.512 ± 1.731
ArgGlu: 4.102 ± 0.284
ArgPhe: 2.051 ± 0.434
ArgGly: 4.102 ± 0.884
ArgHis: 2.051 ± 0.734
ArgIle: 0.82 ± 0.057
ArgLys: 1.231 ± 0.791
ArgLeu: 9.434 ± 0.223
ArgMet: 0.82 ± 0.057
ArgAsn: 2.461 ± 0.17
ArgPro: 3.281 ± 0.227
ArgGln: 2.871 ± 0.093
ArgArg: 4.102 ± 0.3
ArgSer: 4.922 ± 0.827
ArgThr: 2.461 ± 0.414
ArgVal: 4.512 ± 0.02
ArgTrp: 0.41 ± 0.32
ArgTyr: 0.41 ± 0.32
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.793 ± 0.831
SerCys: 0.41 ± 0.264
SerAsp: 1.641 ± 0.113
SerGlu: 3.281 ± 0.811
SerPhe: 1.231 ± 0.377
SerGly: 7.793 ± 0.337
SerHis: 2.461 ± 0.17
SerIle: 4.922 ± 0.827
SerLys: 3.692 ± 1.788
SerLeu: 7.383 ± 0.657
SerMet: 0.82 ± 0.057
SerAsn: 0.82 ± 0.057
SerPro: 6.153 ± 1.301
SerGln: 4.102 ± 0.3
SerArg: 3.281 ± 0.357
SerSer: 4.922 ± 1.995
SerThr: 2.051 ± 0.434
SerVal: 5.742 ± 1.354
SerTrp: 1.231 ± 0.377
SerTyr: 1.231 ± 0.207
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.922 ± 0.34
ThrCys: 1.641 ± 0.113
ThrAsp: 3.692 ± 0.547
ThrGlu: 2.051 ± 0.15
ThrPhe: 1.641 ± 0.697
ThrGly: 4.922 ± 0.243
ThrHis: 2.051 ± 0.734
ThrIle: 1.641 ± 0.113
ThrLys: 1.231 ± 0.207
ThrLeu: 3.692 ± 0.547
ThrMet: 0.41 ± 0.264
ThrAsn: 1.231 ± 0.377
ThrPro: 4.512 ± 0.02
ThrGln: 1.641 ± 0.47
ThrArg: 6.973 ± 0.774
ThrSer: 4.922 ± 0.827
ThrThr: 3.281 ± 0.941
ThrVal: 2.051 ± 1.018
ThrTrp: 1.231 ± 0.377
ThrTyr: 1.641 ± 1.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.793 ± 1.415
ValCys: 1.231 ± 0.791
ValAsp: 3.281 ± 0.227
ValGlu: 2.461 ± 0.17
ValPhe: 1.641 ± 0.47
ValGly: 7.793 ± 1.999
ValHis: 1.231 ± 0.207
ValIle: 1.231 ± 0.207
ValLys: 0.82 ± 0.527
ValLeu: 4.102 ± 0.3
ValMet: 1.231 ± 0.207
ValAsn: 2.461 ± 0.414
ValPro: 6.563 ± 3.373
ValGln: 3.281 ± 0.227
ValArg: 2.871 ± 0.677
ValSer: 4.102 ± 1.468
ValThr: 4.102 ± 0.3
ValVal: 6.973 ± 0.19
ValTrp: 2.051 ± 0.15
ValTyr: 1.641 ± 0.47
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 4.512 ± 1.147
TrpCys: 0.41 ± 0.32
TrpAsp: 0.41 ± 0.32
TrpGlu: 0.82 ± 0.641
TrpPhe: 0.82 ± 0.527
TrpGly: 2.051 ± 1.601
TrpHis: 0.41 ± 0.32
TrpIle: 2.871 ± 0.093
TrpLys: 0.0 ± 0.0
TrpLeu: 3.281 ± 1.524
TrpMet: 0.41 ± 0.32
TrpAsn: 0.41 ± 0.32
TrpPro: 0.82 ± 0.641
TrpGln: 1.231 ± 0.791
TrpArg: 0.41 ± 0.264
TrpSer: 0.82 ± 0.527
TrpThr: 0.82 ± 0.057
TrpVal: 2.461 ± 0.997
TrpTrp: 0.41 ± 0.32
TrpTyr: 1.231 ± 0.961
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.281 ± 0.227
TyrCys: 0.41 ± 0.264
TyrAsp: 2.461 ± 0.754
TyrGlu: 0.82 ± 0.057
TyrPhe: 1.231 ± 0.791
TyrGly: 2.051 ± 0.434
TyrHis: 0.82 ± 0.527
TyrIle: 0.41 ± 0.264
TyrLys: 0.82 ± 0.527
TyrLeu: 3.281 ± 1.524
TyrMet: 0.41 ± 0.314
TyrAsn: 1.641 ± 1.281
TyrPro: 0.82 ± 0.665
TyrGln: 1.231 ± 0.791
TyrArg: 1.231 ± 0.377
TyrSer: 1.231 ± 0.207
TyrThr: 0.82 ± 0.641
TyrVal: 2.871 ± 1.261
TyrTrp: 1.231 ± 0.377
TyrTyr: 1.231 ± 0.207
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2439 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
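One way to picture a protein-level bootstrap is sketched below: whole proteins are resampled with replacement, the frequency of a chosen pair is recomputed for each resample, and the spread of those estimates is reported. This is only an assumed reading of the note. The helper names `pair_frequency` and `bootstrap_error`, the use of the population standard deviation as the error measure, the fixed seed, and the toy sequences are not taken from the source.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome (hypothetical sequences).
toy = ["MAAL", "GLLG"]
print(bootstrap_error(toy, "AA"))
```

With only two proteins, as here, each resample contains one of just three distinct combinations, so the resulting error bars are coarse.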

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski