Amino acid dipepetide frequency for Colletotrichum gloeosporioides chrysovirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.626AlaAla: 4.626 ± 0.985
1.068AlaCys: 1.068 ± 0.068
4.27AlaAsp: 4.27 ± 0.272
6.406AlaGlu: 6.406 ± 1.12
2.847AlaPhe: 2.847 ± 1.643
4.982AlaGly: 4.982 ± 0.585
1.068AlaHis: 1.068 ± 0.517
6.406AlaIle: 6.406 ± 1.438
4.27AlaLys: 4.27 ± 0.272
3.915AlaLeu: 3.915 ± 0.945
2.135AlaMet: 2.135 ± 0.696
1.779AlaAsn: 1.779 ± 0.613
1.423AlaPro: 1.423 ± 0.622
3.559AlaGln: 3.559 ± 1.264
3.559AlaArg: 3.559 ± 0.09
3.915AlaSer: 3.915 ± 0.211
3.559AlaThr: 3.559 ± 0.769
3.915AlaVal: 3.915 ± 1.193
1.068AlaTrp: 1.068 ± 0.517
3.203AlaTyr: 3.203 ± 1.04
0.356AlaXaa: 0.356 ± 0.3
Cys
1.068CysAla: 1.068 ± 0.454
0.0CysCys: 0.0 ± 0.0
1.068CysAsp: 1.068 ± 0.487
0.712CysGlu: 0.712 ± 0.581
0.712CysPhe: 0.712 ± 0.246
0.712CysGly: 0.712 ± 0.318
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.712CysLys: 0.712 ± 0.285
1.068CysLeu: 1.068 ± 0.546
0.712CysMet: 0.712 ± 0.507
0.712CysAsn: 0.712 ± 0.285
0.356CysPro: 0.356 ± 0.253
0.356CysGln: 0.356 ± 0.291
0.712CysArg: 0.712 ± 0.246
2.135CysSer: 2.135 ± 0.955
0.356CysThr: 0.356 ± 0.253
0.356CysVal: 0.356 ± 0.253
0.0CysTrp: 0.0 ± 0.0
1.423CysTyr: 1.423 ± 0.789
0.0CysXaa: 0.0 ± 0.0
Asp
6.05AspAla: 6.05 ± 1.082
0.712AspCys: 0.712 ± 0.318
4.626AspAsp: 4.626 ± 1.656
4.626AspGlu: 4.626 ± 0.466
2.135AspPhe: 2.135 ± 1.108
3.915AspGly: 3.915 ± 0.67
1.779AspHis: 1.779 ± 0.214
2.135AspIle: 2.135 ± 0.136
2.491AspLys: 2.491 ± 0.554
6.05AspLeu: 6.05 ± 0.59
2.491AspMet: 2.491 ± 1.134
4.27AspAsn: 4.27 ± 1.442
2.135AspPro: 2.135 ± 0.335
1.068AspGln: 1.068 ± 0.399
3.915AspArg: 3.915 ± 0.211
1.779AspSer: 1.779 ± 1.063
1.779AspThr: 1.779 ± 0.443
3.915AspVal: 3.915 ± 0.518
2.847AspTrp: 2.847 ± 0.863
3.203AspTyr: 3.203 ± 0.273
0.0AspXaa: 0.0 ± 0.0
Glu
4.982GluAla: 4.982 ± 1.108
0.0GluCys: 0.0 ± 0.0
4.27GluAsp: 4.27 ± 0.372
6.762GluGlu: 6.762 ± 1.842
5.338GluPhe: 5.338 ± 0.905
5.338GluGly: 5.338 ± 1.934
2.491GluHis: 2.491 ± 0.825
5.338GluIle: 5.338 ± 1.237
4.27GluLys: 4.27 ± 0.213
5.694GluLeu: 5.694 ± 0.813
3.915GluMet: 3.915 ± 0.475
3.203GluAsn: 3.203 ± 0.653
1.423GluPro: 1.423 ± 0.678
1.423GluGln: 1.423 ± 0.622
6.05GluArg: 6.05 ± 1.374
4.27GluSer: 4.27 ± 1.087
2.491GluThr: 2.491 ± 0.132
9.609GluVal: 9.609 ± 2.297
1.423GluTrp: 1.423 ± 0.328
3.203GluTyr: 3.203 ± 1.021
0.356GluXaa: 0.356 ± 0.253
Phe
1.423PheAla: 1.423 ± 1.013
0.712PheCys: 0.712 ± 0.507
2.491PheAsp: 2.491 ± 0.629
5.338PheGlu: 5.338 ± 0.643
2.847PhePhe: 2.847 ± 0.863
3.203PheGly: 3.203 ± 1.083
1.423PheHis: 1.423 ± 0.343
1.423PheIle: 1.423 ± 1.013
3.915PheLys: 3.915 ± 1.132
1.423PheLeu: 1.423 ± 0.328
2.491PheMet: 2.491 ± 0.949
1.423PheAsn: 1.423 ± 0.19
1.068PhePro: 1.068 ± 0.068
0.712PheGln: 0.712 ± 0.246
3.203PheArg: 3.203 ± 0.851
3.559PheSer: 3.559 ± 0.414
2.491PheThr: 2.491 ± 0.825
2.491PheVal: 2.491 ± 0.554
0.356PheTrp: 0.356 ± 0.253
1.068PheTyr: 1.068 ± 0.517
0.0PheXaa: 0.0 ± 0.0
Gly
6.762GlyAla: 6.762 ± 0.65
0.356GlyCys: 0.356 ± 0.3
4.626GlyAsp: 4.626 ± 1.449
5.694GlyGlu: 5.694 ± 0.351
2.847GlyPhe: 2.847 ± 0.863
5.694GlyGly: 5.694 ± 1.877
0.712GlyHis: 0.712 ± 0.507
3.915GlyIle: 3.915 ± 1.984
6.406GlyLys: 6.406 ± 0.452
7.829GlyLeu: 7.829 ± 2.278
2.491GlyMet: 2.491 ± 0.4
2.135GlyAsn: 2.135 ± 0.136
0.356GlyPro: 0.356 ± 0.253
4.27GlyGln: 4.27 ± 0.711
4.982GlyArg: 4.982 ± 0.656
5.338GlySer: 5.338 ± 0.986
2.491GlyThr: 2.491 ± 0.629
4.626GlyVal: 4.626 ± 1.083
0.356GlyTrp: 0.356 ± 0.253
2.847GlyTyr: 2.847 ± 0.689
0.0GlyXaa: 0.0 ± 0.0
His
1.068HisAla: 1.068 ± 0.068
0.0HisCys: 0.0 ± 0.0
1.068HisAsp: 1.068 ± 0.454
1.068HisGlu: 1.068 ± 0.487
0.712HisPhe: 0.712 ± 0.285
1.423HisGly: 1.423 ± 0.343
0.712HisHis: 0.712 ± 0.507
0.356HisIle: 0.356 ± 0.291
2.135HisLys: 2.135 ± 0.418
0.712HisLeu: 0.712 ± 0.318
1.423HisMet: 1.423 ± 0.343
1.423HisAsn: 1.423 ± 0.622
0.356HisPro: 0.356 ± 0.253
0.356HisGln: 0.356 ± 0.3
1.068HisArg: 1.068 ± 0.454
1.779HisSer: 1.779 ± 1.063
1.068HisThr: 1.068 ± 0.872
1.779HisVal: 1.779 ± 0.632
0.712HisTrp: 0.712 ± 0.246
1.068HisTyr: 1.068 ± 0.068
0.0HisXaa: 0.0 ± 0.0
Ile
4.626IleAla: 4.626 ± 1.42
0.356IleCys: 0.356 ± 0.3
5.338IleAsp: 5.338 ± 0.986
3.203IleGlu: 3.203 ± 0.204
0.712IlePhe: 0.712 ± 0.318
4.626IleGly: 4.626 ± 1.083
1.068IleHis: 1.068 ± 0.068
0.712IleIle: 0.712 ± 0.246
1.779IleLys: 1.779 ± 0.443
2.491IleLeu: 2.491 ± 0.132
2.847IleMet: 2.847 ± 0.312
2.135IleAsn: 2.135 ± 0.738
3.915IlePro: 3.915 ± 0.67
1.779IleGln: 1.779 ± 0.214
3.203IleArg: 3.203 ± 0.664
2.135IleSer: 2.135 ± 0.598
0.712IleThr: 0.712 ± 0.581
4.27IleVal: 4.27 ± 0.669
0.0IleTrp: 0.0 ± 0.0
1.423IleTyr: 1.423 ± 0.343
0.0IleXaa: 0.0 ± 0.0
Lys
3.559LysAla: 3.559 ± 0.791
0.712LysCys: 0.712 ± 0.318
3.559LysAsp: 3.559 ± 0.791
6.05LysGlu: 6.05 ± 1.082
2.847LysPhe: 2.847 ± 0.303
5.338LysGly: 5.338 ± 0.663
0.712LysHis: 0.712 ± 0.246
2.491LysIle: 2.491 ± 0.132
7.117LysLys: 7.117 ± 1.352
4.982LysLeu: 4.982 ± 1.337
1.423LysMet: 1.423 ± 0.57
2.135LysAsn: 2.135 ± 0.598
1.423LysPro: 1.423 ± 0.19
2.847LysGln: 2.847 ± 0.696
2.847LysArg: 2.847 ± 1.074
4.982LysSer: 4.982 ± 0.373
3.203LysThr: 3.203 ± 1.06
6.406LysVal: 6.406 ± 1.658
1.068LysTrp: 1.068 ± 0.546
2.847LysTyr: 2.847 ± 0.381
0.0LysXaa: 0.0 ± 0.0
Leu
4.982LeuAla: 4.982 ± 0.846
2.491LeuCys: 2.491 ± 0.949
6.05LeuAsp: 6.05 ± 1.278
3.915LeuGlu: 3.915 ± 1.419
2.491LeuPhe: 2.491 ± 1.126
6.762LeuGly: 6.762 ± 0.188
0.356LeuHis: 0.356 ± 0.253
4.626LeuIle: 4.626 ± 0.81
3.203LeuLys: 3.203 ± 0.701
3.915LeuLeu: 3.915 ± 0.81
3.559LeuMet: 3.559 ± 1.068
4.626LeuAsn: 4.626 ± 0.823
2.491LeuPro: 2.491 ± 0.554
1.068LeuGln: 1.068 ± 0.487
3.915LeuArg: 3.915 ± 0.742
6.762LeuSer: 6.762 ± 0.863
2.847LeuThr: 2.847 ± 0.799
4.27LeuVal: 4.27 ± 0.711
0.712LeuTrp: 0.712 ± 0.6
1.779LeuTyr: 1.779 ± 0.214
0.0LeuXaa: 0.0 ± 0.0
Met
2.847MetAla: 2.847 ± 0.201
1.423MetCys: 1.423 ± 0.789
1.068MetAsp: 1.068 ± 0.068
3.915MetGlu: 3.915 ± 1.139
1.423MetPhe: 1.423 ± 0.343
2.847MetGly: 2.847 ± 1.132
1.068MetHis: 1.068 ± 0.76
0.356MetIle: 0.356 ± 0.3
3.203MetLys: 3.203 ± 0.664
2.491MetLeu: 2.491 ± 0.825
2.491MetMet: 2.491 ± 0.567
1.779MetAsn: 1.779 ± 0.613
1.779MetPro: 1.779 ± 0.385
1.068MetGln: 1.068 ± 0.546
3.559MetArg: 3.559 ± 0.486
2.847MetSer: 2.847 ± 0.303
3.203MetThr: 3.203 ± 0.851
4.626MetVal: 4.626 ± 1.617
0.356MetTrp: 0.356 ± 0.253
1.423MetTyr: 1.423 ± 0.328
0.0MetXaa: 0.0 ± 0.0
Asn
2.491AsnAla: 2.491 ± 0.132
0.356AsnCys: 0.356 ± 0.3
2.847AsnAsp: 2.847 ± 0.657
3.915AsnGlu: 3.915 ± 1.306
3.203AsnPhe: 3.203 ± 1.363
3.559AsnGly: 3.559 ± 0.46
0.356AsnHis: 0.356 ± 0.291
2.491AsnIle: 2.491 ± 0.567
2.135AsnLys: 2.135 ± 0.485
2.491AsnLeu: 2.491 ± 0.132
2.491AsnMet: 2.491 ± 0.456
1.068AsnAsn: 1.068 ± 0.399
1.423AsnPro: 1.423 ± 1.2
0.712AsnGln: 0.712 ± 0.246
2.491AsnArg: 2.491 ± 0.845
1.779AsnSer: 1.779 ± 0.385
3.559AsnThr: 3.559 ± 0.886
2.135AsnVal: 2.135 ± 0.136
0.0AsnTrp: 0.0 ± 0.0
0.356AsnTyr: 0.356 ± 0.3
0.356AsnXaa: 0.356 ± 0.3
Pro
1.779ProAla: 1.779 ± 0.214
0.712ProCys: 0.712 ± 0.246
2.847ProAsp: 2.847 ± 0.794
2.491ProGlu: 2.491 ± 1.126
0.712ProPhe: 0.712 ± 0.318
2.135ProGly: 2.135 ± 0.696
0.356ProHis: 0.356 ± 0.291
1.068ProIle: 1.068 ± 0.399
2.847ProLys: 2.847 ± 0.657
1.779ProLeu: 1.779 ± 0.715
0.712ProMet: 0.712 ± 0.246
1.068ProAsn: 1.068 ± 0.068
0.356ProPro: 0.356 ± 0.253
1.068ProGln: 1.068 ± 0.068
2.135ProArg: 2.135 ± 0.335
1.779ProSer: 1.779 ± 0.711
1.779ProThr: 1.779 ± 1.072
0.356ProVal: 0.356 ± 0.291
0.356ProTrp: 0.356 ± 0.291
1.779ProTyr: 1.779 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
1.068GlnAla: 1.068 ± 0.399
0.712GlnCys: 0.712 ± 0.507
1.068GlnAsp: 1.068 ± 0.9
1.779GlnGlu: 1.779 ± 0.443
1.779GlnPhe: 1.779 ± 0.783
1.068GlnGly: 1.068 ± 0.517
0.356GlnHis: 0.356 ± 0.3
1.068GlnIle: 1.068 ± 0.487
3.559GlnLys: 3.559 ± 0.414
3.203GlnLeu: 3.203 ± 0.493
2.491GlnMet: 2.491 ± 0.845
2.135GlnAsn: 2.135 ± 1.358
0.356GlnPro: 0.356 ± 0.3
1.779GlnGln: 1.779 ± 0.625
2.847GlnArg: 2.847 ± 0.201
0.712GlnSer: 0.712 ± 0.246
1.068GlnThr: 1.068 ± 0.517
2.135GlnVal: 2.135 ± 0.335
0.0GlnTrp: 0.0 ± 0.0
2.135GlnTyr: 2.135 ± 0.485
0.0GlnXaa: 0.0 ± 0.0
Arg
4.982ArgAla: 4.982 ± 1.076
1.068ArgCys: 1.068 ± 0.872
3.203ArgAsp: 3.203 ± 1.363
5.338ArgGlu: 5.338 ± 0.536
2.135ArgPhe: 2.135 ± 0.485
6.05ArgGly: 6.05 ± 0.691
1.423ArgHis: 1.423 ± 0.622
3.203ArgIle: 3.203 ± 1.144
4.27ArgLys: 4.27 ± 0.985
6.05ArgLeu: 6.05 ± 1.013
3.203ArgMet: 3.203 ± 0.851
1.779ArgAsn: 1.779 ± 0.711
0.712ArgPro: 0.712 ± 0.318
1.423ArgGln: 1.423 ± 0.328
4.626ArgArg: 4.626 ± 0.089
6.406ArgSer: 6.406 ± 2.053
2.135ArgThr: 2.135 ± 0.544
3.559ArgVal: 3.559 ± 0.486
1.423ArgTrp: 1.423 ± 0.492
3.203ArgTyr: 3.203 ± 0.734
0.0ArgXaa: 0.0 ± 0.0
Ser
3.559SerAla: 3.559 ± 1.211
0.0SerCys: 0.0 ± 0.0
4.27SerAsp: 4.27 ± 1.39
4.982SerGlu: 4.982 ± 1.232
2.491SerPhe: 2.491 ± 0.554
6.05SerGly: 6.05 ± 0.653
1.423SerHis: 1.423 ± 0.492
3.203SerIle: 3.203 ± 0.953
3.915SerLys: 3.915 ± 1.189
4.982SerLeu: 4.982 ± 0.24
2.847SerMet: 2.847 ± 0.894
2.135SerAsn: 2.135 ± 0.855
1.423SerPro: 1.423 ± 0.19
0.712SerGln: 0.712 ± 0.507
4.982SerArg: 4.982 ± 1.989
5.694SerSer: 5.694 ± 2.002
5.338SerThr: 5.338 ± 0.798
5.338SerVal: 5.338 ± 1.021
1.068SerTrp: 1.068 ± 0.068
2.847SerTyr: 2.847 ± 0.201
0.356SerXaa: 0.356 ± 0.3
Thr
3.915ThrAla: 3.915 ± 0.67
1.068ThrCys: 1.068 ± 0.531
2.135ThrAsp: 2.135 ± 1.355
3.559ThrGlu: 3.559 ± 0.583
2.135ThrPhe: 2.135 ± 0.335
2.135ThrGly: 2.135 ± 0.136
1.423ThrHis: 1.423 ± 0.794
3.203ThrIle: 3.203 ± 0.664
3.915ThrLys: 3.915 ± 0.321
3.203ThrLeu: 3.203 ± 0.851
1.068ThrMet: 1.068 ± 0.399
1.068ThrAsn: 1.068 ± 0.517
1.779ThrPro: 1.779 ± 0.842
1.779ThrGln: 1.779 ± 0.862
3.915ThrArg: 3.915 ± 0.333
2.491ThrSer: 2.491 ± 1.302
1.779ThrThr: 1.779 ± 0.385
4.27ThrVal: 4.27 ± 1.251
1.423ThrTrp: 1.423 ± 0.678
0.712ThrTyr: 0.712 ± 0.285
0.0ThrXaa: 0.0 ± 0.0
Val
3.915ValAla: 3.915 ± 0.96
0.356ValCys: 0.356 ± 0.291
4.982ValAsp: 4.982 ± 1.624
6.406ValGlu: 6.406 ± 0.623
3.203ValPhe: 3.203 ± 0.87
3.915ValGly: 3.915 ± 0.502
2.135ValHis: 2.135 ± 1.092
4.626ValIle: 4.626 ± 1.222
2.491ValLys: 2.491 ± 1.641
3.915ValLeu: 3.915 ± 0.945
2.491ValMet: 2.491 ± 0.456
2.491ValAsn: 2.491 ± 0.949
3.203ValPro: 3.203 ± 1.106
3.559ValGln: 3.559 ± 1.255
4.982ValArg: 4.982 ± 0.686
4.27ValSer: 4.27 ± 1.848
4.626ValThr: 4.626 ± 0.88
4.27ValVal: 4.27 ± 0.731
2.135ValTrp: 2.135 ± 0.973
2.847ValTyr: 2.847 ± 1.164
0.356ValXaa: 0.356 ± 0.253
Trp
1.423TrpAla: 1.423 ± 0.328
0.356TrpCys: 0.356 ± 0.253
0.712TrpAsp: 0.712 ± 0.246
1.779TrpGlu: 1.779 ± 0.443
0.356TrpPhe: 0.356 ± 0.253
0.712TrpGly: 0.712 ± 0.285
0.712TrpHis: 0.712 ± 0.581
0.0TrpIle: 0.0 ± 0.0
1.068TrpLys: 1.068 ± 0.517
1.423TrpLeu: 1.423 ± 0.19
0.356TrpMet: 0.356 ± 0.3
1.068TrpAsn: 1.068 ± 0.487
0.0TrpPro: 0.0 ± 0.0
0.356TrpGln: 0.356 ± 0.253
1.068TrpArg: 1.068 ± 0.399
1.779TrpSer: 1.779 ± 0.214
0.712TrpThr: 0.712 ± 0.246
1.423TrpVal: 1.423 ± 0.492
0.0TrpTrp: 0.0 ± 0.0
0.356TrpTyr: 0.356 ± 0.3
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.847TyrAla: 2.847 ± 1.074
0.712TyrCys: 0.712 ± 0.318
1.423TyrAsp: 1.423 ± 0.343
3.559TyrGlu: 3.559 ± 0.969
2.135TyrPhe: 2.135 ± 1.108
4.27TyrGly: 4.27 ± 0.372
0.712TyrHis: 0.712 ± 0.246
1.068TyrIle: 1.068 ± 0.546
2.491TyrLys: 2.491 ± 0.4
2.847TyrLeu: 2.847 ± 1.074
1.779TyrMet: 1.779 ± 0.443
1.779TyrAsn: 1.779 ± 0.286
2.135TyrPro: 2.135 ± 0.922
1.423TyrGln: 1.423 ± 0.492
2.135TyrArg: 2.135 ± 0.855
3.203TyrSer: 3.203 ± 0.389
1.423TyrThr: 1.423 ± 0.678
1.423TyrVal: 1.423 ± 0.492
0.356TyrTrp: 0.356 ± 0.253
0.356TyrTyr: 0.356 ± 0.253
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.356XaaAla: 0.356 ± 0.3
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.356XaaGlu: 0.356 ± 0.253
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.356XaaLys: 0.356 ± 0.3
0.356XaaLeu: 0.356 ± 0.253
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.356XaaThr: 0.356 ± 0.3
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
1.068XaaXaa: 1.068 ± 0.9
Statistics based on 3 proteins (2811 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski