Amino acid dipeptide frequency for Campoletis sonorensis ichnovirus (strain Texas A&M) (CsIV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, with all 20 amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.
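
The calculation described above can be sketched in Python as follows. This is only an illustration: the function name `dipeptide_frequencies`, the toy sequences, and the choice to count overlapping pairs within each protein separately are assumptions, not the database's actual pipeline.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # one-letter codes of the 20 standard amino acids


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions: overlapping pairs are counted within each protein (no pair
    spans two proteins), and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences, not taken from the CsIV proteome.
freqs = dipeptide_frequencies(["MKLLA", "AAKL"])

# Uniform expectation if all 400 standard dipeptides were equally likely.
uniform_expectation = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
```

The one-letter codes used here correspond to the three-letter codes in the table (for example, `KL` is LysLeu).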

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
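
As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical, and using the uniform expectation as the threshold for "over" and "under" is an assumption; the page does not state exactly how the expected value behind its colouring is defined.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 400 equally likely dipeptides


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the assumed expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


glu_leu = 10.989  # GluLeu value from the table, in per mille
fraction = per_mille_to_fraction(glu_leu)  # roughly 0.010989, about 1.1%
label = classify(glu_leu)
```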

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.121 ± 2.091
AlaCys: 0.0 ± 0.0
AlaAsp: 4.808 ± 1.343
AlaGlu: 4.808 ± 1.446
AlaPhe: 2.06 ± 1.816
AlaGly: 3.434 ± 1.754
AlaHis: 0.687 ± 0.585
AlaIle: 3.434 ± 1.663
AlaLys: 1.374 ± 0.771
AlaLeu: 2.747 ± 1.739
AlaMet: 0.687 ± 0.612
AlaAsn: 0.687 ± 0.762
AlaPro: 4.121 ± 1.58
AlaGln: 1.374 ± 0.697
AlaArg: 1.374 ± 0.809
AlaSer: 2.06 ± 1.296
AlaThr: 0.687 ± 0.612
AlaVal: 6.181 ± 0.576
AlaTrp: 2.06 ± 0.711
AlaTyr: 2.06 ± 1.084
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.687 ± 0.585
CysCys: 3.434 ± 1.113
CysAsp: 2.06 ± 1.543
CysGlu: 2.06 ± 0.535
CysPhe: 1.374 ± 0.936
CysGly: 1.374 ± 0.77
CysHis: 0.0 ± 0.0
CysIle: 3.434 ± 2.309
CysLys: 0.0 ± 0.0
CysLeu: 2.06 ± 0.711
CysMet: 0.687 ± 0.762
CysAsn: 1.374 ± 1.224
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.06 ± 1.22
CysSer: 2.06 ± 1.175
CysThr: 2.747 ± 0.742
CysVal: 2.06 ± 0.535
CysTrp: 0.687 ± 0.585
CysTyr: 2.06 ± 1.073
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.06 ± 1.168
AspCys: 0.687 ± 0.605
AspAsp: 0.0 ± 0.0
AspGlu: 2.747 ± 1.464
AspPhe: 2.747 ± 1.624
AspGly: 5.495 ± 2.085
AspHis: 0.687 ± 0.612
AspIle: 2.06 ± 0.535
AspLys: 2.06 ± 1.543
AspLeu: 1.374 ± 0.809
AspMet: 0.687 ± 0.616
AspAsn: 3.434 ± 0.412
AspPro: 2.747 ± 1.538
AspGln: 1.374 ± 0.936
AspArg: 3.434 ± 0.855
AspSer: 2.06 ± 1.543
AspThr: 1.374 ± 0.77
AspVal: 6.181 ± 1.578
AspTrp: 2.06 ± 0.766
AspTyr: 2.06 ± 1.29
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.06 ± 1.754
GluCys: 0.687 ± 0.612
GluAsp: 8.242 ± 1.621
GluGlu: 5.495 ± 1.886
GluPhe: 4.808 ± 2.426
GluGly: 3.434 ± 1.412
GluHis: 0.687 ± 0.762
GluIle: 0.687 ± 0.762
GluLys: 2.747 ± 1.197
GluLeu: 10.989 ± 3.293
GluMet: 1.374 ± 1.322
GluAsn: 4.808 ± 1.745
GluPro: 5.495 ± 1.993
GluGln: 0.687 ± 0.612
GluArg: 2.06 ± 1.29
GluSer: 7.555 ± 3.122
GluThr: 4.121 ± 1.364
GluVal: 1.374 ± 0.883
GluTrp: 1.374 ± 0.629
GluTyr: 1.374 ± 0.698
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.808 ± 1.457
PheCys: 2.06 ± 1.084
PheAsp: 2.06 ± 0.803
PheGlu: 2.06 ± 0.949
PhePhe: 4.121 ± 0.72
PheGly: 4.121 ± 1.916
PheHis: 2.06 ± 0.933
PheIle: 2.06 ± 1.296
PheLys: 3.434 ± 1.516
PheLeu: 9.615 ± 1.864
PheMet: 2.06 ± 0.714
PheAsn: 2.06 ± 1.816
PhePro: 3.434 ± 1.651
PheGln: 0.687 ± 0.585
PheArg: 4.121 ± 2.146
PheSer: 2.06 ± 1.073
PheThr: 2.06 ± 0.711
PheVal: 1.374 ± 0.77
PheTrp: 1.374 ± 0.629
PheTyr: 4.121 ± 2.146
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.374 ± 0.936
GlyCys: 1.374 ± 0.698
GlyAsp: 2.06 ± 0.766
GlyGlu: 4.808 ± 1.65
GlyPhe: 2.06 ± 1.462
GlyGly: 4.121 ± 1.069
GlyHis: 0.0 ± 0.0
GlyIle: 8.929 ± 1.033
GlyLys: 5.495 ± 1.392
GlyLeu: 4.121 ± 1.499
GlyMet: 0.687 ± 1.111
GlyAsn: 2.06 ± 0.535
GlyPro: 4.121 ± 1.567
GlyGln: 2.747 ± 1.189
GlyArg: 3.434 ± 2.224
GlySer: 2.06 ± 0.535
GlyThr: 3.434 ± 1.44
GlyVal: 1.374 ± 0.936
GlyTrp: 0.687 ± 0.616
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.687 ± 0.612
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.374 ± 1.524
HisPhe: 1.374 ± 0.771
HisGly: 1.374 ± 0.697
HisHis: 0.687 ± 0.612
HisIle: 0.687 ± 0.762
HisLys: 1.374 ± 0.821
HisLeu: 0.687 ± 0.605
HisMet: 1.374 ± 0.629
HisAsn: 1.374 ± 0.629
HisPro: 0.687 ± 0.585
HisGln: 2.06 ± 0.949
HisArg: 1.374 ± 0.629
HisSer: 3.434 ± 1.538
HisThr: 1.374 ± 0.698
HisVal: 1.374 ± 1.224
HisTrp: 0.0 ± 0.0
HisTyr: 3.434 ± 1.663
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.747 ± 0.914
IleCys: 3.434 ± 1.44
IleAsp: 4.808 ± 1.757
IleGlu: 4.121 ± 2.014
IlePhe: 3.434 ± 1.5
IleGly: 2.747 ± 0.905
IleHis: 0.0 ± 0.0
IleIle: 0.0 ± 0.0
IleLys: 4.121 ± 1.389
IleLeu: 5.495 ± 2.517
IleMet: 1.374 ± 1.211
IleAsn: 5.495 ± 2.085
IlePro: 3.434 ± 0.827
IleGln: 0.0 ± 0.0
IleArg: 2.747 ± 1.034
IleSer: 6.181 ± 2.873
IleThr: 4.808 ± 1.65
IleVal: 4.121 ± 1.578
IleTrp: 1.374 ± 0.629
IleTyr: 4.121 ± 1.069
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.374 ± 0.629
LysCys: 1.374 ± 0.869
LysAsp: 2.06 ± 1.29
LysGlu: 4.121 ± 1.341
LysPhe: 2.747 ± 1.464
LysGly: 1.374 ± 0.697
LysHis: 2.06 ± 0.766
LysIle: 5.495 ± 1.827
LysLys: 0.687 ± 0.762
LysLeu: 5.495 ± 1.081
LysMet: 2.747 ± 1.644
LysAsn: 2.06 ± 1.296
LysPro: 10.989 ± 4.88
LysGln: 2.06 ± 0.973
LysArg: 1.374 ± 0.698
LysSer: 4.808 ± 1.51
LysThr: 2.06 ± 1.29
LysVal: 4.121 ± 1.391
LysTrp: 0.0 ± 0.0
LysTyr: 3.434 ± 0.974
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.302 ± 3.62
LeuCys: 2.747 ± 1.395
LeuAsp: 2.06 ± 1.543
LeuGlu: 4.121 ± 2.166
LeuPhe: 5.495 ± 0.87
LeuGly: 6.868 ± 1.588
LeuHis: 6.868 ± 1.639
LeuIle: 2.747 ± 0.503
LeuLys: 4.808 ± 2.269
LeuLeu: 7.555 ± 2.29
LeuMet: 1.374 ± 0.779
LeuAsn: 6.181 ± 0.995
LeuPro: 4.121 ± 1.888
LeuGln: 2.06 ± 1.084
LeuArg: 1.374 ± 0.629
LeuSer: 8.929 ± 2.396
LeuThr: 6.181 ± 1.44
LeuVal: 5.495 ± 1.809
LeuTrp: 2.06 ± 0.766
LeuTyr: 4.121 ± 1.485
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.687 ± 0.762
MetCys: 0.0 ± 0.0
MetAsp: 0.687 ± 0.605
MetGlu: 2.06 ± 0.725
MetPhe: 1.374 ± 1.211
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.687 ± 0.612
MetLys: 4.121 ± 1.234
MetLeu: 4.808 ± 2.716
MetMet: 0.687 ± 0.605
MetAsn: 2.06 ± 1.168
MetPro: 1.374 ± 0.883
MetGln: 0.0 ± 0.0
MetArg: 0.0 ± 0.0
MetSer: 0.687 ± 0.605
MetThr: 0.687 ± 0.605
MetVal: 3.434 ± 1.552
MetTrp: 0.687 ± 0.612
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.06 ± 0.725
AsnCys: 0.0 ± 0.0
AsnAsp: 2.06 ± 0.949
AsnGlu: 1.374 ± 1.169
AsnPhe: 4.808 ± 1.05
AsnGly: 2.06 ± 1.084
AsnHis: 0.687 ± 0.605
AsnIle: 4.121 ± 1.452
AsnLys: 3.434 ± 0.855
AsnLeu: 4.808 ± 1.897
AsnMet: 1.374 ± 0.697
AsnAsn: 3.434 ± 1.455
AsnPro: 2.747 ± 1.098
AsnGln: 0.687 ± 0.605
AsnArg: 0.687 ± 0.585
AsnSer: 3.434 ± 0.827
AsnThr: 0.687 ± 0.605
AsnVal: 2.747 ± 1.002
AsnTrp: 2.747 ± 0.844
AsnTyr: 3.434 ± 1.445
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.374 ± 0.936
ProCys: 4.121 ± 2.294
ProAsp: 2.06 ± 0.766
ProGlu: 8.929 ± 2.888
ProPhe: 0.687 ± 0.585
ProGly: 3.434 ± 0.827
ProHis: 2.06 ± 1.073
ProIle: 4.808 ± 2.073
ProLys: 5.495 ± 2.534
ProLeu: 7.555 ± 1.357
ProMet: 2.06 ± 0.535
ProAsn: 2.06 ± 1.266
ProPro: 2.06 ± 0.725
ProGln: 2.06 ± 0.535
ProArg: 0.687 ± 0.585
ProSer: 3.434 ± 0.974
ProThr: 2.06 ± 1.29
ProVal: 4.121 ± 0.879
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.747 ± 0.742
GlnCys: 1.374 ± 1.169
GlnAsp: 2.747 ± 1.872
GlnGlu: 0.687 ± 0.612
GlnPhe: 3.434 ± 0.827
GlnGly: 1.374 ± 0.77
GlnHis: 1.374 ± 0.869
GlnIle: 2.747 ± 1.098
GlnLys: 0.0 ± 0.0
GlnLeu: 2.06 ± 1.266
GlnMet: 0.687 ± 0.612
GlnAsn: 0.687 ± 0.605
GlnPro: 0.687 ± 0.585
GlnGln: 2.747 ± 1.858
GlnArg: 0.687 ± 0.612
GlnSer: 3.434 ± 1.081
GlnThr: 0.0 ± 0.0
GlnVal: 2.06 ± 0.949
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.374 ± 0.771
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.687 ± 0.605
ArgCys: 1.374 ± 0.698
ArgAsp: 1.374 ± 1.211
ArgGlu: 3.434 ± 1.754
ArgPhe: 4.121 ± 1.069
ArgGly: 2.06 ± 0.725
ArgHis: 0.0 ± 0.0
ArgIle: 2.747 ± 1.624
ArgLys: 1.374 ± 1.169
ArgLeu: 6.868 ± 2.022
ArgMet: 0.0 ± 0.0
ArgAsn: 2.06 ± 0.857
ArgPro: 2.06 ± 0.799
ArgGln: 2.06 ± 1.073
ArgArg: 0.687 ± 0.605
ArgSer: 2.747 ± 0.914
ArgThr: 2.06 ± 1.132
ArgVal: 0.0 ± 0.0
ArgTrp: 0.0 ± 0.0
ArgTyr: 0.0 ± 0.0
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.06 ± 0.857
SerCys: 0.687 ± 0.616
SerAsp: 2.06 ± 0.535
SerGlu: 3.434 ± 1.647
SerPhe: 1.374 ± 1.224
SerGly: 5.495 ± 2.123
SerHis: 2.06 ± 1.084
SerIle: 6.181 ± 3.858
SerLys: 4.121 ± 2.183
SerLeu: 4.121 ± 2.795
SerMet: 2.06 ± 1.073
SerAsn: 2.06 ± 1.29
SerPro: 6.868 ± 2.418
SerGln: 2.747 ± 1.245
SerArg: 3.434 ± 1.685
SerSer: 8.929 ± 3.417
SerThr: 9.615 ± 2.416
SerVal: 4.121 ± 1.182
SerTrp: 0.0 ± 0.0
SerTyr: 2.06 ± 0.803
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.434 ± 1.164
ThrCys: 2.06 ± 0.711
ThrAsp: 2.06 ± 1.135
ThrGlu: 4.808 ± 2.71
ThrPhe: 2.747 ± 1.098
ThrGly: 1.374 ± 0.629
ThrHis: 1.374 ± 0.821
ThrIle: 1.374 ± 0.936
ThrLys: 6.868 ± 1.828
ThrLeu: 9.615 ± 1.601
ThrMet: 0.687 ± 0.605
ThrAsn: 2.06 ± 0.973
ThrPro: 2.06 ± 0.949
ThrGln: 0.687 ± 0.762
ThrArg: 2.06 ± 1.084
ThrSer: 4.808 ± 1.208
ThrThr: 2.06 ± 1.073
ThrVal: 2.747 ± 0.927
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.687 ± 0.605
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.747 ± 1.767
ValCys: 1.374 ± 0.77
ValAsp: 2.06 ± 1.519
ValGlu: 4.808 ± 1.229
ValPhe: 1.374 ± 0.629
ValGly: 2.747 ± 1.644
ValHis: 2.747 ± 1.259
ValIle: 7.555 ± 1.82
ValLys: 2.06 ± 0.799
ValLeu: 4.121 ± 2.13
ValMet: 2.747 ± 1.242
ValAsn: 2.747 ± 0.927
ValPro: 1.374 ± 1.224
ValGln: 6.181 ± 1.598
ValArg: 0.687 ± 0.612
ValSer: 4.121 ± 2.333
ValThr: 6.868 ± 1.473
ValVal: 9.615 ± 1.869
ValTrp: 0.0 ± 0.0
ValTyr: 1.374 ± 0.629
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.687 ± 0.616
TrpCys: 0.0 ± 0.0
TrpAsp: 1.374 ± 1.524
TrpGlu: 2.747 ± 0.914
TrpPhe: 4.808 ± 1.242
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.687 ± 0.605
TrpLys: 2.747 ± 1.189
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.687 ± 0.762
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.687 ± 0.616
TrpVal: 1.374 ± 0.698
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.687 ± 0.585
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.06 ± 0.799
TyrCys: 3.434 ± 0.412
TyrAsp: 0.687 ± 0.605
TyrGlu: 2.06 ± 0.535
TyrPhe: 4.808 ± 1.897
TyrGly: 2.747 ± 0.503
TyrHis: 0.687 ± 0.612
TyrIle: 4.121 ± 1.069
TyrLys: 4.121 ± 1.394
TyrLeu: 1.374 ± 0.698
TyrMet: 0.0 ± 0.0
TyrAsn: 0.687 ± 0.612
TyrPro: 0.0 ± 0.0
TyrGln: 0.687 ± 0.585
TyrArg: 3.434 ± 1.651
TyrSer: 0.687 ± 0.605
TyrThr: 0.0 ± 0.0
TyrVal: 3.434 ± 1.21
TyrTrp: 1.374 ± 0.936
TyrTyr: 1.374 ± 0.629
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1457 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
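
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the reported error are all assumptions for illustration; the page does not specify which spread measure it uses.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resampled sets of whole proteins.

    Assumption: the error is the standard deviation of the bootstrap estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from the CsIV proteome.
toy_proteins = ["MKLLA", "AAKL", "KLKL"]
error_kl = bootstrap_error(toy_proteins, "KL")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins; with only 5 proteins in this proteome, the resulting errors are large relative to the values.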

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski