Amino acid dipeptide frequency for Wenzhou channeled applesnail virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
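As a rough illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes overlapping pairs counted within each protein only (never across protein boundaries) and one-letter residue codes; the function name `dipeptide_permille` and the toy sequences are hypothetical, and this is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are taken within each protein only, and the
    frequency is the pair count divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not the virus proteome).
toy = ["MKAAL", "AAG"]
freqs = dipeptide_permille(toy)
```

In the toy input there are six pairs in total, two of which are `AA`, so `freqs["AA"]` is one third of 1000‰.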

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
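The expected value and the over/under comparison can be sketched in a few lines of Python. This assumes the uniform 1/400 baseline described above is the threshold; the exact rule used for the red and blue marking on the page is not stated, so `classify` is only a hypothetical stand-in.

```python
# Expected frequency (in per mille) if every dipeptide were equally likely.
expected_400 = 1000.0 / 20 ** 2  # 20 standard amino acids -> 2.5
expected_441 = 1000.0 / 21 ** 2  # with Xaa as a 21st residue -> about 2.27


def classify(observed_permille, expected_permille=expected_400):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


# Two values taken from the table: IleSer (9.961) and CysCys (0.321).
ile_ser = classify(9.961)
cys_cys = classify(0.321)
```

Under this rule IleSer falls above the 2.5‰ baseline and CysCys below it.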

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 4.177‰ = 0.004177).

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.177 ± 0.711
AlaCys: 1.285 ± 0.731
AlaAsp: 3.213 ± 0.745
AlaGlu: 3.535 ± 0.562
AlaPhe: 2.892 ± 1.443
AlaGly: 5.141 ± 0.678
AlaHis: 1.607 ± 0.4
AlaIle: 6.427 ± 0.054
AlaLys: 3.213 ± 0.23
AlaLeu: 5.463 ± 0.02
AlaMet: 1.928 ± 0.503
AlaAsn: 2.571 ± 0.433
AlaPro: 3.213 ± 1.26
AlaGln: 3.856 ± 0.894
AlaArg: 2.571 ± 0.948
AlaSer: 5.463 ± 3.583
AlaThr: 3.213 ± 1.26
AlaVal: 5.141 ± 0.163
AlaTrp: 1.607 ± 0.63
AlaTyr: 2.892 ± 0.928
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 2.892 ± 0.413
CysCys: 0.321 ± 0.183
CysAsp: 0.643 ± 0.149
CysGlu: 1.285 ± 0.217
CysPhe: 1.928 ± 0.582
CysGly: 1.285 ± 0.731
CysHis: 0.321 ± 0.183
CysIle: 0.0 ± 0.0
CysLys: 2.571 ± 0.433
CysLeu: 1.285 ± 0.217
CysMet: 0.964 ± 0.034
CysAsn: 0.964 ± 0.034
CysPro: 0.321 ± 0.332
CysGln: 0.321 ± 0.183
CysArg: 1.285 ± 0.731
CysSer: 3.213 ± 1.829
CysThr: 0.964 ± 0.034
CysVal: 1.285 ± 0.217
CysTrp: 0.643 ± 0.366
CysTyr: 0.643 ± 0.149
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.177 ± 0.197
AspCys: 1.285 ± 0.731
AspAsp: 4.82 ± 1.199
AspGlu: 4.82 ± 0.346
AspPhe: 4.177 ± 1.863
AspGly: 4.82 ± 0.169
AspHis: 0.643 ± 0.149
AspIle: 3.856 ± 0.135
AspLys: 1.285 ± 0.217
AspLeu: 2.571 ± 0.596
AspMet: 1.285 ± 0.217
AspAsn: 3.535 ± 0.048
AspPro: 2.571 ± 0.433
AspGln: 0.964 ± 0.034
AspArg: 1.607 ± 0.4
AspSer: 4.177 ± 0.711
AspThr: 2.249 ± 0.264
AspVal: 3.213 ± 1.775
AspTrp: 0.964 ± 0.549
AspTyr: 1.285 ± 0.217
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.748 ± 1.781
GluCys: 0.964 ± 0.549
GluAsp: 2.571 ± 0.596
GluGlu: 5.784 ± 0.718
GluPhe: 5.141 ± 0.163
GluGly: 1.928 ± 0.582
GluHis: 0.643 ± 0.366
GluIle: 3.856 ± 0.894
GluLys: 2.571 ± 1.463
GluLeu: 5.141 ± 1.382
GluMet: 0.964 ± 0.034
GluAsn: 4.177 ± 1.348
GluPro: 1.607 ± 0.914
GluGln: 1.285 ± 0.731
GluArg: 3.213 ± 1.314
GluSer: 2.892 ± 0.616
GluThr: 0.964 ± 0.481
GluVal: 3.856 ± 1.165
GluTrp: 1.285 ± 0.217
GluTyr: 4.177 ± 0.318
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.82 ± 2.405
PheCys: 2.249 ± 0.264
PheAsp: 2.571 ± 0.948
PheGlu: 2.249 ± 0.251
PhePhe: 1.607 ± 0.914
PheGly: 4.499 ± 0.501
PheHis: 0.321 ± 0.332
PheIle: 2.571 ± 0.081
PheLys: 2.571 ± 0.948
PheLeu: 4.177 ± 1.863
PheMet: 1.607 ± 0.4
PheAsn: 4.177 ± 0.318
PhePro: 1.928 ± 0.962
PheGln: 1.928 ± 0.068
PheArg: 1.607 ± 0.115
PheSer: 4.177 ± 3.285
PheThr: 4.499 ± 1.043
PheVal: 4.177 ± 0.197
PheTrp: 0.964 ± 0.481
PheTyr: 1.607 ± 0.115
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.856 ± 2.439
GlyCys: 1.285 ± 0.731
GlyAsp: 5.784 ± 1.233
GlyGlu: 4.499 ± 1.531
GlyPhe: 3.535 ± 0.048
GlyGly: 1.285 ± 0.298
GlyHis: 1.285 ± 0.731
GlyIle: 2.571 ± 0.596
GlyLys: 5.784 ± 1.747
GlyLeu: 2.249 ± 0.251
GlyMet: 0.964 ± 0.549
GlyAsn: 4.499 ± 1.043
GlyPro: 1.607 ± 0.115
GlyGln: 1.928 ± 0.447
GlyArg: 1.285 ± 0.217
GlySer: 3.535 ± 2.621
GlyThr: 3.535 ± 0.562
GlyVal: 4.177 ± 1.741
GlyTrp: 0.321 ± 0.332
GlyTyr: 3.535 ± 0.048
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.964 ± 0.034
HisCys: 0.321 ± 0.183
HisAsp: 0.964 ± 0.549
HisGlu: 0.964 ± 0.549
HisPhe: 1.285 ± 0.217
HisGly: 1.928 ± 0.068
HisHis: 0.643 ± 0.366
HisIle: 0.321 ± 0.332
HisLys: 1.285 ± 0.731
HisLeu: 2.892 ± 1.646
HisMet: 0.964 ± 0.034
HisAsn: 1.285 ± 0.731
HisPro: 0.964 ± 0.034
HisGln: 1.607 ± 0.4
HisArg: 0.964 ± 0.549
HisSer: 0.0 ± 0.0
HisThr: 1.607 ± 0.4
HisVal: 2.571 ± 0.948
HisTrp: 0.321 ± 0.183
HisTyr: 0.964 ± 0.034
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.249 ± 1.28
IleCys: 1.607 ± 0.63
IleAsp: 5.141 ± 0.352
IleGlu: 5.463 ± 1.05
IlePhe: 3.535 ± 0.562
IleGly: 3.856 ± 0.135
IleHis: 0.964 ± 0.549
IleIle: 2.892 ± 1.131
IleLys: 4.499 ± 2.045
IleLeu: 7.069 ± 0.934
IleMet: 0.964 ± 0.034
IleAsn: 4.82 ± 0.346
IlePro: 3.213 ± 1.775
IleGln: 1.285 ± 0.217
IleArg: 2.892 ± 0.616
IleSer: 9.961 ± 2.568
IleThr: 2.571 ± 0.081
IleVal: 3.856 ± 0.379
IleTrp: 1.285 ± 0.298
IleTyr: 3.213 ± 0.23
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.213 ± 0.284
LysCys: 1.607 ± 0.914
LysAsp: 2.571 ± 1.463
LysGlu: 2.571 ± 1.463
LysPhe: 2.571 ± 0.948
LysGly: 1.607 ± 0.4
LysHis: 1.285 ± 0.731
LysIle: 7.712 ± 1.3
LysLys: 2.571 ± 0.948
LysLeu: 3.213 ± 0.799
LysMet: 2.571 ± 1.463
LysAsn: 1.607 ± 0.4
LysPro: 3.856 ± 0.379
LysGln: 1.928 ± 0.447
LysArg: 1.928 ± 0.582
LysSer: 4.177 ± 1.348
LysThr: 3.856 ± 0.379
LysVal: 4.499 ± 0.501
LysTrp: 1.285 ± 0.731
LysTyr: 0.643 ± 0.149
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.463 ± 1.524
LeuCys: 2.249 ± 0.765
LeuAsp: 3.213 ± 0.23
LeuGlu: 4.499 ± 2.56
LeuPhe: 2.571 ± 0.596
LeuGly: 3.213 ± 1.314
LeuHis: 1.285 ± 0.731
LeuIle: 5.141 ± 1.896
LeuLys: 4.82 ± 2.228
LeuLeu: 3.213 ± 0.23
LeuMet: 0.321 ± 0.183
LeuAsn: 4.499 ± 1.531
LeuPro: 2.249 ± 0.264
LeuGln: 1.928 ± 0.447
LeuArg: 6.105 ± 0.129
LeuSer: 8.997 ± 0.487
LeuThr: 7.391 ± 1.971
LeuVal: 3.213 ± 0.23
LeuTrp: 0.643 ± 0.149
LeuTyr: 1.607 ± 0.914
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.285 ± 0.298
MetCys: 0.643 ± 0.366
MetAsp: 1.285 ± 0.217
MetGlu: 1.928 ± 1.097
MetPhe: 0.964 ± 0.034
MetGly: 1.285 ± 0.813
MetHis: 0.643 ± 0.366
MetIle: 3.856 ± 1.165
MetLys: 1.285 ± 0.731
MetLeu: 2.249 ± 0.765
MetMet: 0.321 ± 0.183
MetAsn: 0.964 ± 0.549
MetPro: 0.643 ± 0.664
MetGln: 0.0 ± 0.0
MetArg: 1.285 ± 0.217
MetSer: 2.249 ± 0.264
MetThr: 1.285 ± 0.217
MetVal: 1.607 ± 0.115
MetTrp: 0.0 ± 0.0
MetTyr: 0.643 ± 0.149
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.856 ± 1.924
AsnCys: 2.571 ± 0.948
AsnAsp: 1.928 ± 0.582
AsnGlu: 3.213 ± 1.829
AsnPhe: 2.571 ± 0.081
AsnGly: 2.571 ± 0.081
AsnHis: 2.249 ± 1.28
AsnIle: 3.856 ± 0.379
AsnLys: 0.964 ± 0.481
AsnLeu: 5.463 ± 0.535
AsnMet: 0.321 ± 0.183
AsnAsn: 4.499 ± 0.014
AsnPro: 3.213 ± 0.284
AsnGln: 3.213 ± 0.799
AsnArg: 2.892 ± 0.101
AsnSer: 7.069 ± 1.125
AsnThr: 3.535 ± 1.592
AsnVal: 5.463 ± 0.495
AsnTrp: 0.643 ± 0.366
AsnTyr: 1.928 ± 0.068
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.892 ± 1.958
ProCys: 0.964 ± 0.996
ProAsp: 1.607 ± 1.145
ProGlu: 3.213 ± 0.284
ProPhe: 1.928 ± 0.447
ProGly: 1.607 ± 1.145
ProHis: 0.643 ± 0.366
ProIle: 3.856 ± 0.135
ProLys: 2.571 ± 0.433
ProLeu: 3.856 ± 0.65
ProMet: 1.928 ± 0.068
ProAsn: 2.571 ± 0.433
ProPro: 2.249 ± 1.294
ProGln: 1.285 ± 0.731
ProArg: 0.321 ± 0.183
ProSer: 2.892 ± 0.101
ProThr: 4.499 ± 1.558
ProVal: 3.535 ± 1.592
ProTrp: 0.643 ± 0.149
ProTyr: 1.928 ± 0.447
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.892 ± 0.928
GlnCys: 0.643 ± 0.149
GlnAsp: 0.964 ± 0.034
GlnGlu: 1.285 ± 0.731
GlnPhe: 1.285 ± 0.298
GlnGly: 1.928 ± 0.962
GlnHis: 0.643 ± 0.366
GlnIle: 4.177 ± 0.318
GlnLys: 0.321 ± 0.183
GlnLeu: 2.571 ± 0.081
GlnMet: 0.321 ± 0.183
GlnAsn: 1.607 ± 0.115
GlnPro: 1.607 ± 0.4
GlnGln: 0.321 ± 0.332
GlnArg: 0.964 ± 0.549
GlnSer: 1.928 ± 0.068
GlnThr: 0.964 ± 0.034
GlnVal: 1.928 ± 0.582
GlnTrp: 0.321 ± 0.183
GlnTyr: 0.321 ± 0.183
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.249 ± 0.765
ArgCys: 0.643 ± 0.366
ArgAsp: 2.892 ± 0.101
ArgGlu: 2.249 ± 0.765
ArgPhe: 2.892 ± 0.101
ArgGly: 1.607 ± 0.115
ArgHis: 1.607 ± 0.4
ArgIle: 5.463 ± 0.02
ArgLys: 2.892 ± 1.131
ArgLeu: 0.964 ± 0.034
ArgMet: 0.964 ± 0.034
ArgAsn: 1.928 ± 0.582
ArgPro: 1.928 ± 0.582
ArgGln: 1.285 ± 0.731
ArgArg: 3.856 ± 0.135
ArgSer: 2.249 ± 0.765
ArgThr: 1.607 ± 0.115
ArgVal: 3.535 ± 0.467
ArgTrp: 0.321 ± 0.183
ArgTyr: 1.607 ± 0.115
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.105 ± 1.159
SerCys: 0.643 ± 0.664
SerAsp: 5.463 ± 2.039
SerGlu: 3.213 ± 0.745
SerPhe: 4.82 ± 0.86
SerGly: 8.997 ± 0.027
SerHis: 1.285 ± 0.298
SerIle: 6.105 ± 2.188
SerLys: 5.784 ± 0.312
SerLeu: 5.141 ± 1.192
SerMet: 2.249 ± 0.264
SerAsn: 5.784 ± 0.312
SerPro: 3.856 ± 1.409
SerGln: 1.285 ± 0.217
SerArg: 2.892 ± 0.413
SerSer: 9.319 ± 2.419
SerThr: 3.535 ± 1.077
SerVal: 6.427 ± 2.52
SerTrp: 1.928 ± 0.447
SerTyr: 2.249 ± 0.765
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.856 ± 1.409
ThrCys: 0.643 ± 0.366
ThrAsp: 3.213 ± 1.775
ThrGlu: 2.571 ± 0.433
ThrPhe: 2.249 ± 0.779
ThrGly: 4.499 ± 3.102
ThrHis: 3.535 ± 0.467
ThrIle: 2.571 ± 0.948
ThrLys: 3.856 ± 0.135
ThrLeu: 2.892 ± 0.101
ThrMet: 2.249 ± 0.141
ThrAsn: 3.535 ± 2.107
ThrPro: 3.535 ± 1.592
ThrGln: 1.928 ± 0.962
ThrArg: 1.928 ± 0.582
ThrSer: 5.463 ± 3.069
ThrThr: 3.213 ± 1.26
ThrVal: 1.928 ± 0.447
ThrTrp: 0.643 ± 0.149
ThrTyr: 1.928 ± 0.068
ThrXaa: 0.0 ± 0.0

Val
ValAla: 5.141 ± 0.163
ValCys: 1.607 ± 0.914
ValAsp: 2.249 ± 0.765
ValGlu: 4.499 ± 1.043
ValPhe: 4.82 ± 1.375
ValGly: 3.856 ± 0.379
ValHis: 1.607 ± 0.914
ValIle: 1.928 ± 0.068
ValLys: 2.892 ± 0.101
ValLeu: 8.355 ± 0.636
ValMet: 1.607 ± 0.115
ValAsn: 5.463 ± 2.039
ValPro: 3.856 ± 0.135
ValGln: 0.321 ± 0.183
ValArg: 1.928 ± 0.582
ValSer: 6.748 ± 2.337
ValThr: 3.213 ± 1.775
ValVal: 3.856 ± 0.65
ValTrp: 0.643 ± 0.149
ValTyr: 0.964 ± 0.481
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.643 ± 0.366
TrpCys: 0.0 ± 0.0
TrpAsp: 0.643 ± 0.366
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.285 ± 0.813
TrpGly: 0.321 ± 0.332
TrpHis: 0.643 ± 0.149
TrpIle: 2.892 ± 0.413
TrpLys: 1.285 ± 0.731
TrpLeu: 0.321 ± 0.332
TrpMet: 0.0 ± 0.0
TrpAsn: 1.607 ± 0.4
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.607 ± 0.115
TrpSer: 0.643 ± 0.149
TrpThr: 1.285 ± 0.217
TrpVal: 0.321 ± 0.183
TrpTrp: 0.321 ± 0.183
TrpTyr: 1.285 ± 0.217
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.928 ± 1.097
TyrCys: 1.607 ± 0.115
TyrAsp: 2.571 ± 0.948
TyrGlu: 1.928 ± 0.582
TyrPhe: 2.249 ± 0.264
TyrGly: 1.285 ± 0.731
TyrHis: 0.964 ± 0.034
TyrIle: 1.285 ± 0.813
TyrLys: 2.249 ± 0.264
TyrLeu: 3.535 ± 0.562
TyrMet: 1.607 ± 0.115
TyrAsn: 1.928 ± 0.068
TyrPro: 2.571 ± 0.081
TyrGln: 0.321 ± 0.183
TyrArg: 1.607 ± 0.115
TyrSer: 1.928 ± 0.447
TyrThr: 2.571 ± 1.111
TyrVal: 0.964 ± 0.034
TyrTrp: 0.321 ± 0.183
TyrTyr: 0.964 ± 0.034
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (3113 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
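One way such a protein-level bootstrap could work is sketched below in Python. The details are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the population standard deviation of the replicate estimates. The names `pair_counts` and `bootstrap_error` are hypothetical, and the procedure actually used by Proteome-pI may differ.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement and recomputes the frequency from the pooled counts.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return pstdev(estimates)


# Toy inputs (hypothetical sequences).
same = bootstrap_error(["AAAA", "AAAA"], "AA")
mixed = bootstrap_error(["AAAA", "GGGG"], "AA")
```

With only two proteins, as in this proteome, each replicate can take very few distinct values, so error estimates obtained this way are coarse.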

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski