Amino acid dipepetide frequency for Wuchang Cockroach Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.199AlaAla: 1.199 ± 0.325
2.397AlaCys: 2.397 ± 0.434
2.697AlaAsp: 2.697 ± 0.664
1.199AlaGlu: 1.199 ± 0.638
1.199AlaPhe: 1.199 ± 0.486
2.098AlaGly: 2.098 ± 1.995
1.199AlaHis: 1.199 ± 0.486
4.795AlaIle: 4.795 ± 1.536
6.593AlaLys: 6.593 ± 0.93
1.798AlaLeu: 1.798 ± 0.213
0.899AlaMet: 0.899 ± 0.263
2.098AlaAsn: 2.098 ± 0.778
1.199AlaPro: 1.199 ± 0.672
1.798AlaGln: 1.798 ± 0.664
3.296AlaArg: 3.296 ± 0.449
4.495AlaSer: 4.495 ± 3.248
3.296AlaThr: 3.296 ± 0.868
1.798AlaVal: 1.798 ± 0.355
0.3AlaTrp: 0.3 ± 0.362
0.3AlaTyr: 0.3 ± 0.557
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.997CysAsp: 2.997 ± 1.264
1.199CysGlu: 1.199 ± 0.338
0.3CysPhe: 0.3 ± 0.362
0.599CysGly: 0.599 ± 0.725
0.3CysHis: 0.3 ± 0.362
0.899CysIle: 0.899 ± 0.263
0.899CysLys: 0.899 ± 0.831
1.798CysLeu: 1.798 ± 0.526
1.498CysMet: 1.498 ± 0.459
1.498CysAsn: 1.498 ± 0.513
0.899CysPro: 0.899 ± 0.458
0.599CysGln: 0.599 ± 0.484
0.3CysArg: 0.3 ± 0.557
0.899CysSer: 0.899 ± 0.263
2.997CysThr: 2.997 ± 1.368
1.199CysVal: 1.199 ± 0.547
0.0CysTrp: 0.0 ± 0.0
1.498CysTyr: 1.498 ± 0.459
0.0CysXaa: 0.0 ± 0.0
Asp
2.997AspAla: 2.997 ± 2.421
1.199AspCys: 1.199 ± 0.338
3.296AspAsp: 3.296 ± 0.765
3.296AspGlu: 3.296 ± 1.0
2.397AspPhe: 2.397 ± 1.309
3.296AspGly: 3.296 ± 1.104
0.599AspHis: 0.599 ± 0.319
5.094AspIle: 5.094 ± 0.532
3.596AspLys: 3.596 ± 0.608
5.694AspLeu: 5.694 ± 2.256
0.899AspMet: 0.899 ± 0.263
4.795AspAsn: 4.795 ± 1.334
0.899AspPro: 0.899 ± 0.478
1.498AspGln: 1.498 ± 0.223
1.798AspArg: 1.798 ± 0.598
4.495AspSer: 4.495 ± 1.255
3.296AspThr: 3.296 ± 0.919
4.795AspVal: 4.795 ± 1.776
0.3AspTrp: 0.3 ± 0.159
3.296AspTyr: 3.296 ± 0.919
0.0AspXaa: 0.0 ± 0.0
Glu
1.498GluAla: 1.498 ± 0.459
0.3GluCys: 0.3 ± 0.362
3.296GluAsp: 3.296 ± 0.886
4.495GluGlu: 4.495 ± 2.392
2.397GluPhe: 2.397 ± 1.383
0.899GluGly: 0.899 ± 0.458
0.899GluHis: 0.899 ± 0.478
5.094GluIle: 5.094 ± 1.153
3.596GluLys: 3.596 ± 0.362
8.69GluLeu: 8.69 ± 1.121
1.798GluMet: 1.798 ± 0.213
4.195GluAsn: 4.195 ± 0.926
1.199GluPro: 1.199 ± 0.338
3.596GluGln: 3.596 ± 1.015
2.697GluArg: 2.697 ± 0.423
5.094GluSer: 5.094 ± 1.494
3.896GluThr: 3.896 ± 1.078
1.798GluVal: 1.798 ± 0.957
0.599GluTrp: 0.599 ± 0.319
4.495GluTyr: 4.495 ± 1.219
0.0GluXaa: 0.0 ± 0.0
Phe
1.498PheAla: 1.498 ± 1.512
0.599PheCys: 0.599 ± 0.319
2.697PheAsp: 2.697 ± 1.191
2.098PheGlu: 2.098 ± 0.781
0.3PhePhe: 0.3 ± 0.159
1.498PheGly: 1.498 ± 0.89
0.3PheHis: 0.3 ± 0.362
3.596PheIle: 3.596 ± 0.362
2.098PheLys: 2.098 ± 0.707
3.596PheLeu: 3.596 ± 0.608
1.798PheMet: 1.798 ± 0.213
2.397PheAsn: 2.397 ± 0.544
0.899PhePro: 0.899 ± 0.831
1.498PheGln: 1.498 ± 0.513
2.098PheArg: 2.098 ± 0.199
2.397PheSer: 2.397 ± 0.434
4.495PheThr: 4.495 ± 1.224
1.798PheVal: 1.798 ± 0.213
0.599PheTrp: 0.599 ± 0.274
0.599PheTyr: 0.599 ± 0.274
0.0PheXaa: 0.0 ± 0.0
Gly
2.397GlyAla: 2.397 ± 1.383
1.798GlyCys: 1.798 ± 0.526
2.397GlyAsp: 2.397 ± 0.062
2.697GlyGlu: 2.697 ± 1.048
0.899GlyPhe: 0.899 ± 0.831
2.098GlyGly: 2.098 ± 1.409
2.098GlyHis: 2.098 ± 2.638
4.195GlyIle: 4.195 ± 2.188
4.195GlyLys: 4.195 ± 0.595
4.795GlyLeu: 4.795 ± 1.362
1.498GlyMet: 1.498 ± 0.223
2.397GlyAsn: 2.397 ± 0.544
0.599GlyPro: 0.599 ± 0.484
1.498GlyGln: 1.498 ± 2.192
0.3GlyArg: 0.3 ± 0.159
1.498GlySer: 1.498 ± 0.513
1.199GlyThr: 1.199 ± 0.547
1.798GlyVal: 1.798 ± 0.827
0.599GlyTrp: 0.599 ± 0.319
2.997GlyTyr: 2.997 ± 0.918
0.0GlyXaa: 0.0 ± 0.0
His
0.899HisAla: 0.899 ± 0.478
0.599HisCys: 0.599 ± 0.274
0.899HisAsp: 0.899 ± 0.478
1.498HisGlu: 1.498 ± 0.223
0.0HisPhe: 0.0 ± 0.0
1.798HisGly: 1.798 ± 0.778
0.599HisHis: 0.599 ± 0.608
0.599HisIle: 0.599 ± 0.319
0.899HisLys: 0.899 ± 0.478
2.997HisLeu: 2.997 ± 0.918
1.199HisMet: 1.199 ± 0.156
0.3HisAsn: 0.3 ± 0.159
1.199HisPro: 1.199 ± 0.338
1.498HisGln: 1.498 ± 0.459
0.599HisArg: 0.599 ± 0.319
2.997HisSer: 2.997 ± 1.146
0.899HisThr: 0.899 ± 0.263
1.498HisVal: 1.498 ± 1.698
0.599HisTrp: 0.599 ± 0.725
2.697HisTyr: 2.697 ± 0.79
0.0HisXaa: 0.0 ± 0.0
Ile
4.495IleAla: 4.495 ± 0.837
0.899IleCys: 0.899 ± 0.622
3.296IleAsp: 3.296 ± 0.31
5.094IleGlu: 5.094 ± 1.463
2.397IlePhe: 2.397 ± 1.526
2.997IleGly: 2.997 ± 0.785
2.397IleHis: 2.397 ± 0.768
4.495IleIle: 4.495 ± 1.959
4.795IleLys: 4.795 ± 1.865
6.293IleLeu: 6.293 ± 1.158
3.596IleMet: 3.596 ± 1.16
8.99IleAsn: 8.99 ± 0.672
2.098IlePro: 2.098 ± 0.682
2.997IleGln: 2.997 ± 0.918
2.098IleArg: 2.098 ± 0.745
4.495IleSer: 4.495 ± 1.085
4.495IleThr: 4.495 ± 0.735
5.694IleVal: 5.694 ± 1.787
0.599IleTrp: 0.599 ± 0.725
3.296IleTyr: 3.296 ± 0.684
0.0IleXaa: 0.0 ± 0.0
Lys
3.296LysAla: 3.296 ± 2.176
1.798LysCys: 1.798 ± 1.244
4.495LysAsp: 4.495 ± 0.873
4.495LysGlu: 4.495 ± 1.219
3.296LysPhe: 3.296 ± 1.027
2.098LysGly: 2.098 ± 0.778
3.296LysHis: 3.296 ± 1.054
6.293LysIle: 6.293 ± 0.626
6.892LysLys: 6.892 ± 1.925
7.492LysLeu: 7.492 ± 0.171
1.498LysMet: 1.498 ± 0.56
3.596LysAsn: 3.596 ± 0.323
3.596LysPro: 3.596 ± 1.628
2.098LysGln: 2.098 ± 1.409
2.997LysArg: 2.997 ± 0.699
3.596LysSer: 3.596 ± 1.015
6.892LysThr: 6.892 ± 0.791
5.993LysVal: 5.993 ± 1.098
0.3LysTrp: 0.3 ± 0.159
4.795LysTyr: 4.795 ± 0.883
0.0LysXaa: 0.0 ± 0.0
Leu
6.892LeuAla: 6.892 ± 0.791
1.199LeuCys: 1.199 ± 0.338
7.192LeuAsp: 7.192 ± 1.512
4.495LeuGlu: 4.495 ± 0.731
2.098LeuPhe: 2.098 ± 1.16
5.094LeuGly: 5.094 ± 1.012
2.697LeuHis: 2.697 ± 0.58
6.593LeuIle: 6.593 ± 1.615
6.593LeuLys: 6.593 ± 1.202
9.29LeuLeu: 9.29 ± 2.085
2.697LeuMet: 2.697 ± 0.138
5.394LeuAsn: 5.394 ± 1.159
4.195LeuPro: 4.195 ± 0.595
3.896LeuGln: 3.896 ± 0.966
3.596LeuArg: 3.596 ± 1.196
7.192LeuSer: 7.192 ± 0.815
6.892LeuThr: 6.892 ± 0.79
4.795LeuVal: 4.795 ± 1.244
0.899LeuTrp: 0.899 ± 0.263
4.195LeuTyr: 4.195 ± 1.491
0.0LeuXaa: 0.0 ± 0.0
Met
0.899MetAla: 0.899 ± 0.622
0.3MetCys: 0.3 ± 0.362
1.798MetAsp: 1.798 ± 0.664
2.098MetGlu: 2.098 ± 1.409
2.397MetPhe: 2.397 ± 0.677
0.899MetGly: 0.899 ± 0.622
1.498MetHis: 1.498 ± 0.56
2.098MetIle: 2.098 ± 1.116
0.899MetLys: 0.899 ± 0.458
4.495MetLeu: 4.495 ± 0.731
0.0MetMet: 0.0 ± 0.0
0.899MetAsn: 0.899 ± 0.263
1.798MetPro: 1.798 ± 0.213
0.899MetGln: 0.899 ± 0.478
1.798MetArg: 1.798 ± 0.598
1.798MetSer: 1.798 ± 0.916
1.498MetThr: 1.498 ± 0.223
1.199MetVal: 1.199 ± 0.98
0.0MetTrp: 0.0 ± 0.0
2.098MetTyr: 2.098 ± 0.585
0.0MetXaa: 0.0 ± 0.0
Asn
1.498AsnAla: 1.498 ± 0.459
2.098AsnCys: 2.098 ± 0.778
3.596AsnAsp: 3.596 ± 1.329
5.394AsnGlu: 5.394 ± 2.101
2.997AsnPhe: 2.997 ± 1.368
3.596AsnGly: 3.596 ± 2.017
0.599AsnHis: 0.599 ± 0.274
5.394AsnIle: 5.394 ± 1.187
5.993AsnLys: 5.993 ± 0.67
6.293AsnLeu: 6.293 ± 1.158
1.498AsnMet: 1.498 ± 0.513
5.094AsnAsn: 5.094 ± 1.034
3.296AsnPro: 3.296 ± 0.31
1.199AsnGln: 1.199 ± 0.638
4.495AsnArg: 4.495 ± 1.365
4.795AsnSer: 4.795 ± 1.334
2.098AsnThr: 2.098 ± 0.931
4.495AsnVal: 4.495 ± 1.255
1.498AsnTrp: 1.498 ± 0.459
2.697AsnTyr: 2.697 ± 1.048
0.0AsnXaa: 0.0 ± 0.0
Pro
1.498ProAla: 1.498 ± 0.513
0.899ProCys: 0.899 ± 0.46
1.498ProAsp: 1.498 ± 0.56
2.997ProGlu: 2.997 ± 0.447
2.397ProPhe: 2.397 ± 0.062
1.199ProGly: 1.199 ± 0.547
0.3ProHis: 0.3 ± 0.362
2.997ProIle: 2.997 ± 0.918
2.997ProLys: 2.997 ± 1.09
2.098ProLeu: 2.098 ± 0.707
1.498ProMet: 1.498 ± 0.513
3.596ProAsn: 3.596 ± 1.053
1.498ProPro: 1.498 ± 0.513
1.199ProGln: 1.199 ± 0.984
1.199ProArg: 1.199 ± 0.638
3.896ProSer: 3.896 ± 1.078
2.397ProThr: 2.397 ± 0.922
2.098ProVal: 2.098 ± 0.199
0.0ProTrp: 0.0 ± 0.0
2.098ProTyr: 2.098 ± 0.199
0.0ProXaa: 0.0 ± 0.0
Gln
3.896GlnAla: 3.896 ± 0.498
0.599GlnCys: 0.599 ± 0.319
2.098GlnAsp: 2.098 ± 0.931
1.199GlnGlu: 1.199 ± 0.638
1.498GlnPhe: 1.498 ± 0.929
0.899GlnGly: 0.899 ± 0.478
0.899GlnHis: 0.899 ± 0.478
2.098GlnIle: 2.098 ± 0.199
2.098GlnLys: 2.098 ± 0.931
4.795GlnLeu: 4.795 ± 1.672
0.899GlnMet: 0.899 ± 0.478
1.199GlnAsn: 1.199 ± 0.638
1.199GlnPro: 1.199 ± 1.129
1.199GlnGln: 1.199 ± 0.338
1.498GlnArg: 1.498 ± 0.513
2.098GlnSer: 2.098 ± 1.334
3.896GlnThr: 3.896 ± 1.025
2.397GlnVal: 2.397 ± 0.544
0.0GlnTrp: 0.0 ± 0.0
1.498GlnTyr: 1.498 ± 1.512
0.0GlnXaa: 0.0 ± 0.0
Arg
1.798ArgAla: 1.798 ± 0.213
0.599ArgCys: 0.599 ± 1.114
1.798ArgAsp: 1.798 ± 0.664
2.697ArgGlu: 2.697 ± 0.138
2.098ArgPhe: 2.098 ± 0.199
2.997ArgGly: 2.997 ± 0.623
1.498ArgHis: 1.498 ± 0.797
2.397ArgIle: 2.397 ± 0.434
2.697ArgLys: 2.697 ± 1.051
5.394ArgLeu: 5.394 ± 1.989
1.498ArgMet: 1.498 ± 0.223
2.397ArgAsn: 2.397 ± 1.276
2.098ArgPro: 2.098 ± 0.778
1.199ArgGln: 1.199 ± 0.486
1.498ArgArg: 1.498 ± 0.513
1.798ArgSer: 1.798 ± 0.778
3.596ArgThr: 3.596 ± 0.425
2.098ArgVal: 2.098 ± 0.302
0.0ArgTrp: 0.0 ± 0.0
1.199ArgTyr: 1.199 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
2.697SerAla: 2.697 ± 1.808
2.098SerCys: 2.098 ± 0.682
3.296SerAsp: 3.296 ± 0.31
5.394SerGlu: 5.394 ± 0.416
1.498SerPhe: 1.498 ± 0.513
3.296SerGly: 3.296 ± 2.375
1.798SerHis: 1.798 ± 0.921
6.293SerIle: 6.293 ± 1.075
5.993SerLys: 5.993 ± 1.0
5.993SerLeu: 5.993 ± 1.57
2.098SerMet: 2.098 ± 0.745
5.094SerAsn: 5.094 ± 1.034
2.098SerPro: 2.098 ± 0.302
3.596SerGln: 3.596 ± 0.425
5.094SerArg: 5.094 ± 1.625
4.195SerSer: 4.195 ± 0.928
3.896SerThr: 3.896 ± 0.553
2.997SerVal: 2.997 ± 0.447
0.599SerTrp: 0.599 ± 0.319
1.798SerTyr: 1.798 ± 1.244
0.0SerXaa: 0.0 ± 0.0
Thr
1.498ThrAla: 1.498 ± 0.513
0.899ThrCys: 0.899 ± 0.263
4.495ThrAsp: 4.495 ± 0.248
2.997ThrGlu: 2.997 ± 0.292
3.296ThrPhe: 3.296 ± 1.756
1.498ThrGly: 1.498 ± 0.89
1.199ThrHis: 1.199 ± 0.547
6.593ThrIle: 6.593 ± 2.383
6.293ThrLys: 6.293 ± 0.626
5.094ThrLeu: 5.094 ± 1.494
1.798ThrMet: 1.798 ± 0.778
4.795ThrAsn: 4.795 ± 1.155
3.296ThrPro: 3.296 ± 1.646
2.697ThrGln: 2.697 ± 0.534
2.697ThrArg: 2.697 ± 1.036
3.596ThrSer: 3.596 ± 1.176
5.094ThrThr: 5.094 ± 2.383
5.094ThrVal: 5.094 ± 1.153
0.599ThrTrp: 0.599 ± 0.484
3.896ThrTyr: 3.896 ± 1.279
0.0ThrXaa: 0.0 ± 0.0
Val
2.697ValAla: 2.697 ± 1.375
0.3ValCys: 0.3 ± 0.362
2.697ValAsp: 2.697 ± 0.534
3.896ValGlu: 3.896 ± 0.498
2.098ValPhe: 2.098 ± 1.409
2.697ValGly: 2.697 ± 1.048
1.498ValHis: 1.498 ± 0.459
2.098ValIle: 2.098 ± 0.778
5.993ValLys: 5.993 ± 0.333
4.495ValLeu: 4.495 ± 0.248
1.199ValMet: 1.199 ± 0.338
3.596ValAsn: 3.596 ± 1.121
3.596ValPro: 3.596 ± 0.425
1.498ValGln: 1.498 ± 0.89
2.098ValArg: 2.098 ± 0.199
6.593ValSer: 6.593 ± 0.621
3.296ValThr: 3.296 ± 1.147
2.997ValVal: 2.997 ± 0.731
0.599ValTrp: 0.599 ± 0.319
2.397ValTyr: 2.397 ± 0.768
0.0ValXaa: 0.0 ± 0.0
Trp
0.599TrpAla: 0.599 ± 0.319
0.0TrpCys: 0.0 ± 0.0
0.599TrpAsp: 0.599 ± 0.319
0.599TrpGlu: 0.599 ± 0.725
0.3TrpPhe: 0.3 ± 0.362
0.3TrpGly: 0.3 ± 0.159
0.0TrpHis: 0.0 ± 0.0
1.199TrpIle: 1.199 ± 0.325
0.599TrpLys: 0.599 ± 0.274
0.599TrpLeu: 0.599 ± 0.274
0.3TrpMet: 0.3 ± 0.159
1.199TrpAsn: 1.199 ± 0.547
0.3TrpPro: 0.3 ± 0.159
0.0TrpGln: 0.0 ± 0.0
0.599TrpArg: 0.599 ± 0.319
0.899TrpSer: 0.899 ± 0.263
0.0TrpThr: 0.0 ± 0.0
0.599TrpVal: 0.599 ± 0.319
0.3TrpTrp: 0.3 ± 0.362
0.3TrpTyr: 0.3 ± 0.159
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.397TyrAla: 2.397 ± 0.062
1.798TyrCys: 1.798 ± 1.244
2.397TyrAsp: 2.397 ± 0.897
2.697TyrGlu: 2.697 ± 0.79
2.997TyrPhe: 2.997 ± 1.595
2.098TyrGly: 2.098 ± 0.745
0.899TyrHis: 0.899 ± 0.263
2.697TyrIle: 2.697 ± 1.051
5.094TyrLys: 5.094 ± 1.107
4.195TyrLeu: 4.195 ± 0.153
0.899TyrMet: 0.899 ± 0.458
5.094TyrAsn: 5.094 ± 0.532
2.397TyrPro: 2.397 ± 0.677
1.498TyrGln: 1.498 ± 0.797
0.599TyrArg: 0.599 ± 0.319
2.997TyrSer: 2.997 ± 1.264
3.296TyrThr: 3.296 ± 1.625
1.199TyrVal: 1.199 ± 0.547
0.899TyrTrp: 0.899 ± 0.263
0.899TyrTyr: 0.899 ± 0.263
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3338 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski