Amino acid dipeptide frequency for Wuchang Cockroach Virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
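The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names `dipeptide_per_mille` and `classify` are hypothetical, overlapping dipeptides are assumed to be counted within each protein (never across protein boundaries), and the 2.5‰ threshold is simply 1/400; the exact rule used for the red and blue marking is not stated on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Any character outside the 20 standard amino acids is mapped to 'X' (Xaa).
    Dipeptides are counted with overlap and only within a single protein.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=2.5):
    """Label a frequency relative to the uniform expectation (assumed 1/400 = 2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "equal"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAC", "ACX"])
    for dp in sorted(freqs):
        print(dp, round(freqs[dp], 3), classify(freqs[dp]))
```

With real data, the input would be the protein sequences of the proteome, and the resulting values could be compared with the table entries.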

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
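As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate number of occurrences. The helper names are hypothetical, and the total of 3291 dipeptides is an assumption: it follows from the 3 proteins and 3294 amino acids reported for this proteome only if overlapping dipeptides are counted within each protein.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def expected_count(value_per_mille, n_dipeptides):
    """Approximate number of occurrences behind a per mille frequency."""
    return per_mille_to_fraction(value_per_mille) * n_dipeptides


# Assumption: each protein of length L contributes L - 1 overlapping dipeptides,
# so 3 proteins with 3294 residues in total give 3294 - 3 dipeptides.
N_DIPEPTIDES = 3294 - 3

if __name__ == "__main__":
    for value in (0.304, 3.34, 10.021):
        print(value, per_mille_to_fraction(value), round(expected_count(value, N_DIPEPTIDES)))
```

Under that assumption, the smallest non-zero value in the table (0.304‰) corresponds to a dipeptide seen once.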

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.34 ± 1.963
AlaCys: 2.429 ± 0.35
AlaAsp: 2.429 ± 1.237
AlaGlu: 2.429 ± 0.35
AlaPhe: 2.733 ± 1.043
AlaGly: 3.644 ± 0.761
AlaHis: 2.429 ± 0.893
AlaIle: 6.073 ± 0.302
AlaLys: 2.429 ± 0.736
AlaLeu: 7.288 ± 1.638
AlaMet: 3.037 ± 0.933
AlaAsn: 4.251 ± 1.009
AlaPro: 1.822 ± 0.618
AlaGln: 3.34 ± 0.688
AlaArg: 3.34 ± 1.674
AlaSer: 4.859 ± 0.901
AlaThr: 3.948 ± 0.359
AlaVal: 2.733 ± 1.018
AlaTrp: 0.304 ± 0.166
AlaTyr: 3.948 ± 0.855
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.911 ± 0.413
CysCys: 0.607 ± 0.332
CysAsp: 1.518 ± 0.431
CysGlu: 0.0 ± 0.0
CysPhe: 0.304 ± 0.166
CysGly: 0.304 ± 0.404
CysHis: 0.304 ± 0.166
CysIle: 1.518 ± 0.431
CysLys: 1.215 ± 0.663
CysLeu: 1.822 ± 0.995
CysMet: 0.0 ± 0.0
CysAsn: 0.911 ± 0.254
CysPro: 1.215 ± 0.312
CysGln: 0.607 ± 0.294
CysArg: 1.215 ± 0.588
CysSer: 0.607 ± 0.808
CysThr: 1.215 ± 0.663
CysVal: 1.822 ± 1.04
CysTrp: 0.304 ± 0.166
CysTyr: 0.911 ± 0.254
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.429 ± 0.611
AspCys: 1.518 ± 0.976
AspAsp: 2.429 ± 0.193
AspGlu: 3.34 ± 0.825
AspPhe: 1.822 ± 0.159
AspGly: 2.429 ± 0.193
AspHis: 1.518 ± 0.431
AspIle: 3.037 ± 0.663
AspLys: 3.644 ± 0.349
AspLeu: 5.162 ± 1.185
AspMet: 0.911 ± 0.54
AspAsn: 1.215 ± 1.052
AspPro: 3.948 ± 1.364
AspGln: 1.822 ± 2.001
AspArg: 3.037 ± 0.487
AspSer: 1.822 ± 0.508
AspThr: 3.34 ± 1.014
AspVal: 4.251 ± 0.96
AspTrp: 0.607 ± 0.458
AspTyr: 3.34 ± 1.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.037 ± 0.999
GluCys: 0.304 ± 0.166
GluAsp: 3.037 ± 1.197
GluGlu: 4.555 ± 0.619
GluPhe: 1.518 ± 0.524
GluGly: 3.948 ± 0.983
GluHis: 1.822 ± 0.573
GluIle: 4.555 ± 0.603
GluLys: 2.733 ± 0.503
GluLeu: 5.466 ± 1.007
GluMet: 1.215 ± 0.588
GluAsn: 3.34 ± 1.707
GluPro: 2.126 ± 0.726
GluGln: 1.518 ± 0.829
GluArg: 3.644 ± 0.504
GluSer: 3.948 ± 1.043
GluThr: 3.037 ± 0.471
GluVal: 4.555 ± 0.997
GluTrp: 1.822 ± 1.463
GluTyr: 2.733 ± 0.734
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.429 ± 1.175
PheCys: 0.607 ± 0.332
PheAsp: 2.126 ± 0.828
PheGlu: 0.911 ± 0.497
PhePhe: 0.304 ± 0.404
PheGly: 1.822 ± 0.793
PheHis: 0.304 ± 0.166
PheIle: 2.733 ± 1.492
PheLys: 1.518 ± 1.455
PheLeu: 4.251 ± 0.766
PheMet: 1.518 ± 0.431
PheAsn: 1.215 ± 0.663
PhePro: 2.126 ± 1.345
PheGln: 0.911 ± 0.254
PheArg: 2.733 ± 0.734
PheSer: 3.34 ± 0.343
PheThr: 0.911 ± 0.254
PheVal: 2.733 ± 0.096
PheTrp: 0.0 ± 0.0
PheTyr: 1.518 ± 1.455
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.518 ± 0.431
GlyCys: 0.304 ± 0.166
GlyAsp: 1.822 ± 0.509
GlyGlu: 2.733 ± 0.622
GlyPhe: 3.037 ± 0.487
GlyGly: 1.822 ± 0.508
GlyHis: 1.215 ± 0.312
GlyIle: 6.073 ± 0.378
GlyLys: 2.429 ± 1.175
GlyLeu: 4.251 ± 0.863
GlyMet: 0.911 ± 0.54
GlyAsn: 3.644 ± 0.318
GlyPro: 1.215 ± 0.837
GlyGln: 2.126 ± 0.828
GlyArg: 1.518 ± 0.917
GlySer: 3.037 ± 0.853
GlyThr: 2.733 ± 0.622
GlyVal: 4.555 ± 1.516
GlyTrp: 0.607 ± 0.7
GlyTyr: 3.34 ± 0.873
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.644 ± 1.146
HisCys: 1.215 ± 0.588
HisAsp: 2.429 ± 0.611
HisGlu: 1.518 ± 0.829
HisPhe: 1.215 ± 0.431
HisGly: 1.215 ± 0.588
HisHis: 1.822 ± 0.995
HisIle: 3.948 ± 1.152
HisLys: 1.215 ± 0.663
HisLeu: 4.859 ± 0.762
HisMet: 0.0 ± 0.0
HisAsn: 1.215 ± 0.663
HisPro: 2.126 ± 0.347
HisGln: 1.822 ± 0.509
HisArg: 1.518 ± 0.431
HisSer: 0.911 ± 0.687
HisThr: 2.733 ± 1.043
HisVal: 1.215 ± 0.385
HisTrp: 0.607 ± 0.458
HisTyr: 1.215 ± 0.312
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.377 ± 0.89
IleCys: 1.215 ± 0.663
IleAsp: 2.126 ± 0.75
IleGlu: 5.162 ± 1.573
IlePhe: 1.215 ± 0.663
IleGly: 3.644 ± 0.761
IleHis: 3.34 ± 0.343
IleIle: 3.644 ± 1.154
IleLys: 3.037 ± 1.204
IleLeu: 6.681 ± 1.649
IleMet: 3.037 ± 0.663
IleAsn: 2.733 ± 0.596
IlePro: 3.948 ± 0.393
IleGln: 3.948 ± 2.465
IleArg: 4.555 ± 1.098
IleSer: 4.251 ± 2.082
IleThr: 2.733 ± 0.572
IleVal: 4.859 ± 1.351
IleTrp: 0.607 ± 0.294
IleTyr: 5.162 ± 1.573
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.733 ± 0.762
LysCys: 0.607 ± 0.294
LysAsp: 1.518 ± 0.243
LysGlu: 4.251 ± 0.695
LysPhe: 2.733 ± 0.096
LysGly: 1.518 ± 0.431
LysHis: 1.215 ± 0.312
LysIle: 3.037 ± 0.189
LysLys: 1.822 ± 0.159
LysLeu: 6.681 ± 1.265
LysMet: 1.215 ± 0.663
LysAsn: 1.822 ± 0.995
LysPro: 1.215 ± 0.663
LysGln: 0.911 ± 0.497
LysArg: 1.822 ± 0.618
LysSer: 3.037 ± 1.344
LysThr: 4.251 ± 1.089
LysVal: 3.34 ± 0.851
LysTrp: 0.911 ± 0.497
LysTyr: 2.733 ± 0.734
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.77 ± 1.658
LeuCys: 0.304 ± 0.166
LeuAsp: 6.985 ± 1.689
LeuGlu: 6.073 ± 1.325
LeuPhe: 2.429 ± 1.78
LeuGly: 4.859 ± 1.392
LeuHis: 3.644 ± 0.504
LeuIle: 6.377 ± 1.522
LeuLys: 5.466 ± 1.492
LeuLeu: 10.021 ± 1.128
LeuMet: 1.518 ± 0.431
LeuAsn: 3.644 ± 1.017
LeuPro: 4.251 ± 0.198
LeuGln: 5.162 ± 0.913
LeuArg: 6.985 ± 1.447
LeuSer: 7.288 ± 1.008
LeuThr: 3.948 ± 1.465
LeuVal: 5.162 ± 0.256
LeuTrp: 0.911 ± 0.687
LeuTyr: 4.251 ± 1.231
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.126 ± 0.688
MetCys: 0.607 ± 0.332
MetAsp: 1.822 ± 0.995
MetGlu: 1.518 ± 0.431
MetPhe: 0.304 ± 0.166
MetGly: 0.911 ± 0.497
MetHis: 1.518 ± 0.524
MetIle: 0.607 ± 0.332
MetLys: 1.215 ± 0.663
MetLeu: 3.037 ± 0.189
MetMet: 0.304 ± 0.404
MetAsn: 1.518 ± 0.856
MetPro: 0.911 ± 0.54
MetGln: 0.607 ± 0.294
MetArg: 0.607 ± 0.332
MetSer: 1.518 ± 0.431
MetThr: 3.34 ± 1.181
MetVal: 2.126 ± 0.545
MetTrp: 0.0 ± 0.0
MetTyr: 0.607 ± 0.332
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.555 ± 1.098
AsnCys: 0.0 ± 0.0
AsnAsp: 1.215 ± 0.431
AsnGlu: 3.037 ± 0.471
AsnPhe: 2.429 ± 1.326
AsnGly: 1.215 ± 1.401
AsnHis: 2.126 ± 1.31
AsnIle: 3.34 ± 1.181
AsnLys: 2.126 ± 0.726
AsnLeu: 3.644 ± 1.154
AsnMet: 1.518 ± 0.506
AsnAsn: 2.429 ± 1.264
AsnPro: 0.911 ± 0.497
AsnGln: 1.822 ± 1.463
AsnArg: 1.822 ± 1.04
AsnSer: 3.948 ± 0.855
AsnThr: 3.34 ± 0.376
AsnVal: 4.555 ± 1.481
AsnTrp: 0.607 ± 0.458
AsnTyr: 1.822 ± 0.995
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.859 ± 1.351
ProCys: 1.518 ± 0.431
ProAsp: 1.822 ± 0.573
ProGlu: 2.126 ± 0.828
ProPhe: 1.822 ± 0.159
ProGly: 1.822 ± 0.881
ProHis: 1.215 ± 0.385
ProIle: 4.555 ± 1.16
ProLys: 1.822 ± 0.995
ProLeu: 2.733 ± 0.925
ProMet: 1.215 ± 0.312
ProAsn: 3.037 ± 0.471
ProPro: 2.126 ± 0.75
ProGln: 1.822 ± 0.159
ProArg: 2.429 ± 0.769
ProSer: 3.34 ± 0.873
ProThr: 2.126 ± 0.215
ProVal: 2.733 ± 0.762
ProTrp: 0.607 ± 0.332
ProTyr: 2.733 ± 0.734
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.644 ± 2.143
GlnCys: 0.607 ± 0.808
GlnAsp: 1.822 ± 0.159
GlnGlu: 2.429 ± 0.625
GlnPhe: 1.518 ± 0.243
GlnGly: 3.644 ± 2.161
GlnHis: 1.518 ± 0.829
GlnIle: 1.518 ± 0.431
GlnLys: 1.518 ± 0.524
GlnLeu: 3.34 ± 0.873
GlnMet: 0.911 ± 0.497
GlnAsn: 1.822 ± 1.08
GlnPro: 1.215 ± 0.588
GlnGln: 2.733 ± 2.026
GlnArg: 1.822 ± 0.618
GlnSer: 1.215 ± 0.588
GlnThr: 3.037 ± 0.79
GlnVal: 2.429 ± 0.625
GlnTrp: 0.911 ± 1.001
GlnTyr: 0.911 ± 1.001
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.037 ± 1.012
ArgCys: 1.215 ± 0.312
ArgAsp: 3.948 ± 1.364
ArgGlu: 3.037 ± 0.58
ArgPhe: 3.037 ± 1.712
ArgGly: 3.644 ± 0.925
ArgHis: 1.822 ± 0.995
ArgIle: 4.555 ± 0.997
ArgLys: 4.555 ± 1.292
ArgLeu: 4.555 ± 1.516
ArgMet: 0.607 ± 0.332
ArgAsn: 2.733 ± 0.572
ArgPro: 2.126 ± 0.545
ArgGln: 0.911 ± 0.254
ArgArg: 3.644 ± 0.318
ArgSer: 2.733 ± 0.503
ArgThr: 2.126 ± 0.215
ArgVal: 3.948 ± 0.34
ArgTrp: 0.304 ± 0.166
ArgTyr: 3.948 ± 1.043
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.34 ± 1.018
SerCys: 2.429 ± 0.625
SerAsp: 3.34 ± 0.873
SerGlu: 3.644 ± 1.146
SerPhe: 2.126 ± 0.726
SerGly: 3.037 ± 1.877
SerHis: 2.733 ± 0.762
SerIle: 3.948 ± 0.359
SerLys: 3.644 ± 0.349
SerLeu: 4.251 ± 0.863
SerMet: 1.822 ± 2.121
SerAsn: 3.644 ± 1.512
SerPro: 3.644 ± 1.332
SerGln: 1.518 ± 0.672
SerArg: 1.518 ± 0.829
SerSer: 1.822 ± 0.573
SerThr: 4.555 ± 1.078
SerVal: 3.34 ± 0.38
SerTrp: 0.911 ± 0.54
SerTyr: 3.34 ± 1.367
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.251 ± 0.863
ThrCys: 0.607 ± 0.458
ThrAsp: 3.037 ± 1.463
ThrGlu: 3.34 ± 0.873
ThrPhe: 2.429 ± 0.862
ThrGly: 1.822 ± 0.573
ThrHis: 3.34 ± 0.376
ThrIle: 3.34 ± 1.403
ThrLys: 3.037 ± 1.344
ThrLeu: 4.555 ± 0.731
ThrMet: 0.911 ± 0.497
ThrAsn: 1.822 ± 0.159
ThrPro: 4.859 ± 0.901
ThrGln: 3.037 ± 0.471
ThrArg: 4.251 ± 0.766
ThrSer: 4.251 ± 1.834
ThrThr: 4.251 ± 0.695
ThrVal: 3.948 ± 1.005
ThrTrp: 1.215 ± 0.312
ThrTyr: 5.466 ± 0.801
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.948 ± 0.832
ValCys: 0.0 ± 0.0
ValAsp: 4.251 ± 0.198
ValGlu: 4.555 ± 1.059
ValPhe: 1.215 ± 0.588
ValGly: 2.429 ± 0.769
ValHis: 3.037 ± 1.469
ValIle: 4.555 ± 1.62
ValLys: 1.215 ± 0.385
ValLeu: 7.288 ± 0.041
ValMet: 2.126 ± 1.123
ValAsn: 3.037 ± 0.862
ValPro: 3.34 ± 0.851
ValGln: 2.429 ± 0.193
ValArg: 6.377 ± 0.528
ValSer: 3.948 ± 0.34
ValThr: 5.162 ± 1.165
ValVal: 5.466 ± 0.857
ValTrp: 0.911 ± 0.254
ValTyr: 3.037 ± 0.663
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.518 ± 0.856
TrpCys: 0.304 ± 0.166
TrpAsp: 1.518 ± 0.672
TrpGlu: 0.911 ± 0.54
TrpPhe: 0.304 ± 0.404
TrpGly: 0.607 ± 0.294
TrpHis: 0.304 ± 0.166
TrpIle: 0.0 ± 0.0
TrpLys: 0.911 ± 0.413
TrpLeu: 1.215 ± 0.431
TrpMet: 0.304 ± 0.166
TrpAsn: 0.607 ± 0.458
TrpPro: 0.607 ± 0.294
TrpGln: 0.0 ± 0.0
TrpArg: 0.607 ± 0.332
TrpSer: 0.911 ± 0.54
TrpThr: 0.607 ± 0.7
TrpVal: 0.607 ± 0.332
TrpTrp: 0.304 ± 0.166
TrpTyr: 0.911 ± 0.54
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.644 ± 0.523
TyrCys: 0.911 ± 0.254
TyrAsp: 2.429 ± 0.35
TyrGlu: 3.34 ± 0.343
TyrPhe: 1.518 ± 0.856
TyrGly: 4.555 ± 0.603
TyrHis: 1.215 ± 0.663
TyrIle: 4.859 ± 2.186
TyrLys: 1.822 ± 0.508
TyrLeu: 3.948 ± 1.29
TyrMet: 1.822 ± 0.573
TyrAsn: 1.822 ± 0.509
TyrPro: 2.733 ± 0.734
TyrGln: 1.518 ± 0.431
TyrArg: 3.037 ± 1.197
TyrSer: 1.822 ± 0.508
TyrThr: 6.377 ± 0.84
TyrVal: 3.948 ± 0.34
TyrTrp: 0.607 ± 0.294
TyrTyr: 2.429 ± 0.883
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3294 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
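A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation over 100 resamples is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
from statistics import stdev


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide, counted with overlap within each protein."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKLLAV", "MLLSTA", "MAAGLL"]  # made-up sequences, not the viral proteins
    value = dipeptide_per_mille(toy_proteome, "LL")
    error = bootstrap_error(toy_proteome, "LL")
    print(f"LeuLeu: {value:.3f} ± {error:.3f}")
```

With only three proteins, each resample can differ substantially from the original set, which is consistent with the relatively large errors in the table.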

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski