Amino acid dipepetide frequency for Nephila clavipes virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.473AlaAla: 1.473 ± 0.597
0.368AlaCys: 0.368 ± 0.149
1.842AlaAsp: 1.842 ± 0.155
3.683AlaGlu: 3.683 ± 0.31
2.947AlaPhe: 2.947 ± 0.609
2.578AlaGly: 2.578 ± 1.044
1.105AlaHis: 1.105 ± 0.448
3.315AlaIle: 3.315 ± 0.46
2.21AlaLys: 2.21 ± 0.006
6.998AlaLeu: 6.998 ± 0.77
0.737AlaMet: 0.737 ± 0.298
1.842AlaAsn: 1.842 ± 0.155
2.947AlaPro: 2.947 ± 0.292
1.842AlaGln: 1.842 ± 0.746
2.21AlaArg: 2.21 ± 0.895
2.947AlaSer: 2.947 ± 1.193
2.21AlaThr: 2.21 ± 0.006
2.578AlaVal: 2.578 ± 1.659
0.368AlaTrp: 0.368 ± 0.149
2.947AlaTyr: 2.947 ± 0.609
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.368CysCys: 0.368 ± 0.149
1.105CysAsp: 1.105 ± 1.355
0.737CysGlu: 0.737 ± 0.603
0.737CysPhe: 0.737 ± 0.603
0.737CysGly: 0.737 ± 0.298
0.0CysHis: 0.0 ± 0.0
1.105CysIle: 1.105 ± 0.448
0.737CysLys: 0.737 ± 0.603
2.21CysLeu: 2.21 ± 0.907
0.737CysMet: 0.737 ± 0.603
0.737CysAsn: 0.737 ± 0.603
1.842CysPro: 1.842 ± 0.746
0.737CysGln: 0.737 ± 0.603
0.368CysArg: 0.368 ± 0.149
1.105CysSer: 1.105 ± 0.448
0.0CysThr: 0.0 ± 0.0
0.737CysVal: 0.737 ± 0.603
0.0CysTrp: 0.0 ± 0.0
1.105CysTyr: 1.105 ± 1.355
0.0CysXaa: 0.0 ± 0.0
Asp
1.842AspAla: 1.842 ± 0.155
1.842AspCys: 1.842 ± 1.056
3.683AspAsp: 3.683 ± 1.492
3.683AspGlu: 3.683 ± 0.31
1.473AspPhe: 1.473 ± 0.597
2.21AspGly: 2.21 ± 0.006
1.473AspHis: 1.473 ± 0.597
3.683AspIle: 3.683 ± 0.591
2.21AspLys: 2.21 ± 0.895
8.471AspLeu: 8.471 ± 0.728
1.842AspMet: 1.842 ± 0.155
3.315AspAsn: 3.315 ± 0.46
4.42AspPro: 4.42 ± 0.012
3.315AspGln: 3.315 ± 1.343
3.315AspArg: 3.315 ± 1.343
6.998AspSer: 6.998 ± 1.032
3.315AspThr: 3.315 ± 1.361
3.683AspVal: 3.683 ± 0.591
0.368AspTrp: 0.368 ± 0.149
2.578AspTyr: 2.578 ± 1.044
0.0AspXaa: 0.0 ± 0.0
Glu
3.683GluAla: 3.683 ± 1.492
1.842GluCys: 1.842 ± 1.056
4.788GluAsp: 4.788 ± 3.467
3.315GluGlu: 3.315 ± 0.442
0.368GluPhe: 0.368 ± 0.149
2.947GluGly: 2.947 ± 0.292
1.105GluHis: 1.105 ± 0.448
5.893GluIle: 5.893 ± 0.585
3.683GluLys: 3.683 ± 0.591
4.052GluLeu: 4.052 ± 1.062
2.21GluMet: 2.21 ± 0.895
3.683GluAsn: 3.683 ± 0.591
2.947GluPro: 2.947 ± 0.609
2.578GluGln: 2.578 ± 0.143
3.315GluArg: 3.315 ± 1.343
4.788GluSer: 4.788 ± 0.764
2.578GluThr: 2.578 ± 0.143
3.683GluVal: 3.683 ± 0.591
0.737GluTrp: 0.737 ± 0.298
3.315GluTyr: 3.315 ± 1.343
0.0GluXaa: 0.0 ± 0.0
Phe
1.473PheAla: 1.473 ± 0.597
1.105PheCys: 1.105 ± 0.448
2.947PheAsp: 2.947 ± 1.193
1.473PheGlu: 1.473 ± 0.597
0.368PhePhe: 0.368 ± 0.149
0.737PheGly: 0.737 ± 0.298
2.21PheHis: 2.21 ± 0.895
3.315PheIle: 3.315 ± 0.46
4.052PheLys: 4.052 ± 0.74
2.947PheLeu: 2.947 ± 0.609
0.737PheMet: 0.737 ± 0.298
3.315PheAsn: 3.315 ± 1.361
1.105PhePro: 1.105 ± 1.355
0.737PheGln: 0.737 ± 0.603
1.473PheArg: 1.473 ± 0.597
0.737PheSer: 0.737 ± 0.298
2.578PheThr: 2.578 ± 0.758
1.473PheVal: 1.473 ± 0.597
0.0PheTrp: 0.0 ± 0.0
0.368PheTyr: 0.368 ± 0.752
0.0PheXaa: 0.0 ± 0.0
Gly
1.473GlyAla: 1.473 ± 0.597
0.737GlyCys: 0.737 ± 0.298
4.052GlyAsp: 4.052 ± 1.641
2.947GlyGlu: 2.947 ± 0.609
1.473GlyPhe: 1.473 ± 0.304
1.842GlyGly: 1.842 ± 0.746
1.105GlyHis: 1.105 ± 0.448
3.683GlyIle: 3.683 ± 1.211
2.21GlyLys: 2.21 ± 0.895
4.052GlyLeu: 4.052 ± 1.641
0.737GlyMet: 0.737 ± 0.298
2.21GlyAsn: 2.21 ± 0.895
0.368GlyPro: 0.368 ± 0.149
1.842GlyGln: 1.842 ± 0.746
3.315GlyArg: 3.315 ± 0.442
2.578GlySer: 2.578 ± 1.044
1.842GlyThr: 1.842 ± 0.155
2.578GlyVal: 2.578 ± 1.044
0.368GlyTrp: 0.368 ± 0.149
2.21GlyTyr: 2.21 ± 0.006
0.0GlyXaa: 0.0 ± 0.0
His
1.842HisAla: 1.842 ± 1.957
0.0HisCys: 0.0 ± 0.0
0.737HisAsp: 0.737 ± 0.298
1.105HisGlu: 1.105 ± 0.454
2.578HisPhe: 2.578 ± 1.044
0.368HisGly: 0.368 ± 0.149
0.368HisHis: 0.368 ± 0.752
4.052HisIle: 4.052 ± 0.74
1.842HisLys: 1.842 ± 0.746
2.21HisLeu: 2.21 ± 0.907
1.105HisMet: 1.105 ± 0.448
2.21HisAsn: 2.21 ± 0.006
1.105HisPro: 1.105 ± 0.448
1.105HisGln: 1.105 ± 0.454
1.473HisArg: 1.473 ± 0.597
2.578HisSer: 2.578 ± 0.143
1.473HisThr: 1.473 ± 0.597
0.737HisVal: 0.737 ± 0.298
0.368HisTrp: 0.368 ± 0.149
1.473HisTyr: 1.473 ± 0.304
0.0HisXaa: 0.0 ± 0.0
Ile
4.052IleAla: 4.052 ± 1.062
1.473IleCys: 1.473 ± 1.205
4.788IleAsp: 4.788 ± 1.038
5.525IleGlu: 5.525 ± 0.436
2.21IlePhe: 2.21 ± 0.006
2.947IleGly: 2.947 ± 0.609
4.052IleHis: 4.052 ± 1.062
5.157IleIle: 5.157 ± 3.318
6.998IleLys: 6.998 ± 1.933
9.576IleLeu: 9.576 ± 1.528
1.105IleMet: 1.105 ± 0.454
5.525IleAsn: 5.525 ± 0.465
3.683IlePro: 3.683 ± 0.591
1.842IleGln: 1.842 ± 1.056
4.42IleArg: 4.42 ± 0.012
5.157IleSer: 5.157 ± 0.615
5.525IleThr: 5.525 ± 0.465
2.578IleVal: 2.578 ± 0.143
0.368IleTrp: 0.368 ± 0.149
3.315IleTyr: 3.315 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
2.21LysAla: 2.21 ± 0.895
0.0LysCys: 0.0 ± 0.0
4.788LysAsp: 4.788 ± 1.939
4.788LysGlu: 4.788 ± 1.038
1.842LysPhe: 1.842 ± 0.155
2.947LysGly: 2.947 ± 1.193
2.21LysHis: 2.21 ± 0.006
4.788LysIle: 4.788 ± 1.939
3.315LysLys: 3.315 ± 0.442
6.63LysLeu: 6.63 ± 2.685
1.842LysMet: 1.842 ± 0.746
2.578LysAsn: 2.578 ± 0.143
2.21LysPro: 2.21 ± 1.808
2.21LysGln: 2.21 ± 0.895
2.947LysArg: 2.947 ± 1.193
5.893LysSer: 5.893 ± 0.316
5.157LysThr: 5.157 ± 1.188
6.262LysVal: 6.262 ± 0.734
0.368LysTrp: 0.368 ± 0.752
4.052LysTyr: 4.052 ± 0.74
0.0LysXaa: 0.0 ± 0.0
Leu
4.052LeuAla: 4.052 ± 1.963
0.368LeuCys: 0.368 ± 0.149
4.052LeuAsp: 4.052 ± 1.963
6.262LeuGlu: 6.262 ± 0.167
2.578LeuPhe: 2.578 ± 0.143
5.157LeuGly: 5.157 ± 2.089
3.315LeuHis: 3.315 ± 0.46
7.366LeuIle: 7.366 ± 0.621
5.157LeuLys: 5.157 ± 2.089
9.576LeuLeu: 9.576 ± 5.132
4.052LeuMet: 4.052 ± 0.546
6.262LeuAsn: 6.262 ± 1.635
2.578LeuPro: 2.578 ± 0.758
5.157LeuGln: 5.157 ± 3.318
4.788LeuArg: 4.788 ± 0.764
7.735LeuSer: 7.735 ± 0.43
5.893LeuThr: 5.893 ± 2.118
5.893LeuVal: 5.893 ± 0.585
1.105LeuTrp: 1.105 ± 0.448
6.998LeuTyr: 6.998 ± 0.77
0.0LeuXaa: 0.0 ± 0.0
Met
1.473MetAla: 1.473 ± 0.304
0.368MetCys: 0.368 ± 0.149
2.947MetAsp: 2.947 ± 1.193
1.473MetGlu: 1.473 ± 0.597
0.737MetPhe: 0.737 ± 0.298
0.737MetGly: 0.737 ± 0.298
0.0MetHis: 0.0 ± 0.0
1.105MetIle: 1.105 ± 0.454
1.473MetLys: 1.473 ± 0.304
1.105MetLeu: 1.105 ± 0.448
0.737MetMet: 0.737 ± 0.298
0.737MetAsn: 0.737 ± 0.298
0.737MetPro: 0.737 ± 0.298
1.473MetGln: 1.473 ± 0.597
2.578MetArg: 2.578 ± 0.143
1.842MetSer: 1.842 ± 0.155
0.0MetThr: 0.0 ± 0.0
2.21MetVal: 2.21 ± 0.006
1.105MetTrp: 1.105 ± 0.448
0.368MetTyr: 0.368 ± 0.752
0.0MetXaa: 0.0 ± 0.0
Asn
1.842AsnAla: 1.842 ± 0.746
1.105AsnCys: 1.105 ± 0.448
3.315AsnAsp: 3.315 ± 0.442
2.947AsnGlu: 2.947 ± 1.193
3.315AsnPhe: 3.315 ± 0.442
2.21AsnGly: 2.21 ± 0.895
0.737AsnHis: 0.737 ± 0.298
4.42AsnIle: 4.42 ± 0.012
4.42AsnLys: 4.42 ± 0.889
7.366AsnLeu: 7.366 ± 0.621
1.473AsnMet: 1.473 ± 0.304
2.578AsnAsn: 2.578 ± 0.143
2.578AsnPro: 2.578 ± 1.659
2.947AsnGln: 2.947 ± 0.292
1.842AsnArg: 1.842 ± 1.056
2.578AsnSer: 2.578 ± 1.659
4.42AsnThr: 4.42 ± 0.889
3.315AsnVal: 3.315 ± 0.442
0.0AsnTrp: 0.0 ± 0.0
1.842AsnTyr: 1.842 ± 0.155
0.0AsnXaa: 0.0 ± 0.0
Pro
1.473ProAla: 1.473 ± 0.304
0.737ProCys: 0.737 ± 1.504
2.21ProAsp: 2.21 ± 0.006
2.578ProGlu: 2.578 ± 1.044
0.737ProPhe: 0.737 ± 0.603
3.315ProGly: 3.315 ± 0.442
1.105ProHis: 1.105 ± 0.454
2.578ProIle: 2.578 ± 0.758
5.525ProLys: 5.525 ± 2.238
3.315ProLeu: 3.315 ± 0.442
0.368ProMet: 0.368 ± 0.149
1.842ProAsn: 1.842 ± 0.155
1.473ProPro: 1.473 ± 0.304
1.473ProGln: 1.473 ± 0.304
2.21ProArg: 2.21 ± 0.006
2.578ProSer: 2.578 ± 0.143
1.842ProThr: 1.842 ± 1.957
2.578ProVal: 2.578 ± 0.758
0.0ProTrp: 0.0 ± 0.0
1.473ProTyr: 1.473 ± 0.304
0.0ProXaa: 0.0 ± 0.0
Gln
3.683GlnAla: 3.683 ± 0.591
0.0GlnCys: 0.0 ± 0.0
2.947GlnAsp: 2.947 ± 0.609
1.842GlnGlu: 1.842 ± 0.746
0.737GlnPhe: 0.737 ± 0.298
1.473GlnGly: 1.473 ± 0.597
0.368GlnHis: 0.368 ± 0.149
2.578GlnIle: 2.578 ± 0.143
2.578GlnLys: 2.578 ± 0.143
3.683GlnLeu: 3.683 ± 2.112
0.737GlnMet: 0.737 ± 0.298
1.105GlnAsn: 1.105 ± 0.448
2.947GlnPro: 2.947 ± 0.292
1.473GlnGln: 1.473 ± 0.304
2.947GlnArg: 2.947 ± 0.292
4.052GlnSer: 4.052 ± 1.963
1.842GlnThr: 1.842 ± 0.155
1.105GlnVal: 1.105 ± 0.454
0.0GlnTrp: 0.0 ± 0.0
1.842GlnTyr: 1.842 ± 0.746
0.0GlnXaa: 0.0 ± 0.0
Arg
2.947ArgAla: 2.947 ± 0.609
1.105ArgCys: 1.105 ± 0.454
3.683ArgAsp: 3.683 ± 1.492
3.683ArgGlu: 3.683 ± 0.591
1.842ArgPhe: 1.842 ± 0.155
2.578ArgGly: 2.578 ± 0.143
1.842ArgHis: 1.842 ± 0.746
3.683ArgIle: 3.683 ± 0.591
4.052ArgLys: 4.052 ± 0.74
5.525ArgLeu: 5.525 ± 1.367
0.737ArgMet: 0.737 ± 0.298
3.315ArgAsn: 3.315 ± 0.442
0.737ArgPro: 0.737 ± 0.603
1.105ArgGln: 1.105 ± 0.448
3.683ArgArg: 3.683 ± 1.492
4.42ArgSer: 4.42 ± 0.913
3.315ArgThr: 3.315 ± 0.442
2.578ArgVal: 2.578 ± 1.044
0.368ArgTrp: 0.368 ± 0.752
2.21ArgTyr: 2.21 ± 0.895
0.0ArgXaa: 0.0 ± 0.0
Ser
4.052SerAla: 4.052 ± 0.74
0.737SerCys: 0.737 ± 0.603
5.893SerAsp: 5.893 ± 0.316
2.947SerGlu: 2.947 ± 1.51
2.578SerPhe: 2.578 ± 0.758
2.947SerGly: 2.947 ± 0.292
2.578SerHis: 2.578 ± 0.143
6.63SerIle: 6.63 ± 0.018
4.42SerLys: 4.42 ± 0.889
8.84SerLeu: 8.84 ± 0.925
0.368SerMet: 0.368 ± 0.149
4.42SerAsn: 4.42 ± 0.889
2.578SerPro: 2.578 ± 0.143
3.315SerGln: 3.315 ± 0.442
2.947SerArg: 2.947 ± 0.292
8.471SerSer: 8.471 ± 0.173
6.63SerThr: 6.63 ± 0.919
4.052SerVal: 4.052 ± 0.161
0.0SerTrp: 0.0 ± 0.0
4.42SerTyr: 4.42 ± 2.715
0.0SerXaa: 0.0 ± 0.0
Thr
4.052ThrAla: 4.052 ± 0.74
0.0ThrCys: 0.0 ± 0.0
3.315ThrAsp: 3.315 ± 1.343
3.683ThrGlu: 3.683 ± 1.211
2.578ThrPhe: 2.578 ± 1.044
1.842ThrGly: 1.842 ± 0.746
1.105ThrHis: 1.105 ± 0.454
5.525ThrIle: 5.525 ± 1.367
5.157ThrLys: 5.157 ± 0.286
2.578ThrLeu: 2.578 ± 0.758
1.105ThrMet: 1.105 ± 0.448
4.42ThrAsn: 4.42 ± 0.012
1.473ThrPro: 1.473 ± 0.597
0.737ThrGln: 0.737 ± 0.603
3.683ThrArg: 3.683 ± 2.112
6.63ThrSer: 6.63 ± 2.721
3.315ThrThr: 3.315 ± 0.442
2.947ThrVal: 2.947 ± 0.292
0.737ThrTrp: 0.737 ± 0.603
4.42ThrTyr: 4.42 ± 0.012
0.0ThrXaa: 0.0 ± 0.0
Val
2.947ValAla: 2.947 ± 0.292
1.105ValCys: 1.105 ± 0.454
2.578ValAsp: 2.578 ± 1.044
2.947ValGlu: 2.947 ± 0.609
1.473ValPhe: 1.473 ± 0.597
2.947ValGly: 2.947 ± 1.193
2.578ValHis: 2.578 ± 0.758
6.262ValIle: 6.262 ± 2.87
3.683ValLys: 3.683 ± 0.591
3.683ValLeu: 3.683 ± 1.211
0.737ValMet: 0.737 ± 0.298
0.368ValAsn: 0.368 ± 0.149
2.578ValPro: 2.578 ± 1.044
1.105ValGln: 1.105 ± 0.454
2.947ValArg: 2.947 ± 0.292
4.788ValSer: 4.788 ± 1.038
5.525ValThr: 5.525 ± 1.337
2.947ValVal: 2.947 ± 0.292
0.0ValTrp: 0.0 ± 0.0
4.42ValTyr: 4.42 ± 0.913
0.0ValXaa: 0.0 ± 0.0
Trp
0.368TrpAla: 0.368 ± 0.149
0.368TrpCys: 0.368 ± 0.149
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.368TrpPhe: 0.368 ± 0.149
0.0TrpGly: 0.0 ± 0.0
0.368TrpHis: 0.368 ± 0.149
0.737TrpIle: 0.737 ± 0.298
0.0TrpLys: 0.0 ± 0.0
1.105TrpLeu: 1.105 ± 0.448
0.368TrpMet: 0.368 ± 0.149
0.737TrpAsn: 0.737 ± 0.603
0.0TrpPro: 0.0 ± 0.0
0.737TrpGln: 0.737 ± 0.298
0.368TrpArg: 0.368 ± 0.752
0.737TrpSer: 0.737 ± 0.298
0.0TrpThr: 0.0 ± 0.0
0.368TrpVal: 0.368 ± 0.149
0.0TrpTrp: 0.0 ± 0.0
0.368TrpTyr: 0.368 ± 0.752
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.21TyrAla: 2.21 ± 0.006
1.473TyrCys: 1.473 ± 1.205
4.052TyrAsp: 4.052 ± 1.641
5.893TyrGlu: 5.893 ± 1.486
2.21TyrPhe: 2.21 ± 0.907
1.105TyrGly: 1.105 ± 1.355
1.105TyrHis: 1.105 ± 0.448
4.42TyrIle: 4.42 ± 1.814
3.315TyrLys: 3.315 ± 0.46
4.788TyrLeu: 4.788 ± 0.137
1.105TyrMet: 1.105 ± 0.388
4.052TyrAsn: 4.052 ± 0.161
1.105TyrPro: 1.105 ± 0.454
2.21TyrGln: 2.21 ± 0.895
2.578TyrArg: 2.578 ± 0.143
2.21TyrSer: 2.21 ± 0.907
1.842TyrThr: 1.842 ± 1.056
3.315TyrVal: 3.315 ± 2.262
0.737TyrTrp: 0.737 ± 0.298
3.683TyrTyr: 3.683 ± 2.112
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2716 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski