Amino acid dipeptide frequency for Xinzhou nematode virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
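As an illustration, the counting could be sketched as follows. This is a minimal sketch, not the Proteome-pI implementation: the function name dipeptide_frequencies is hypothetical, the sequences are toy data in one-letter code, and it is an assumption that overlapping pairs are counted and that pairs never span two proteins.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and the result is normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences (not taken from the proteome described here).
freqs = dipeptide_frequencies(["AAVL", "AVA"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The two toy sequences contain five dipeptides in total, two of which are AV, so AV comes out at 400‰ in this example.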

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 amino acids were equally common, each dipeptide would be present with a frequency of 0.25%. As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; on this scale the expected 0.25% corresponds to 2.5‰.
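The comparison against the expected value can be sketched as below. The helper names are hypothetical, and the sketch assumes that the red/blue marking is a plain comparison against the uniform expectation of 1/400; the page does not state the exact rule. The three sample values are taken from the table.

```python
# Uniform expectation over 20 x 20 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"   # would be marked red
    if value < expected:
        return "under"  # would be marked blue
    return "expected"


# Values copied from the table, in per mille.
examples = {"AlaAla": 11.966, "AlaCys": 2.137, "CysCys": 0.0}
for name, value in examples.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

With this rule AlaAla (11.966‰, a fraction of about 0.012) is overrepresented, while AlaCys and CysCys fall below the 2.5‰ expectation.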

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.966AlaAla: 11.966 ± 1.613
2.137AlaCys: 2.137 ± 1.094
4.274AlaAsp: 4.274 ± 0.039
4.274AlaGlu: 4.274 ± 0.039
3.419AlaPhe: 3.419 ± 0.613
4.701AlaGly: 4.701 ± 3.051
2.137AlaHis: 2.137 ± 1.094
8.12AlaIle: 8.12 ± 2.437
4.274AlaLys: 4.274 ± 1.035
6.41AlaLeu: 6.41 ± 1.016
2.137AlaMet: 2.137 ± 1.055
5.128AlaAsn: 5.128 ± 0.383
3.846AlaPro: 3.846 ± 2.398
4.701AlaGln: 4.701 ± 0.902
6.838AlaArg: 6.838 ± 1.227
8.547AlaSer: 8.547 ± 4.375
8.12AlaThr: 8.12 ± 0.785
9.402AlaVal: 9.402 ± 0.344
3.419AlaTrp: 3.419 ± 0.461
0.855AlaTyr: 0.855 ± 0.422
0.0AlaXaa: 0.0 ± 0.0
Cys
0.427CysAla: 0.427 ± 0.211
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.282CysGlu: 1.282 ± 0.633
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
1.282CysHis: 1.282 ± 0.633
0.427CysIle: 0.427 ± 0.211
1.282CysLys: 1.282 ± 0.441
0.855CysLeu: 0.855 ± 0.422
0.0CysMet: 0.0 ± 0.0
0.855CysAsn: 0.855 ± 0.422
0.427CysPro: 0.427 ± 0.211
0.427CysGln: 0.427 ± 0.211
1.282CysArg: 1.282 ± 0.633
0.427CysSer: 0.427 ± 0.211
0.855CysThr: 0.855 ± 0.652
1.282CysVal: 1.282 ± 0.441
0.0CysTrp: 0.0 ± 0.0
0.427CysTyr: 0.427 ± 0.211
0.0CysXaa: 0.0 ± 0.0
Asp
6.838AspAla: 6.838 ± 0.922
0.0AspCys: 0.0 ± 0.0
4.701AspAsp: 4.701 ± 0.902
2.991AspGlu: 2.991 ± 0.402
1.282AspPhe: 1.282 ± 0.633
1.709AspGly: 1.709 ± 0.23
2.137AspHis: 2.137 ± 1.055
3.846AspIle: 3.846 ± 1.324
2.137AspLys: 2.137 ± 1.055
3.846AspLeu: 3.846 ± 0.25
1.709AspMet: 1.709 ± 0.844
0.855AspAsn: 0.855 ± 0.422
3.419AspPro: 3.419 ± 1.688
1.282AspGln: 1.282 ± 0.633
2.991AspArg: 2.991 ± 0.672
2.137AspSer: 2.137 ± 0.02
4.701AspThr: 4.701 ± 1.246
3.846AspVal: 3.846 ± 0.824
0.427AspTrp: 0.427 ± 0.863
0.855AspTyr: 0.855 ± 0.652
0.0AspXaa: 0.0 ± 0.0
Glu
5.983GluAla: 5.983 ± 1.344
0.427GluCys: 0.427 ± 0.211
1.282GluAsp: 1.282 ± 1.516
2.991GluGlu: 2.991 ± 0.672
0.427GluPhe: 0.427 ± 0.211
2.137GluGly: 2.137 ± 0.02
1.709GluHis: 1.709 ± 0.844
3.846GluIle: 3.846 ± 0.25
3.419GluLys: 3.419 ± 0.461
4.701GluLeu: 4.701 ± 1.246
2.137GluMet: 2.137 ± 1.055
1.709GluAsn: 1.709 ± 0.844
2.564GluPro: 2.564 ± 1.266
0.427GluGln: 0.427 ± 0.211
1.709GluArg: 1.709 ± 0.844
5.556GluSer: 5.556 ± 1.555
3.419GluThr: 3.419 ± 1.688
4.701GluVal: 4.701 ± 1.977
1.709GluTrp: 1.709 ± 0.844
1.282GluTyr: 1.282 ± 0.441
0.0GluXaa: 0.0 ± 0.0
Phe
3.846PheAla: 3.846 ± 1.324
0.0PheCys: 0.0 ± 0.0
0.427PheAsp: 0.427 ± 0.211
1.709PheGlu: 1.709 ± 0.23
0.855PhePhe: 0.855 ± 0.422
1.709PheGly: 1.709 ± 0.23
0.855PheHis: 0.855 ± 0.652
1.282PheIle: 1.282 ± 0.441
1.282PheLys: 1.282 ± 0.633
3.846PheLeu: 3.846 ± 1.898
1.709PheMet: 1.709 ± 0.844
0.427PheAsn: 0.427 ± 0.211
2.137PhePro: 2.137 ± 1.055
0.855PheGln: 0.855 ± 0.652
2.991PheArg: 2.991 ± 1.746
2.991PheSer: 2.991 ± 0.402
1.282PheThr: 1.282 ± 0.633
2.564PheVal: 2.564 ± 0.883
0.855PheTrp: 0.855 ± 0.652
0.855PheTyr: 0.855 ± 0.422
0.0PheXaa: 0.0 ± 0.0
Gly
4.274GlyAla: 4.274 ± 1.113
0.427GlyCys: 0.427 ± 0.863
3.419GlyAsp: 3.419 ± 0.613
0.855GlyGlu: 0.855 ± 0.652
1.282GlyPhe: 1.282 ± 1.516
2.564GlyGly: 2.564 ± 0.191
0.855GlyHis: 0.855 ± 0.422
5.983GlyIle: 5.983 ± 1.344
2.137GlyLys: 2.137 ± 1.055
5.983GlyLeu: 5.983 ± 2.418
3.846GlyMet: 3.846 ± 1.898
0.855GlyAsn: 0.855 ± 0.422
1.709GlyPro: 1.709 ± 0.23
1.282GlyGln: 1.282 ± 0.633
2.564GlyArg: 2.564 ± 1.266
3.419GlySer: 3.419 ± 0.613
4.274GlyThr: 4.274 ± 0.039
5.128GlyVal: 5.128 ± 0.383
2.137GlyTrp: 2.137 ± 0.02
1.282GlyTyr: 1.282 ± 2.59
0.0GlyXaa: 0.0 ± 0.0
His
2.991HisAla: 2.991 ± 1.477
0.0HisCys: 0.0 ± 0.0
2.137HisAsp: 2.137 ± 0.02
1.282HisGlu: 1.282 ± 0.441
1.282HisPhe: 1.282 ± 1.516
1.282HisGly: 1.282 ± 0.441
0.855HisHis: 0.855 ± 1.727
2.137HisIle: 2.137 ± 1.094
0.427HisLys: 0.427 ± 0.211
1.282HisLeu: 1.282 ± 0.441
1.282HisMet: 1.282 ± 0.441
0.427HisAsn: 0.427 ± 0.211
1.282HisPro: 1.282 ± 0.441
1.282HisGln: 1.282 ± 0.633
1.282HisArg: 1.282 ± 0.441
3.419HisSer: 3.419 ± 1.688
1.709HisThr: 1.709 ± 0.844
3.419HisVal: 3.419 ± 0.613
0.0HisTrp: 0.0 ± 0.0
0.427HisTyr: 0.427 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
8.12IleAla: 8.12 ± 1.363
1.282IleCys: 1.282 ± 0.633
4.274IleAsp: 4.274 ± 0.039
2.137IleGlu: 2.137 ± 1.094
2.991IlePhe: 2.991 ± 2.82
2.137IleGly: 2.137 ± 0.02
2.137IleHis: 2.137 ± 0.02
2.564IleIle: 2.564 ± 0.883
1.282IleLys: 1.282 ± 0.633
4.274IleLeu: 4.274 ± 4.336
2.137IleMet: 2.137 ± 1.055
2.991IleAsn: 2.991 ± 1.746
5.128IlePro: 5.128 ± 0.691
2.137IleGln: 2.137 ± 1.055
6.838IleArg: 6.838 ± 0.152
3.846IleSer: 3.846 ± 0.25
1.282IleThr: 1.282 ± 0.633
4.701IleVal: 4.701 ± 2.32
0.0IleTrp: 0.0 ± 0.0
1.709IleTyr: 1.709 ± 0.23
0.0IleXaa: 0.0 ± 0.0
Lys
5.556LysAla: 5.556 ± 0.594
0.855LysCys: 0.855 ± 0.422
2.564LysAsp: 2.564 ± 1.266
2.564LysGlu: 2.564 ± 0.191
1.709LysPhe: 1.709 ± 0.844
1.709LysGly: 1.709 ± 0.844
0.427LysHis: 0.427 ± 0.211
2.137LysIle: 2.137 ± 0.02
1.709LysLys: 1.709 ± 0.844
3.846LysLeu: 3.846 ± 1.898
2.137LysMet: 2.137 ± 0.02
0.427LysAsn: 0.427 ± 0.211
2.564LysPro: 2.564 ± 1.957
0.855LysGln: 0.855 ± 0.652
3.846LysArg: 3.846 ± 0.824
2.564LysSer: 2.564 ± 1.266
2.991LysThr: 2.991 ± 0.402
1.709LysVal: 1.709 ± 0.844
0.427LysTrp: 0.427 ± 0.211
0.855LysTyr: 0.855 ± 0.422
0.0LysXaa: 0.0 ± 0.0
Leu
6.838LeuAla: 6.838 ± 4.144
1.282LeuCys: 1.282 ± 0.633
2.991LeuAsp: 2.991 ± 0.672
4.701LeuGlu: 4.701 ± 1.246
2.991LeuPhe: 2.991 ± 1.477
4.701LeuGly: 4.701 ± 0.902
1.709LeuHis: 1.709 ± 0.844
5.128LeuIle: 5.128 ± 1.457
3.419LeuLys: 3.419 ± 0.461
7.692LeuLeu: 7.692 ± 3.797
0.855LeuMet: 0.855 ± 0.652
6.838LeuAsn: 6.838 ± 1.996
2.991LeuPro: 2.991 ± 0.402
1.709LeuGln: 1.709 ± 0.844
5.983LeuArg: 5.983 ± 0.805
6.838LeuSer: 6.838 ± 0.152
4.274LeuThr: 4.274 ± 1.035
4.701LeuVal: 4.701 ± 1.246
1.282LeuTrp: 1.282 ± 0.633
2.137LeuTyr: 2.137 ± 1.055
0.0LeuXaa: 0.0 ± 0.0
Met
3.419MetAla: 3.419 ± 0.461
0.427MetCys: 0.427 ± 0.211
1.709MetAsp: 1.709 ± 0.844
3.419MetGlu: 3.419 ± 1.688
0.855MetPhe: 0.855 ± 0.422
2.991MetGly: 2.991 ± 1.477
1.282MetHis: 1.282 ± 0.441
2.564MetIle: 2.564 ± 1.266
2.564MetLys: 2.564 ± 1.266
3.419MetLeu: 3.419 ± 0.613
1.709MetMet: 1.709 ± 0.844
1.709MetAsn: 1.709 ± 0.844
1.282MetPro: 1.282 ± 1.516
0.0MetGln: 0.0 ± 0.0
2.137MetArg: 2.137 ± 1.055
3.846MetSer: 3.846 ± 0.25
2.564MetThr: 2.564 ± 0.883
2.137MetVal: 2.137 ± 0.02
0.0MetTrp: 0.0 ± 0.0
1.282MetTyr: 1.282 ± 0.633
0.0MetXaa: 0.0 ± 0.0
Asn
3.846AsnAla: 3.846 ± 1.324
0.427AsnCys: 0.427 ± 0.211
1.282AsnAsp: 1.282 ± 1.516
2.137AsnGlu: 2.137 ± 0.02
1.282AsnPhe: 1.282 ± 0.441
1.709AsnGly: 1.709 ± 2.379
2.137AsnHis: 2.137 ± 1.094
2.137AsnIle: 2.137 ± 0.02
0.427AsnLys: 0.427 ± 0.211
2.137AsnLeu: 2.137 ± 1.055
0.855AsnMet: 0.855 ± 0.422
1.709AsnAsn: 1.709 ± 0.23
2.564AsnPro: 2.564 ± 0.191
1.282AsnGln: 1.282 ± 0.441
2.991AsnArg: 2.991 ± 0.402
2.137AsnSer: 2.137 ± 1.055
3.419AsnThr: 3.419 ± 0.461
4.701AsnVal: 4.701 ± 2.32
0.427AsnTrp: 0.427 ± 0.211
0.855AsnTyr: 0.855 ± 0.422
0.0AsnXaa: 0.0 ± 0.0
Pro
2.991ProAla: 2.991 ± 1.746
0.427ProCys: 0.427 ± 0.211
2.564ProAsp: 2.564 ± 1.266
2.991ProGlu: 2.991 ± 0.402
0.855ProPhe: 0.855 ± 0.652
3.419ProGly: 3.419 ± 0.461
1.709ProHis: 1.709 ± 0.23
1.709ProIle: 1.709 ± 0.23
2.564ProLys: 2.564 ± 0.191
2.991ProLeu: 2.991 ± 1.746
1.709ProMet: 1.709 ± 0.685
2.564ProAsn: 2.564 ± 0.191
1.709ProPro: 1.709 ± 1.305
0.427ProGln: 0.427 ± 0.863
4.274ProArg: 4.274 ± 1.113
3.846ProSer: 3.846 ± 0.25
0.855ProThr: 0.855 ± 0.422
4.701ProVal: 4.701 ± 0.902
0.855ProTrp: 0.855 ± 0.422
0.855ProTyr: 0.855 ± 0.422
0.0ProXaa: 0.0 ± 0.0
Gln
2.991GlnAla: 2.991 ± 0.402
0.0GlnCys: 0.0 ± 0.0
1.709GlnAsp: 1.709 ± 1.305
0.855GlnGlu: 0.855 ± 0.422
1.282GlnPhe: 1.282 ± 0.441
2.137GlnGly: 2.137 ± 1.055
0.855GlnHis: 0.855 ± 0.422
1.282GlnIle: 1.282 ± 0.441
0.427GlnLys: 0.427 ± 0.211
0.855GlnLeu: 0.855 ± 0.422
0.427GlnMet: 0.427 ± 0.211
0.855GlnAsn: 0.855 ± 0.422
1.282GlnPro: 1.282 ± 1.516
0.0GlnGln: 0.0 ± 0.0
0.855GlnArg: 0.855 ± 0.652
3.419GlnSer: 3.419 ± 0.613
0.855GlnThr: 0.855 ± 0.652
2.137GlnVal: 2.137 ± 1.055
0.427GlnTrp: 0.427 ± 0.211
1.709GlnTyr: 1.709 ± 1.305
0.0GlnXaa: 0.0 ± 0.0
Arg
8.974ArgAla: 8.974 ± 0.941
0.855ArgCys: 0.855 ± 0.422
2.991ArgAsp: 2.991 ± 0.402
5.128ArgGlu: 5.128 ± 0.691
3.846ArgPhe: 3.846 ± 0.824
3.846ArgGly: 3.846 ± 1.898
2.137ArgHis: 2.137 ± 1.055
3.846ArgIle: 3.846 ± 0.25
0.855ArgLys: 0.855 ± 0.422
6.41ArgLeu: 6.41 ± 3.164
1.709ArgMet: 1.709 ± 0.844
3.419ArgAsn: 3.419 ± 1.535
1.282ArgPro: 1.282 ± 1.516
1.709ArgGln: 1.709 ± 0.844
4.701ArgArg: 4.701 ± 0.172
4.701ArgSer: 4.701 ± 0.902
3.846ArgThr: 3.846 ± 2.398
7.692ArgVal: 7.692 ± 0.574
0.0ArgTrp: 0.0 ± 0.0
1.282ArgTyr: 1.282 ± 0.633
0.0ArgXaa: 0.0 ± 0.0
Ser
6.41SerAla: 6.41 ± 0.059
0.855SerCys: 0.855 ± 0.422
3.419SerAsp: 3.419 ± 1.688
4.701SerGlu: 4.701 ± 0.172
2.564SerPhe: 2.564 ± 0.191
5.983SerGly: 5.983 ± 0.269
1.282SerHis: 1.282 ± 0.441
5.128SerIle: 5.128 ± 0.691
5.983SerLys: 5.983 ± 1.344
6.41SerLeu: 6.41 ± 0.059
3.846SerMet: 3.846 ± 0.824
1.709SerAsn: 1.709 ± 0.844
1.709SerPro: 1.709 ± 0.844
2.564SerGln: 2.564 ± 0.883
3.419SerArg: 3.419 ± 0.461
7.692SerSer: 7.692 ± 1.574
3.419SerThr: 3.419 ± 1.535
7.265SerVal: 7.265 ± 1.438
2.137SerTrp: 2.137 ± 1.094
1.709SerTyr: 1.709 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
6.838ThrAla: 6.838 ± 3.375
0.0ThrCys: 0.0 ± 0.0
4.274ThrAsp: 4.274 ± 2.109
2.991ThrGlu: 2.991 ± 0.672
2.564ThrPhe: 2.564 ± 0.191
4.274ThrGly: 4.274 ± 2.187
1.709ThrHis: 1.709 ± 0.23
2.991ThrIle: 2.991 ± 0.402
4.701ThrLys: 4.701 ± 2.32
3.846ThrLeu: 3.846 ± 0.824
2.137ThrMet: 2.137 ± 1.202
2.137ThrAsn: 2.137 ± 2.168
2.991ThrPro: 2.991 ± 0.672
0.855ThrGln: 0.855 ± 0.422
4.701ThrArg: 4.701 ± 1.246
1.709ThrSer: 1.709 ± 0.23
3.419ThrThr: 3.419 ± 2.609
4.274ThrVal: 4.274 ± 1.113
1.709ThrTrp: 1.709 ± 1.305
2.991ThrTyr: 2.991 ± 0.672
0.0ThrXaa: 0.0 ± 0.0
Val
8.12ValAla: 8.12 ± 0.785
1.282ValCys: 1.282 ± 0.633
3.419ValAsp: 3.419 ± 0.613
2.564ValGlu: 2.564 ± 1.266
1.709ValPhe: 1.709 ± 0.844
3.419ValGly: 3.419 ± 0.613
2.564ValHis: 2.564 ± 0.191
5.128ValIle: 5.128 ± 0.691
1.709ValLys: 1.709 ± 0.844
7.265ValLeu: 7.265 ± 1.438
5.983ValMet: 5.983 ± 0.805
2.564ValAsn: 2.564 ± 1.266
5.128ValPro: 5.128 ± 0.691
1.282ValGln: 1.282 ± 0.441
6.41ValArg: 6.41 ± 2.207
5.556ValSer: 5.556 ± 1.668
7.692ValThr: 7.692 ± 0.5
5.556ValVal: 5.556 ± 1.555
1.709ValTrp: 1.709 ± 0.23
3.846ValTyr: 3.846 ± 0.824
0.0ValXaa: 0.0 ± 0.0
Trp
1.282TrpAla: 1.282 ± 0.633
0.427TrpCys: 0.427 ± 0.211
2.137TrpAsp: 2.137 ± 1.094
2.137TrpGlu: 2.137 ± 1.094
0.855TrpPhe: 0.855 ± 0.422
0.855TrpGly: 0.855 ± 0.422
0.0TrpHis: 0.0 ± 0.0
0.427TrpIle: 0.427 ± 0.211
0.855TrpLys: 0.855 ± 0.422
0.855TrpLeu: 0.855 ± 0.652
0.855TrpMet: 0.855 ± 0.652
0.0TrpAsn: 0.0 ± 0.0
0.427TrpPro: 0.427 ± 0.211
0.427TrpGln: 0.427 ± 0.211
1.282TrpArg: 1.282 ± 0.633
2.991TrpSer: 2.991 ± 0.672
0.427TrpThr: 0.427 ± 0.863
1.282TrpVal: 1.282 ± 0.441
0.427TrpTrp: 0.427 ± 0.863
0.427TrpTyr: 0.427 ± 0.211
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.991TyrAla: 2.991 ± 1.746
0.427TyrCys: 0.427 ± 0.211
2.137TyrAsp: 2.137 ± 1.055
0.427TyrGlu: 0.427 ± 0.211
0.427TyrPhe: 0.427 ± 0.211
2.991TyrGly: 2.991 ± 0.402
0.427TyrHis: 0.427 ± 0.863
1.282TyrIle: 1.282 ± 0.441
0.0TyrLys: 0.0 ± 0.0
2.137TyrLeu: 2.137 ± 1.094
1.709TyrMet: 1.709 ± 0.844
1.282TyrAsn: 1.282 ± 0.441
0.0TyrPro: 0.0 ± 0.0
0.855TyrGln: 0.855 ± 1.727
2.137TyrArg: 2.137 ± 1.055
2.564TyrSer: 2.564 ± 0.191
1.709TyrThr: 1.709 ± 0.844
1.709TyrVal: 1.709 ± 0.844
0.427TyrTrp: 0.427 ± 0.211
0.855TyrTyr: 0.855 ± 0.422
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2341 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
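A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: the function names are hypothetical, whole proteins are resampled with replacement 100 times, and the spread is reported as a population standard deviation. The page does not say which estimator or random seed Proteome-pI uses.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Assumption: the error is the standard deviation of the frequency
    over n_rounds resampled proteomes of the original size.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement (protein-level bootstrap).
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences (not taken from the proteome described here).
toy = ["AAVLAAV", "AVAGGAV"]
print(round(bootstrap_error(toy, "AV"), 3))
```

With only two proteins, as in this proteome, each resampled set is one of just four combinations, so the error estimates are necessarily coarse.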

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski