Amino acid dipeptide frequency for Gossypium darwinii symptomless virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
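The calculation described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter alphabet, the toy sequences, and the choice to count overlapping pairs within each protein separately are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues as one-letter codes; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumptions of this sketch: overlapping pairs are counted inside each
    protein only, and every non-standard letter is mapped to "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Expectation if all 400 standard dipeptides were equally likely.
UNIFORM_20 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %

# Hypothetical toy sequences, not taken from the proteome.
freqs = dipeptide_per_mille(["MKVLAAGLL", "MAALKX"])
over = sorted(d for d, f in freqs.items() if f > UNIFORM_20)
under = sorted(d for d, f in freqs.items() if f < UNIFORM_20)
```

With so few residues, every dipeptide that occurs at all lands above the 2.5‰ threshold, which is also why small proteomes such as this one show many zero entries next to a few large values.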

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
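As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the SerSer value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5

ser_ser = 9.804  # SerSer entry from the table, in per mille
fraction = per_mille_to_fraction(ser_ser)   # about 0.009804
ratio = ser_ser / UNIFORM_PER_MILLE         # about 3.92 times the expectation
```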

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.348 ± 1.613
AlaCys: 0.891 ± 0.724
AlaAsp: 0.891 ± 0.724
AlaGlu: 0.0 ± 0.0
AlaPhe: 0.0 ± 0.0
AlaGly: 1.783 ± 0.659
AlaHis: 1.783 ± 1.309
AlaIle: 1.783 ± 1.309
AlaLys: 2.674 ± 1.07
AlaLeu: 8.021 ± 2.782
AlaMet: 0.0 ± 0.0
AlaAsn: 2.674 ± 1.096
AlaPro: 2.674 ± 1.096
AlaGln: 4.456 ± 1.314
AlaArg: 4.456 ± 1.844
AlaSer: 4.456 ± 2.803
AlaThr: 2.674 ± 2.172
AlaVal: 2.674 ± 1.067
AlaTrp: 1.783 ± 0.659
AlaTyr: 1.783 ± 0.963
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 1.783 ± 1.882
CysAsp: 0.0 ± 0.0
CysGlu: 0.891 ± 0.724
CysPhe: 0.891 ± 0.867
CysGly: 1.783 ± 0.94
CysHis: 0.891 ± 0.922
CysIle: 1.783 ± 1.159
CysLys: 1.783 ± 0.659
CysLeu: 0.0 ± 0.0
CysMet: 0.891 ± 0.941
CysAsn: 1.783 ± 0.94
CysPro: 1.783 ± 1.882
CysGln: 0.891 ± 0.655
CysArg: 0.891 ± 0.941
CysSer: 4.456 ± 2.002
CysThr: 1.783 ± 1.031
CysVal: 0.891 ± 0.724
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.783 ± 1.309
AspCys: 0.0 ± 0.0
AspAsp: 0.891 ± 0.655
AspGlu: 2.674 ± 0.761
AspPhe: 1.783 ± 0.659
AspGly: 2.674 ± 1.331
AspHis: 0.0 ± 0.0
AspIle: 2.674 ± 1.842
AspLys: 2.674 ± 0.93
AspLeu: 3.565 ± 1.304
AspMet: 0.0 ± 0.0
AspAsn: 3.565 ± 0.953
AspPro: 4.456 ± 1.809
AspGln: 2.674 ± 1.067
AspArg: 1.783 ± 1.448
AspSer: 2.674 ± 1.067
AspThr: 2.674 ± 1.788
AspVal: 5.348 ± 1.798
AspTrp: 0.891 ± 0.655
AspTyr: 3.565 ± 0.982
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.239 ± 1.701
GluCys: 0.891 ± 0.922
GluAsp: 1.783 ± 1.691
GluGlu: 6.239 ± 3.76
GluPhe: 3.565 ± 1.892
GluGly: 4.456 ± 1.476
GluHis: 1.783 ± 1.182
GluIle: 0.0 ± 0.0
GluLys: 0.891 ± 0.655
GluLeu: 5.348 ± 2.237
GluMet: 0.0 ± 0.0
GluAsn: 3.565 ± 1.896
GluPro: 2.674 ± 1.096
GluGln: 1.783 ± 1.159
GluArg: 0.0 ± 0.0
GluSer: 5.348 ± 1.803
GluThr: 0.0 ± 0.0
GluVal: 1.783 ± 0.659
GluTrp: 2.674 ± 1.742
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 0.891 ± 0.724
PheAsp: 3.565 ± 1.146
PheGlu: 1.783 ± 0.659
PhePhe: 1.783 ± 0.659
PheGly: 1.783 ± 1.448
PheHis: 1.783 ± 1.309
PheIle: 1.783 ± 1.309
PheLys: 3.565 ± 2.519
PheLeu: 7.13 ± 2.109
PheMet: 0.891 ± 0.655
PheAsn: 0.891 ± 0.724
PhePro: 0.891 ± 0.941
PheGln: 3.565 ± 1.876
PheArg: 5.348 ± 2.537
PheSer: 2.674 ± 1.742
PheThr: 0.891 ± 0.846
PheVal: 1.783 ± 0.659
PheTrp: 0.0 ± 0.0
PheTyr: 1.783 ± 1.159
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.674 ± 1.377
GlyCys: 2.674 ± 1.303
GlyAsp: 1.783 ± 1.309
GlyGlu: 3.565 ± 1.04
GlyPhe: 1.783 ± 1.245
GlyGly: 2.674 ± 1.096
GlyHis: 1.783 ± 0.94
GlyIle: 2.674 ± 1.07
GlyLys: 6.239 ± 2.456
GlyLeu: 3.565 ± 1.37
GlyMet: 0.891 ± 0.941
GlyAsn: 2.674 ± 2.015
GlyPro: 3.565 ± 1.681
GlyGln: 2.674 ± 0.805
GlyArg: 0.891 ± 0.655
GlySer: 3.565 ± 0.975
GlyThr: 2.674 ± 1.2
GlyVal: 2.674 ± 1.842
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.891 ± 0.941
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.783 ± 1.448
HisCys: 0.891 ± 0.941
HisAsp: 2.674 ± 1.646
HisGlu: 0.891 ± 0.655
HisPhe: 4.456 ± 1.783
HisGly: 2.674 ± 1.979
HisHis: 0.0 ± 0.0
HisIle: 1.783 ± 1.035
HisLys: 2.674 ± 1.342
HisLeu: 3.565 ± 1.413
HisMet: 0.0 ± 0.0
HisAsn: 2.674 ± 1.331
HisPro: 2.674 ± 1.07
HisGln: 1.783 ± 0.963
HisArg: 3.565 ± 2.062
HisSer: 0.891 ± 0.655
HisThr: 1.783 ± 1.448
HisVal: 1.783 ± 0.946
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 1.783 ± 0.963
IleAsp: 3.565 ± 2.618
IleGlu: 0.891 ± 0.655
IlePhe: 3.565 ± 1.889
IleGly: 1.783 ± 1.448
IleHis: 2.674 ± 1.817
IleIle: 0.0 ± 0.0
IleLys: 6.239 ± 1.712
IleLeu: 1.783 ± 1.158
IleMet: 0.0 ± 0.0
IleAsn: 1.783 ± 1.182
IlePro: 1.783 ± 0.946
IleGln: 4.456 ± 2.21
IleArg: 3.565 ± 1.632
IleSer: 5.348 ± 1.75
IleThr: 0.891 ± 0.655
IleVal: 2.674 ± 1.221
IleTrp: 1.783 ± 1.099
IleTyr: 1.783 ± 0.659
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.783 ± 1.735
LysCys: 2.674 ± 1.377
LysAsp: 3.565 ± 1.876
LysGlu: 6.239 ± 2.685
LysPhe: 2.674 ± 0.912
LysGly: 3.565 ± 0.908
LysHis: 0.891 ± 0.655
LysIle: 3.565 ± 1.146
LysLys: 1.783 ± 0.659
LysLeu: 0.891 ± 0.867
LysMet: 0.0 ± 0.0
LysAsn: 7.13 ± 1.65
LysPro: 2.674 ± 1.221
LysGln: 1.783 ± 1.159
LysArg: 3.565 ± 1.15
LysSer: 5.348 ± 0.984
LysThr: 3.565 ± 0.982
LysVal: 5.348 ± 1.78
LysTrp: 0.891 ± 0.724
LysTyr: 4.456 ± 1.03
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.891 ± 0.655
LeuCys: 1.783 ± 1.309
LeuAsp: 2.674 ± 1.331
LeuGlu: 4.456 ± 1.846
LeuPhe: 1.783 ± 1.182
LeuGly: 4.456 ± 1.603
LeuHis: 2.674 ± 1.377
LeuIle: 4.456 ± 1.879
LeuLys: 5.348 ± 1.45
LeuLeu: 1.783 ± 1.308
LeuMet: 1.783 ± 1.09
LeuAsn: 6.239 ± 2.349
LeuPro: 0.891 ± 0.922
LeuGln: 3.565 ± 1.304
LeuArg: 8.021 ± 2.41
LeuSer: 5.348 ± 1.959
LeuThr: 7.13 ± 1.928
LeuVal: 4.456 ± 1.372
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.348 ± 2.382
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.891 ± 0.724
MetCys: 0.891 ± 0.724
MetAsp: 3.565 ± 1.902
MetGlu: 0.891 ± 0.846
MetPhe: 1.783 ± 1.448
MetGly: 1.783 ± 0.9
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.783 ± 1.159
MetMet: 0.0 ± 0.0
MetAsn: 0.891 ± 0.724
MetPro: 0.891 ± 0.655
MetGln: 0.0 ± 0.0
MetArg: 1.783 ± 1.031
MetSer: 1.783 ± 1.035
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 2.674 ± 1.047
MetTyr: 2.674 ± 1.646
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.565 ± 1.681
AsnCys: 0.891 ± 0.922
AsnAsp: 1.783 ± 1.309
AsnGlu: 1.783 ± 1.159
AsnPhe: 0.891 ± 0.724
AsnGly: 2.674 ± 1.067
AsnHis: 3.565 ± 1.653
AsnIle: 1.783 ± 0.659
AsnLys: 0.891 ± 0.655
AsnLeu: 8.021 ± 3.221
AsnMet: 2.674 ± 1.669
AsnAsn: 1.783 ± 1.099
AsnPro: 4.456 ± 1.552
AsnGln: 5.348 ± 1.436
AsnArg: 4.456 ± 2.31
AsnSer: 2.674 ± 1.574
AsnThr: 4.456 ± 1.982
AsnVal: 3.565 ± 1.307
AsnTrp: 0.891 ± 0.655
AsnTyr: 3.565 ± 1.116
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.456 ± 1.03
ProCys: 3.565 ± 1.937
ProAsp: 2.674 ± 1.983
ProGlu: 0.891 ± 0.655
ProPhe: 2.674 ± 1.07
ProGly: 1.783 ± 0.9
ProHis: 5.348 ± 2.139
ProIle: 3.565 ± 1.701
ProLys: 3.565 ± 1.681
ProLeu: 4.456 ± 1.174
ProMet: 0.891 ± 0.724
ProAsn: 2.674 ± 1.07
ProPro: 1.783 ± 0.946
ProGln: 3.565 ± 2.638
ProArg: 3.565 ± 0.908
ProSer: 5.348 ± 2.397
ProThr: 3.565 ± 1.876
ProVal: 2.674 ± 1.221
ProTrp: 0.0 ± 0.0
ProTyr: 1.783 ± 0.659
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.348 ± 2.301
GlnCys: 0.0 ± 0.0
GlnAsp: 2.674 ± 1.979
GlnGlu: 4.456 ± 1.111
GlnPhe: 1.783 ± 1.309
GlnGly: 1.783 ± 1.309
GlnHis: 2.674 ± 2.015
GlnIle: 3.565 ± 2.618
GlnLys: 0.891 ± 0.941
GlnLeu: 0.891 ± 0.941
GlnMet: 0.891 ± 0.846
GlnAsn: 4.456 ± 1.809
GlnPro: 5.348 ± 3.397
GlnGln: 5.348 ± 1.787
GlnArg: 2.674 ± 1.327
GlnSer: 4.456 ± 1.476
GlnThr: 3.565 ± 1.344
GlnVal: 5.348 ± 2.397
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.783 ± 1.099
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.891 ± 0.724
ArgCys: 1.783 ± 1.245
ArgAsp: 3.565 ± 1.361
ArgGlu: 1.783 ± 1.319
ArgPhe: 2.674 ± 1.096
ArgGly: 3.565 ± 0.983
ArgHis: 2.674 ± 1.983
ArgIle: 4.456 ± 1.993
ArgLys: 2.674 ± 1.646
ArgLeu: 3.565 ± 2.049
ArgMet: 1.783 ± 1.448
ArgAsn: 1.783 ± 1.308
ArgPro: 4.456 ± 1.335
ArgGln: 2.674 ± 1.597
ArgArg: 6.239 ± 3.819
ArgSer: 8.913 ± 2.854
ArgThr: 6.239 ± 3.247
ArgVal: 6.239 ± 1.727
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.783 ± 1.159
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.674 ± 1.327
SerCys: 0.891 ± 0.941
SerAsp: 4.456 ± 1.072
SerGlu: 3.565 ± 1.304
SerPhe: 3.565 ± 0.983
SerGly: 3.565 ± 1.078
SerHis: 2.674 ± 1.817
SerIle: 3.565 ± 1.941
SerLys: 8.913 ± 1.58
SerLeu: 3.565 ± 1.344
SerMet: 1.783 ± 0.926
SerAsn: 5.348 ± 1.578
SerPro: 8.021 ± 1.493
SerGln: 3.565 ± 1.365
SerArg: 8.021 ± 1.338
SerSer: 9.804 ± 4.705
SerThr: 5.348 ± 1.895
SerVal: 2.674 ± 1.688
SerTrp: 0.0 ± 0.0
SerTyr: 1.783 ± 0.94
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.456 ± 1.629
ThrCys: 0.891 ± 0.846
ThrAsp: 0.0 ± 0.0
ThrGlu: 1.783 ± 1.031
ThrPhe: 1.783 ± 1.153
ThrGly: 4.456 ± 1.072
ThrHis: 2.674 ± 1.525
ThrIle: 0.891 ± 0.655
ThrLys: 3.565 ± 1.319
ThrLeu: 6.239 ± 1.48
ThrMet: 2.674 ± 1.037
ThrAsn: 5.348 ± 1.816
ThrPro: 4.456 ± 1.15
ThrGln: 4.456 ± 2.982
ThrArg: 1.783 ± 0.9
ThrSer: 1.783 ± 1.691
ThrThr: 2.674 ± 1.991
ThrVal: 2.674 ± 1.688
ThrTrp: 0.891 ± 0.846
ThrTyr: 0.891 ± 0.655
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.783 ± 1.448
ValCys: 0.0 ± 0.0
ValAsp: 4.456 ± 0.927
ValGlu: 3.565 ± 1.994
ValPhe: 2.674 ± 1.241
ValGly: 0.891 ± 0.724
ValHis: 1.783 ± 1.159
ValIle: 6.239 ± 2.612
ValLys: 5.348 ± 1.798
ValLeu: 5.348 ± 1.824
ValMet: 2.674 ± 2.172
ValAsn: 1.783 ± 1.159
ValPro: 3.565 ± 0.878
ValGln: 3.565 ± 0.908
ValArg: 3.565 ± 2.896
ValSer: 3.565 ± 1.365
ValThr: 1.783 ± 1.448
ValVal: 2.674 ± 1.221
ValTrp: 0.891 ± 0.655
ValTyr: 4.456 ± 1.962
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.674 ± 1.964
TrpCys: 0.0 ± 0.0
TrpAsp: 0.891 ± 0.941
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.891 ± 0.655
TrpHis: 0.891 ± 0.724
TrpIle: 0.0 ± 0.0
TrpLys: 0.891 ± 0.867
TrpLeu: 0.0 ± 0.0
TrpMet: 0.891 ± 0.724
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.891 ± 0.655
TrpArg: 0.891 ± 0.922
TrpSer: 0.891 ± 0.922
TrpThr: 1.783 ± 1.099
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.783 ± 0.9
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.565 ± 1.361
TyrCys: 0.0 ± 0.0
TyrAsp: 0.891 ± 0.724
TyrGlu: 2.674 ± 1.983
TyrPhe: 2.674 ± 0.912
TyrGly: 0.891 ± 0.655
TyrHis: 0.0 ± 0.0
TyrIle: 1.783 ± 1.309
TyrLys: 1.783 ± 1.309
TyrLeu: 3.565 ± 1.078
TyrMet: 2.674 ± 1.733
TyrAsn: 3.565 ± 1.654
TyrPro: 1.783 ± 0.9
TyrGln: 0.891 ± 0.724
TyrArg: 2.674 ± 1.688
TyrSer: 4.456 ± 1.783
TyrThr: 0.891 ± 0.724
TyrVal: 5.348 ± 2.275
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.891 ± 0.922
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1123 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
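A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI code: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each resample, and the sample standard deviation of those estimates is reported as the error. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code, not the real sequences.
toy = ["MKVLAAGLL", "MAALKX", "SSRSK"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # absent everywhere, so the error is 0.0
```

A dipeptide that never occurs has the same frequency (zero) in every resample, which matches the `0.0 ± 0.0` entries in the table.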

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski