Amino acid dipeptide frequency for Horseradish curly top virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
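As a minimal sketch of how such frequencies might be computed, the snippet below counts overlapping pairs of adjacent residues and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the choice not to count pairs that span two proteins are assumptions made for illustration; the exact procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    toy = ["MKKLA", "AKK"]  # hypothetical toy sequences, not real viral proteins
    for pair, freq in sorted(dipeptide_frequencies(toy).items()):
        print(pair, round(freq, 3))
```

Dipeptides that never occur are simply absent from the returned dictionary, so a caller wanting the full 400 (or 441) entries would fill in zeros.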

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
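The expected value under the uniform assumption is simple arithmetic, sketched below together with a naive over/under classification. The helper names and the plain greater-than/less-than comparison are assumptions for illustration; the page does not state the exact rule used to colour the cells.

```python
def expected_uniform_permille(n_residues=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(observed_permille, expected_permille=None):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if expected_permille is None:
        expected_permille = expected_uniform_permille()
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


if __name__ == "__main__":
    print(expected_uniform_permille())    # 20 x 20 = 400 dipeptides
    print(expected_uniform_permille(21))  # 21 x 21 = 441 with Xaa included
    print(classify(6.567), classify(0.938))
```

With 20 residues the expectation is 2.5‰, so a value such as 6.567‰ would count as overrepresented and 0.938‰ as underrepresented under this simple rule.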

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
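For readers who prefer fractions or percentages, the conversion can be sketched as follows. The function names are hypothetical; the example value 9.381 is the GlySer entry from the table.

```python
def permille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per-mille value to a percentage (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    gly_ser = 9.381  # GlySer frequency in per mille, taken from the table
    print(round(permille_to_fraction(gly_ser), 6))
    print(round(permille_to_percent(gly_ser), 4))
```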

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.876 ± 0.84
AlaCys: 1.876 ± 1.403
AlaAsp: 4.69 ± 1.791
AlaGlu: 0.938 ± 0.677
AlaPhe: 0.938 ± 1.029
AlaGly: 2.814 ± 1.306
AlaHis: 0.938 ± 1.029
AlaIle: 1.876 ± 0.84
AlaLys: 2.814 ± 2.03
AlaLeu: 6.567 ± 0.633
AlaMet: 1.876 ± 1.166
AlaAsn: 1.876 ± 1.353
AlaPro: 1.876 ± 0.84
AlaGln: 0.938 ± 0.677
AlaArg: 2.814 ± 1.306
AlaSer: 0.938 ± 0.902
AlaThr: 1.876 ± 1.111
AlaVal: 3.752 ± 1.912
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.938 ± 0.788
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.876 ± 1.004
CysPhe: 2.814 ± 2.413
CysGly: 0.938 ± 0.677
CysHis: 0.0 ± 0.0
CysIle: 0.938 ± 1.029
CysLys: 0.938 ± 0.677
CysLeu: 1.876 ± 0.9
CysMet: 0.0 ± 0.0
CysAsn: 0.938 ± 0.677
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.876 ± 0.84
CysSer: 0.938 ± 1.029
CysThr: 3.752 ± 1.564
CysVal: 0.0 ± 0.0
CysTrp: 0.938 ± 0.677
CysTyr: 0.938 ± 0.788
CysXaa: 0.0 ± 0.0
Asp
AspAla: 0.938 ± 0.677
AspCys: 0.938 ± 0.677
AspAsp: 4.69 ± 1.38
AspGlu: 3.752 ± 1.564
AspPhe: 2.814 ± 1.487
AspGly: 2.814 ± 1.369
AspHis: 0.0 ± 0.0
AspIle: 6.567 ± 2.138
AspLys: 1.876 ± 0.84
AspLeu: 0.938 ± 1.07
AspMet: 1.876 ± 1.576
AspAsn: 3.752 ± 1.505
AspPro: 0.938 ± 0.677
AspGln: 0.938 ± 1.07
AspArg: 0.938 ± 1.07
AspSer: 3.752 ± 1.333
AspThr: 4.69 ± 1.57
AspVal: 4.69 ± 1.268
AspTrp: 2.814 ± 1.207
AspTyr: 1.876 ± 1.353
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.69 ± 2.385
GluCys: 0.0 ± 0.0
GluAsp: 2.814 ± 1.914
GluGlu: 8.443 ± 7.06
GluPhe: 0.0 ± 0.0
GluGly: 6.567 ± 2.323
GluHis: 1.876 ± 1.124
GluIle: 3.752 ± 3.274
GluLys: 1.876 ± 2.059
GluLeu: 2.814 ± 1.207
GluMet: 1.876 ± 1.353
GluAsn: 1.876 ± 0.84
GluPro: 0.938 ± 0.677
GluGln: 1.876 ± 1.409
GluArg: 2.814 ± 1.069
GluSer: 4.69 ± 2.294
GluThr: 0.938 ± 0.902
GluVal: 3.752 ± 2.08
GluTrp: 1.876 ± 1.05
GluTyr: 0.938 ± 0.677
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.938 ± 0.677
PheCys: 0.0 ± 0.0
PheAsp: 4.69 ± 1.684
PheGlu: 0.0 ± 0.0
PhePhe: 2.814 ± 1.207
PheGly: 2.814 ± 0.905
PheHis: 1.876 ± 0.9
PheIle: 2.814 ± 1.057
PheLys: 0.938 ± 1.109
PheLeu: 6.567 ± 2.335
PheMet: 1.876 ± 1.983
PheAsn: 2.814 ± 1.232
PhePro: 1.876 ± 1.04
PheGln: 2.814 ± 1.782
PheArg: 4.69 ± 1.114
PheSer: 2.814 ± 1.855
PheThr: 1.876 ± 1.004
PheVal: 0.938 ± 0.677
PheTrp: 0.938 ± 0.788
PheTyr: 1.876 ± 1.223
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.814 ± 1.482
GlyCys: 0.0 ± 0.0
GlyAsp: 2.814 ± 1.306
GlyGlu: 3.752 ± 1.496
GlyPhe: 0.938 ± 1.029
GlyGly: 6.567 ± 2.442
GlyHis: 0.938 ± 0.677
GlyIle: 3.752 ± 1.08
GlyLys: 3.752 ± 1.903
GlyLeu: 2.814 ± 0.905
GlyMet: 1.876 ± 1.576
GlyAsn: 1.876 ± 0.84
GlyPro: 4.69 ± 2.904
GlyGln: 4.69 ± 1.241
GlyArg: 2.814 ± 2.009
GlySer: 9.381 ± 3.134
GlyThr: 3.752 ± 1.884
GlyVal: 2.814 ± 1.487
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.938 ± 0.677
HisCys: 0.938 ± 0.677
HisAsp: 0.0 ± 0.0
HisGlu: 1.876 ± 2.059
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.938 ± 0.677
HisIle: 1.876 ± 1.124
HisLys: 0.938 ± 0.677
HisLeu: 3.752 ± 1.149
HisMet: 1.876 ± 0.845
HisAsn: 2.814 ± 2.03
HisPro: 1.876 ± 1.353
HisGln: 0.0 ± 0.0
HisArg: 0.938 ± 1.109
HisSer: 0.938 ± 0.902
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 1.876 ± 0.84
HisTyr: 0.938 ± 1.07
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.938 ± 0.902
IleCys: 0.0 ± 0.0
IleAsp: 2.814 ± 0.905
IleGlu: 4.69 ± 3.352
IlePhe: 4.69 ± 2.541
IleGly: 2.814 ± 0.905
IleHis: 0.938 ± 0.677
IleIle: 6.567 ± 2.37
IleLys: 4.69 ± 3.022
IleLeu: 7.505 ± 2.942
IleMet: 0.0 ± 0.0
IleAsn: 0.938 ± 0.788
IlePro: 3.752 ± 1.08
IleGln: 2.814 ± 1.312
IleArg: 5.629 ± 1.107
IleSer: 2.814 ± 1.554
IleThr: 5.629 ± 1.076
IleVal: 5.629 ± 2.32
IleTrp: 1.876 ± 1.111
IleTyr: 1.876 ± 1.04
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.629 ± 2.612
LysCys: 2.814 ± 2.305
LysAsp: 6.567 ± 2.904
LysGlu: 3.752 ± 1.072
LysPhe: 0.938 ± 0.788
LysGly: 2.814 ± 1.532
LysHis: 1.876 ± 0.84
LysIle: 1.876 ± 1.637
LysLys: 6.567 ± 2.522
LysLeu: 3.752 ± 1.838
LysMet: 1.876 ± 0.84
LysAsn: 1.876 ± 1.05
LysPro: 3.752 ± 2.707
LysGln: 1.876 ± 1.409
LysArg: 4.69 ± 2.933
LysSer: 1.876 ± 1.353
LysThr: 6.567 ± 2.488
LysVal: 3.752 ± 2.015
LysTrp: 1.876 ± 1.04
LysTyr: 4.69 ± 1.268
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.876 ± 1.637
LeuCys: 1.876 ± 1.353
LeuAsp: 6.567 ± 2.444
LeuGlu: 2.814 ± 1.459
LeuPhe: 5.629 ± 2.27
LeuGly: 5.629 ± 1.713
LeuHis: 1.876 ± 1.05
LeuIle: 1.876 ± 1.353
LeuLys: 5.629 ± 1.989
LeuLeu: 8.443 ± 3.711
LeuMet: 2.814 ± 1.726
LeuAsn: 2.814 ± 1.482
LeuPro: 1.876 ± 1.403
LeuGln: 2.814 ± 1.312
LeuArg: 1.876 ± 1.05
LeuSer: 5.629 ± 1.971
LeuThr: 4.69 ± 2.294
LeuVal: 1.876 ± 1.352
LeuTrp: 0.0 ± 0.0
LeuTyr: 5.629 ± 1.924
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.938 ± 0.788
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.876 ± 1.637
MetPhe: 0.0 ± 0.0
MetGly: 2.814 ± 1.532
MetHis: 0.0 ± 0.0
MetIle: 0.938 ± 0.788
MetLys: 2.814 ± 1.069
MetLeu: 2.814 ± 1.482
MetMet: 0.938 ± 0.902
MetAsn: 0.938 ± 0.788
MetPro: 4.69 ± 1.207
MetGln: 1.876 ± 1.124
MetArg: 2.814 ± 1.482
MetSer: 1.876 ± 1.218
MetThr: 1.876 ± 1.111
MetVal: 1.876 ± 1.403
MetTrp: 0.0 ± 0.0
MetTyr: 1.876 ± 1.576
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.629 ± 2.612
AsnCys: 2.814 ± 1.069
AsnAsp: 1.876 ± 0.84
AsnGlu: 1.876 ± 0.84
AsnPhe: 1.876 ± 1.449
AsnGly: 3.752 ± 1.505
AsnHis: 0.0 ± 0.0
AsnIle: 4.69 ± 1.38
AsnLys: 1.876 ± 1.004
AsnLeu: 4.69 ± 1.795
AsnMet: 1.876 ± 0.84
AsnAsn: 1.876 ± 1.576
AsnPro: 3.752 ± 1.469
AsnGln: 1.876 ± 2.219
AsnArg: 0.0 ± 0.0
AsnSer: 1.876 ± 1.353
AsnThr: 3.752 ± 2.391
AsnVal: 4.69 ± 2.541
AsnTrp: 0.938 ± 0.677
AsnTyr: 5.629 ± 1.851
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.876 ± 1.04
ProCys: 0.0 ± 0.0
ProAsp: 2.814 ± 1.65
ProGlu: 5.629 ± 2.22
ProPhe: 1.876 ± 1.637
ProGly: 1.876 ± 1.05
ProHis: 0.938 ± 0.677
ProIle: 0.0 ± 0.0
ProLys: 3.752 ± 2.247
ProLeu: 2.814 ± 1.487
ProMet: 2.814 ± 1.366
ProAsn: 4.69 ± 1.57
ProPro: 3.752 ± 1.402
ProGln: 1.876 ± 1.353
ProArg: 3.752 ± 1.496
ProSer: 7.505 ± 4.222
ProThr: 5.629 ± 2.422
ProVal: 2.814 ± 1.306
ProTrp: 0.938 ± 0.677
ProTyr: 1.876 ± 0.84
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.814 ± 1.057
GlnCys: 0.938 ± 0.677
GlnAsp: 0.938 ± 1.029
GlnGlu: 2.814 ± 2.128
GlnPhe: 0.938 ± 1.029
GlnGly: 0.938 ± 0.788
GlnHis: 1.876 ± 1.124
GlnIle: 2.814 ± 1.151
GlnLys: 2.814 ± 1.366
GlnLeu: 1.876 ± 1.353
GlnMet: 0.938 ± 0.788
GlnAsn: 0.938 ± 0.677
GlnPro: 3.752 ± 0.93
GlnGln: 0.938 ± 0.902
GlnArg: 2.814 ± 1.306
GlnSer: 3.752 ± 1.402
GlnThr: 0.938 ± 1.109
GlnVal: 0.938 ± 0.902
GlnTrp: 1.876 ± 0.84
GlnTyr: 1.876 ± 0.84
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.938 ± 0.677
ArgCys: 0.938 ± 0.788
ArgAsp: 3.752 ± 1.31
ArgGlu: 0.938 ± 0.677
ArgPhe: 2.814 ± 1.057
ArgGly: 2.814 ± 1.151
ArgHis: 1.876 ± 1.004
ArgIle: 5.629 ± 1.756
ArgLys: 6.567 ± 3.712
ArgLeu: 0.938 ± 1.109
ArgMet: 0.0 ± 0.0
ArgAsn: 3.752 ± 1.245
ArgPro: 1.876 ± 0.84
ArgGln: 1.876 ± 1.353
ArgArg: 4.69 ± 2.409
ArgSer: 4.69 ± 1.097
ArgThr: 5.629 ± 3.39
ArgVal: 2.814 ± 1.406
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.814 ± 1.432
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 0.938 ± 0.677
SerCys: 0.938 ± 1.029
SerAsp: 1.876 ± 1.637
SerGlu: 0.938 ± 1.109
SerPhe: 8.443 ± 3.827
SerGly: 4.69 ± 2.146
SerHis: 0.938 ± 0.677
SerIle: 5.629 ± 2.445
SerLys: 2.814 ± 1.849
SerLeu: 4.69 ± 3.01
SerMet: 1.876 ± 2.08
SerAsn: 7.505 ± 2.551
SerPro: 2.814 ± 3.328
SerGln: 1.876 ± 1.04
SerArg: 6.567 ± 3.388
SerSer: 6.567 ± 1.047
SerThr: 10.319 ± 4.034
SerVal: 0.938 ± 0.788
SerTrp: 1.876 ± 1.111
SerTyr: 0.938 ± 1.07
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.876 ± 2.219
ThrCys: 0.938 ± 0.677
ThrAsp: 0.0 ± 0.0
ThrGlu: 0.938 ± 1.109
ThrPhe: 3.752 ± 1.08
ThrGly: 4.69 ± 2.151
ThrHis: 3.752 ± 1.149
ThrIle: 5.629 ± 1.932
ThrLys: 4.69 ± 1.38
ThrLeu: 5.629 ± 1.924
ThrMet: 1.876 ± 1.111
ThrAsn: 4.69 ± 1.684
ThrPro: 7.505 ± 5.135
ThrGln: 1.876 ± 0.84
ThrArg: 3.752 ± 2.407
ThrSer: 6.567 ± 5.408
ThrThr: 4.69 ± 2.288
ThrVal: 4.69 ± 2.385
ThrTrp: 0.0 ± 0.0
ThrTyr: 5.629 ± 2.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.814 ± 2.03
ValCys: 0.938 ± 0.902
ValAsp: 1.876 ± 2.139
ValGlu: 3.752 ± 2.055
ValPhe: 1.876 ± 1.223
ValGly: 1.876 ± 1.04
ValHis: 0.0 ± 0.0
ValIle: 3.752 ± 2.015
ValLys: 7.505 ± 2.592
ValLeu: 0.938 ± 1.07
ValMet: 1.876 ± 1.932
ValAsn: 4.69 ± 1.097
ValPro: 3.752 ± 1.402
ValGln: 2.814 ± 1.306
ValArg: 1.876 ± 1.05
ValSer: 2.814 ± 1.207
ValThr: 0.938 ± 0.677
ValVal: 2.814 ± 1.057
ValTrp: 0.0 ± 0.0
ValTyr: 3.752 ± 1.469
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.814 ± 1.369
TrpCys: 0.938 ± 1.029
TrpAsp: 0.938 ± 0.788
TrpGlu: 0.938 ± 1.07
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 2.814 ± 2.034
TrpLys: 2.814 ± 1.482
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.938 ± 0.677
TrpPro: 0.0 ± 0.0
TrpGln: 0.938 ± 0.677
TrpArg: 0.0 ± 0.0
TrpSer: 0.938 ± 0.788
TrpThr: 2.814 ± 1.306
TrpVal: 0.938 ± 0.902
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.938 ± 0.788
TyrCys: 1.876 ± 1.05
TyrAsp: 1.876 ± 1.111
TyrGlu: 2.814 ± 1.366
TyrPhe: 2.814 ± 0.905
TyrGly: 2.814 ± 1.057
TyrHis: 1.876 ± 0.84
TyrIle: 2.814 ± 1.306
TyrLys: 3.752 ± 1.245
TyrLeu: 2.814 ± 1.369
TyrMet: 1.876 ± 1.442
TyrAsn: 3.752 ± 1.475
TyrPro: 3.752 ± 1.364
TyrGln: 2.814 ± 2.364
TyrArg: 0.0 ± 0.0
TyrSer: 3.752 ± 3.043
TyrThr: 2.814 ± 1.406
TyrVal: 0.938 ± 0.788
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.938 ± 1.07
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1067 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
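A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions; the page states only that 100 bootstrap replicates at the protein level were used.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)  # assumed error measure


if __name__ == "__main__":
    toy = ["MKKLA", "AKK", "GGSGG"]  # hypothetical toy proteome
    print(round(pair_permille(toy, "KK"), 3), "±",
          round(bootstrap_error(toy, "KK"), 3))
```

With only six proteins in the proteome, replicates can differ considerably from one another, which may explain why some errors in the table are comparable in size to the values themselves.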

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski