Amino acid dipeptide frequency for Tomato yellow leaf curl Sardinia virus (isolate Spain-2) (TYLCSV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
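As a rough illustration of the counting described above, the following minimal sketch computes dipeptide frequencies in per mille and compares them with the uniform expectation. It assumes one-letter sequences, overlapping pairs counted within each protein only, and a uniform 1/400 baseline. The function names and the toy sequences are hypothetical, and this is not necessarily how the database itself computes or colours the table.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code (append "X" to model Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: dipeptides are overlapping adjacent pairs, counted inside
    each sequence; no pair is formed across a sequence boundary.
    """
    counts = Counter()
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }

def uniform_expectation_permille(alphabet=STANDARD):
    """Expected frequency if every dipeptide were equally likely."""
    return 1000.0 / len(alphabet) ** 2

def classify(freq, expected):
    """Label a frequency relative to the expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"

# Toy input (hypothetical sequences, not the TYLCSV proteome).
toy = ["MAAKL", "AAKD"]
freqs = dipeptide_permille(toy)
expected = uniform_expectation_permille()  # 2.5 per mille for 400 dipeptides
print("AA", round(freqs["AA"], 3), classify(freqs["AA"], expected))
```

Passing `alphabet=STANDARD + "X"` gives the 441-dipeptide variant, for which the uniform expectation drops to 1000/441 per mille.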

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
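The unit conversion can be sketched as follows. The helper names are hypothetical; the example value is the AspVal entry from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

asp_val = 10.959  # AspVal entry from the table, in per mille
print(permille_to_fraction(asp_val))  # fraction of all dipeptides
print(permille_to_percent(asp_val))   # the same value in percent
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.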

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency (‰) ± error. Entries are grouped by first residue, in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa.

Ala
AlaAla: 3.653 ± 1.508
AlaCys: 0.913 ± 0.714
AlaAsp: 0.913 ± 1.115
AlaGlu: 0.913 ± 1.115
AlaPhe: 0.0 ± 0.0
AlaGly: 0.913 ± 0.648
AlaHis: 2.74 ± 1.164
AlaIle: 3.653 ± 2.029
AlaLys: 5.479 ± 1.537
AlaLeu: 4.566 ± 1.045
AlaMet: 0.913 ± 0.6
AlaAsn: 1.826 ± 1.167
AlaPro: 3.653 ± 1.231
AlaGln: 0.913 ± 0.648
AlaArg: 3.653 ± 2.029
AlaSer: 2.74 ± 1.37
AlaThr: 3.653 ± 2.284
AlaVal: 1.826 ± 1.192
AlaTrp: 0.913 ± 0.648
AlaTyr: 0.913 ± 0.648
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.913 ± 1.026
CysCys: 1.826 ± 2.23
CysAsp: 0.0 ± 0.0
CysGlu: 0.913 ± 0.714
CysPhe: 0.913 ± 1.025
CysGly: 1.826 ± 1.018
CysHis: 0.0 ± 0.0
CysIle: 1.826 ± 1.081
CysLys: 1.826 ± 0.722
CysLeu: 0.913 ± 0.714
CysMet: 0.913 ± 1.115
CysAsn: 0.913 ± 0.648
CysPro: 1.826 ± 2.23
CysGln: 0.0 ± 0.0
CysArg: 0.913 ± 0.648
CysSer: 3.653 ± 2.1
CysThr: 0.913 ± 0.714
CysVal: 0.913 ± 0.714
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.826 ± 0.722
AspCys: 1.826 ± 2.052
AspAsp: 3.653 ± 1.055
AspGlu: 2.74 ± 1.171
AspPhe: 0.913 ± 0.714
AspGly: 2.74 ± 1.945
AspHis: 0.0 ± 0.0
AspIle: 0.913 ± 0.714
AspLys: 0.913 ± 0.648
AspLeu: 9.132 ± 2.776
AspMet: 0.0 ± 0.0
AspAsn: 2.74 ± 1.983
AspPro: 2.74 ± 1.523
AspGln: 1.826 ± 1.018
AspArg: 1.826 ± 1.428
AspSer: 5.479 ± 1.724
AspThr: 0.0 ± 0.0
AspVal: 10.959 ± 1.857
AspTrp: 1.826 ± 1.018
AspTyr: 2.74 ± 1.164
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.653 ± 2.029
GluCys: 0.0 ± 0.0
GluAsp: 1.826 ± 1.018
GluGlu: 4.566 ± 1.865
GluPhe: 1.826 ± 1.167
GluGly: 3.653 ± 1.075
GluHis: 0.913 ± 0.648
GluIle: 1.826 ± 2.05
GluLys: 0.913 ± 0.648
GluLeu: 4.566 ± 2.008
GluMet: 0.913 ± 0.909
GluAsn: 4.566 ± 1.947
GluPro: 1.826 ± 1.265
GluGln: 4.566 ± 1.947
GluArg: 0.0 ± 0.0
GluSer: 0.0 ± 0.0
GluThr: 1.826 ± 1.167
GluVal: 0.913 ± 1.115
GluTrp: 1.826 ± 1.018
GluTyr: 1.826 ± 0.924
GluXaa: 0.0 ± 0.0

Phe
PheAla: 0.0 ± 0.0
PheCys: 0.913 ± 0.714
PheAsp: 4.566 ± 1.271
PheGlu: 2.74 ± 1.84
PhePhe: 3.653 ± 1.211
PheGly: 1.826 ± 1.428
PheHis: 3.653 ± 1.075
PheIle: 0.0 ± 0.0
PheLys: 3.653 ± 1.075
PheLeu: 6.393 ± 1.852
PheMet: 0.913 ± 0.648
PheAsn: 3.653 ± 1.791
PhePro: 1.826 ± 1.167
PheGln: 4.566 ± 2.068
PheArg: 3.653 ± 2.283
PheSer: 1.826 ± 1.377
PheThr: 3.653 ± 2.107
PheVal: 0.913 ± 0.648
PheTrp: 0.0 ± 0.0
PheTyr: 0.913 ± 0.714
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 0.913 ± 0.648
GlyCys: 1.826 ± 1.081
GlyAsp: 3.653 ± 1.075
GlyGlu: 1.826 ± 1.409
GlyPhe: 0.913 ± 1.026
GlyGly: 2.74 ± 1.171
GlyHis: 2.74 ± 1.164
GlyIle: 3.653 ± 1.339
GlyLys: 4.566 ± 1.834
GlyLeu: 0.913 ± 1.115
GlyMet: 0.913 ± 0.714
GlyAsn: 0.913 ± 0.909
GlyPro: 3.653 ± 1.443
GlyGln: 2.74 ± 1.171
GlyArg: 1.826 ± 1.297
GlySer: 1.826 ± 0.722
GlyThr: 2.74 ± 0.861
GlyVal: 4.566 ± 2.477
GlyTrp: 0.0 ± 0.0
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.826 ± 1.428
HisCys: 2.74 ± 2.166
HisAsp: 1.826 ± 1.384
HisGlu: 1.826 ± 0.924
HisPhe: 3.653 ± 1.726
HisGly: 1.826 ± 1.384
HisHis: 1.826 ± 2.052
HisIle: 1.826 ± 1.377
HisLys: 0.913 ± 1.025
HisLeu: 3.653 ± 2.036
HisMet: 0.913 ± 0.648
HisAsn: 3.653 ± 1.847
HisPro: 0.913 ± 0.648
HisGln: 3.653 ± 1.077
HisArg: 2.74 ± 0.861
HisSer: 0.0 ± 0.0
HisThr: 2.74 ± 2.142
HisVal: 1.826 ± 1.265
HisTrp: 0.913 ± 0.648
HisTyr: 1.826 ± 1.297
HisXaa: 0.0 ± 0.0

Ile
IleAla: 0.913 ± 1.026
IleCys: 0.913 ± 0.648
IleAsp: 3.653 ± 1.726
IleGlu: 1.826 ± 0.924
IlePhe: 2.74 ± 1.364
IleGly: 0.913 ± 1.025
IleHis: 2.74 ± 1.262
IleIle: 0.0 ± 0.0
IleLys: 9.132 ± 1.245
IleLeu: 2.74 ± 1.281
IleMet: 0.913 ± 1.115
IleAsn: 3.653 ± 1.877
IlePro: 1.826 ± 1.018
IleGln: 5.479 ± 3.054
IleArg: 5.479 ± 2.748
IleSer: 4.566 ± 2.375
IleThr: 4.566 ± 3.23
IleVal: 0.913 ± 0.648
IleTrp: 2.74 ± 1.446
IleTyr: 1.826 ± 1.428
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.74 ± 1.164
LysCys: 1.826 ± 0.924
LysAsp: 0.913 ± 0.648
LysGlu: 4.566 ± 2.366
LysPhe: 2.74 ± 1.281
LysGly: 0.913 ± 0.648
LysHis: 2.74 ± 1.171
LysIle: 4.566 ± 1.255
LysLys: 2.74 ± 0.861
LysLeu: 0.913 ± 0.714
LysMet: 0.0 ± 0.0
LysAsn: 4.566 ± 2.366
LysPro: 3.653 ± 2.14
LysGln: 0.913 ± 0.714
LysArg: 5.479 ± 2.445
LysSer: 5.479 ± 1.966
LysThr: 1.826 ± 1.167
LysVal: 2.74 ± 1.281
LysTrp: 0.0 ± 0.0
LysTyr: 6.393 ± 2.422
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 1.826 ± 1.167
LeuCys: 1.826 ± 1.297
LeuAsp: 5.479 ± 3.054
LeuGlu: 4.566 ± 1.865
LeuPhe: 1.826 ± 0.924
LeuGly: 3.653 ± 1.077
LeuHis: 2.74 ± 1.945
LeuIle: 3.653 ± 2.2
LeuLys: 3.653 ± 1.443
LeuLeu: 6.393 ± 1.867
LeuMet: 2.74 ± 1.233
LeuAsn: 7.306 ± 3.101
LeuPro: 0.913 ± 1.026
LeuGln: 5.479 ± 2.081
LeuArg: 6.393 ± 3.44
LeuSer: 4.566 ± 1.732
LeuThr: 4.566 ± 2.175
LeuVal: 2.74 ± 1.144
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.653 ± 2.31
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.826 ± 0.722
MetCys: 0.0 ± 0.0
MetAsp: 4.566 ± 1.409
MetGlu: 0.0 ± 0.0
MetPhe: 3.653 ± 1.619
MetGly: 1.826 ± 1.018
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.913 ± 0.714
MetLeu: 1.826 ± 1.167
MetMet: 0.0 ± 0.803
MetAsn: 0.0 ± 0.0
MetPro: 0.913 ± 0.714
MetGln: 0.913 ± 1.026
MetArg: 0.913 ± 0.714
MetSer: 0.913 ± 0.714
MetThr: 0.913 ± 0.909
MetVal: 0.913 ± 0.648
MetTrp: 1.826 ± 1.167
MetTyr: 2.74 ± 2.142
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.566 ± 2.366
AsnCys: 0.913 ± 0.648
AsnAsp: 1.826 ± 0.722
AsnGlu: 2.74 ± 1.37
AsnPhe: 1.826 ± 1.192
AsnGly: 0.913 ± 0.909
AsnHis: 4.566 ± 2.282
AsnIle: 4.566 ± 0.976
AsnLys: 1.826 ± 1.018
AsnLeu: 4.566 ± 2.994
AsnMet: 2.74 ± 1.246
AsnAsn: 3.653 ± 1.847
AsnPro: 2.74 ± 0.902
AsnGln: 6.393 ± 1.804
AsnArg: 1.826 ± 1.167
AsnSer: 2.74 ± 1.256
AsnThr: 4.566 ± 1.045
AsnVal: 4.566 ± 2.093
AsnTrp: 0.913 ± 0.648
AsnTyr: 0.913 ± 0.648
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.826 ± 1.167
ProCys: 1.826 ± 1.265
ProAsp: 3.653 ± 1.174
ProGlu: 0.0 ± 0.0
ProPhe: 2.74 ± 0.902
ProGly: 0.913 ± 0.648
ProHis: 5.479 ± 2.383
ProIle: 2.74 ± 1.939
ProLys: 2.74 ± 1.171
ProLeu: 3.653 ± 1.077
ProMet: 3.653 ± 1.691
ProAsn: 1.826 ± 1.167
ProPro: 0.913 ± 0.648
ProGln: 2.74 ± 1.293
ProArg: 3.653 ± 1.174
ProSer: 2.74 ± 1.517
ProThr: 8.219 ± 2.103
ProVal: 2.74 ± 1.144
ProTrp: 0.913 ± 0.648
ProTyr: 2.74 ± 1.281
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.566 ± 2.305
GlnCys: 0.913 ± 0.714
GlnAsp: 1.826 ± 1.018
GlnGlu: 1.826 ± 0.722
GlnPhe: 1.826 ± 1.018
GlnGly: 2.74 ± 0.935
GlnHis: 2.74 ± 1.783
GlnIle: 5.479 ± 2.194
GlnLys: 1.826 ± 1.167
GlnLeu: 5.479 ± 2.044
GlnMet: 0.913 ± 1.026
GlnAsn: 1.826 ± 1.265
GlnPro: 9.132 ± 2.602
GlnGln: 3.653 ± 2.107
GlnArg: 1.826 ± 0.722
GlnSer: 4.566 ± 0.976
GlnThr: 3.653 ± 1.877
GlnVal: 7.306 ± 1.883
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.74 ± 1.439
ArgCys: 1.826 ± 1.265
ArgAsp: 7.306 ± 2.21
ArgGlu: 1.826 ± 1.018
ArgPhe: 5.479 ± 1.803
ArgGly: 2.74 ± 0.861
ArgHis: 1.826 ± 1.362
ArgIle: 1.826 ± 1.192
ArgLys: 4.566 ± 1.947
ArgLeu: 1.826 ± 1.377
ArgMet: 1.826 ± 1.428
ArgAsn: 0.0 ± 0.0
ArgPro: 3.653 ± 1.443
ArgGln: 4.566 ± 1.987
ArgArg: 7.306 ± 3.155
ArgSer: 5.479 ± 1.742
ArgThr: 4.566 ± 1.841
ArgVal: 3.653 ± 1.7
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.826 ± 1.265
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.826 ± 1.297
SerCys: 0.0 ± 0.0
SerAsp: 2.74 ± 0.861
SerGlu: 2.74 ± 1.945
SerPhe: 3.653 ± 1.512
SerGly: 2.74 ± 1.676
SerHis: 0.913 ± 1.026
SerIle: 4.566 ± 1.76
SerLys: 4.566 ± 1.999
SerLeu: 3.653 ± 1.341
SerMet: 0.0 ± 0.0
SerAsn: 5.479 ± 1.966
SerPro: 9.132 ± 2.206
SerGln: 4.566 ± 2.922
SerArg: 5.479 ± 3.441
SerSer: 11.872 ± 3.457
SerThr: 5.479 ± 3.66
SerVal: 1.826 ± 2.23
SerTrp: 0.913 ± 0.714
SerTyr: 2.74 ± 0.861
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.653 ± 1.129
ThrCys: 0.913 ± 0.909
ThrAsp: 1.826 ± 1.817
ThrGlu: 2.74 ± 1.371
ThrPhe: 4.566 ± 2.009
ThrGly: 5.479 ± 2.029
ThrHis: 3.653 ± 2.111
ThrIle: 4.566 ± 1.959
ThrLys: 0.913 ± 0.648
ThrLeu: 4.566 ± 1.157
ThrMet: 0.913 ± 0.648
ThrAsn: 7.306 ± 0.75
ThrPro: 2.74 ± 1.445
ThrGln: 1.826 ± 1.377
ThrArg: 3.653 ± 1.129
ThrSer: 6.393 ± 1.956
ThrThr: 1.826 ± 1.492
ThrVal: 3.653 ± 2.111
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.826 ± 1.265
ThrXaa: 0.0 ± 0.0

Val
ValAla: 0.913 ± 0.648
ValCys: 0.0 ± 0.0
ValAsp: 2.74 ± 1.262
ValGlu: 0.913 ± 1.115
ValPhe: 3.653 ± 1.791
ValGly: 1.826 ± 1.428
ValHis: 1.826 ± 1.384
ValIle: 6.393 ± 2.587
ValLys: 3.653 ± 1.508
ValLeu: 2.74 ± 1.256
ValMet: 2.74 ± 1.281
ValAsn: 1.826 ± 1.035
ValPro: 2.74 ± 1.171
ValGln: 4.566 ± 2.429
ValArg: 2.74 ± 1.517
ValSer: 8.219 ± 2.272
ValThr: 2.74 ± 1.281
ValVal: 2.74 ± 2.142
ValTrp: 0.0 ± 0.0
ValTyr: 5.479 ± 1.949
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.826 ± 1.297
TrpCys: 0.0 ± 0.0
TrpAsp: 0.913 ± 1.115
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.913 ± 0.648
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.913 ± 0.714
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.913 ± 0.648
TrpArg: 1.826 ± 1.018
TrpSer: 0.913 ± 1.026
TrpThr: 1.826 ± 1.192
TrpVal: 0.913 ± 0.648
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.826 ± 1.018
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.74 ± 1.144
TyrCys: 0.0 ± 0.0
TyrAsp: 0.913 ± 0.714
TyrGlu: 2.74 ± 1.725
TyrPhe: 2.74 ± 0.902
TyrGly: 1.826 ± 0.722
TyrHis: 0.0 ± 0.0
TyrIle: 5.479 ± 1.261
TyrLys: 0.913 ± 1.025
TyrLeu: 5.479 ± 1.536
TyrMet: 1.826 ± 1.126
TyrAsn: 3.653 ± 1.791
TyrPro: 0.913 ± 0.909
TyrGln: 1.826 ± 1.265
TyrArg: 3.653 ± 2.856
TyrSer: 0.913 ± 0.648
TyrThr: 2.74 ± 1.439
TyrVal: 1.826 ± 0.924
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.913 ± 1.026
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1096 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
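A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the population standard deviation of the replicates. The function names, the choice of standard deviation, and the toy sequences are all hypothetical and may differ from what the database actually does.

```python
import random
import statistics

def dipeptide_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide across the given sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        # Overlapping adjacent pairs, counted within each protein only.
        for a, b in zip(seq, seq[1:]):
            total += 1
            if a + b == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement.
    Assumption: the reported error is the population standard deviation.
    """
    rng = random.Random(seed)
    replicates = [
        dipeptide_permille(rng.choices(sequences, k=len(sequences)), pair)
        for _ in range(n_resamples)
    ]
    return statistics.pstdev(replicates)

# Toy proteome (hypothetical sequences, not the TYLCSV proteins).
toy = ["MAAKLSS", "AAKDV", "MSSPTV"]
print(dipeptide_permille(toy, "SS"), bootstrap_error(toy, "SS"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, so the spread reflects how much the estimate depends on which proteins are in the set.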

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski