Amino acid dipeptide frequency for Ixeridium yellow mottle virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
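The counting described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the method used by the database: the function names and toy sequences are hypothetical, pairs are counted as overlapping windows within each protein, and frequencies are normalized by the total number of pairs counted. The source does not state its exact normalization.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
EXPECTED = 1000.0 / 400               # 2.5 per mille if all 400 pairs were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq):
    """Classify a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"      # shown in red on the page
    if freq < EXPECTED:
        return "under"     # shown in blue on the page
    return "expected"


toy = ["MKKLA", "AKKXG"]  # made-up sequences, not from this proteome
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), label(value))
```

With the values from the table, AlaAla (10.74‰) would be labelled "over" and AlaCys (1.193‰) "under".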

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
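As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


ala_ala = 10.74  # AlaAla value from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```

So the AlaAla entry of 10.74‰ corresponds to a fraction of 0.01074, or 1.074%, and the uniform expectation of 0.25% corresponds to 2.5‰.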

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.74 ± 2.231
AlaCys: 1.193 ± 0.571
AlaAsp: 2.387 ± 0.662
AlaGlu: 5.37 ± 2.265
AlaPhe: 1.79 ± 0.806
AlaGly: 8.353 ± 3.359
AlaHis: 4.177 ± 1.177
AlaIle: 5.967 ± 0.669
AlaLys: 4.177 ± 1.266
AlaLeu: 9.547 ± 1.129
AlaMet: 4.773 ± 0.932
AlaAsn: 2.983 ± 0.445
AlaPro: 5.967 ± 0.987
AlaGln: 2.387 ± 0.77
AlaArg: 4.773 ± 1.308
AlaSer: 3.58 ± 0.187
AlaThr: 11.337 ± 2.075
AlaVal: 4.773 ± 1.308
AlaTrp: 1.193 ± 0.571
AlaTyr: 1.79 ± 0.626
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.193 ± 0.571
CysGlu: 1.79 ± 0.686
CysPhe: 0.597 ± 0.835
CysGly: 4.177 ± 1.941
CysHis: 0.0 ± 0.0
CysIle: 0.597 ± 0.835
CysLys: 0.597 ± 0.657
CysLeu: 1.193 ± 0.586
CysMet: 0.597 ± 0.586
CysAsn: 1.193 ± 0.571
CysPro: 1.79 ± 0.806
CysGln: 1.79 ± 0.626
CysArg: 3.58 ± 0.916
CysSer: 0.0 ± 0.0
CysThr: 0.597 ± 0.657
CysVal: 3.58 ± 1.523
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.967 ± 2.297
AspCys: 2.387 ± 0.738
AspAsp: 2.387 ± 1.537
AspGlu: 1.193 ± 0.571
AspPhe: 0.597 ± 0.341
AspGly: 3.58 ± 1.112
AspHis: 1.193 ± 1.314
AspIle: 0.597 ± 0.341
AspLys: 0.597 ± 0.835
AspLeu: 2.387 ± 1.492
AspMet: 3.58 ± 0.771
AspAsn: 1.79 ± 1.023
AspPro: 3.58 ± 0.771
AspGln: 1.193 ± 0.586
AspArg: 1.193 ± 0.571
AspSer: 2.387 ± 0.738
AspThr: 1.79 ± 1.198
AspVal: 2.387 ± 0.529
AspTrp: 0.0 ± 0.0
AspTyr: 0.597 ± 0.341
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.547 ± 3.256
GluCys: 0.597 ± 0.341
GluAsp: 1.79 ± 0.626
GluGlu: 8.353 ± 2.367
GluPhe: 1.193 ± 0.682
GluGly: 2.983 ± 1.172
GluHis: 0.597 ± 0.341
GluIle: 3.58 ± 1.251
GluLys: 2.983 ± 1.214
GluLeu: 5.37 ± 1.364
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 5.967 ± 0.623
GluGln: 4.773 ± 0.864
GluArg: 4.177 ± 0.803
GluSer: 2.983 ± 0.729
GluThr: 4.177 ± 1.702
GluVal: 0.597 ± 0.835
GluTrp: 2.983 ± 1.172
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.983 ± 1.148
PheCys: 1.193 ± 0.571
PheAsp: 2.387 ± 0.93
PheGlu: 1.193 ± 0.682
PhePhe: 0.597 ± 0.341
PheGly: 2.983 ± 1.104
PheHis: 2.387 ± 0.83
PheIle: 2.387 ± 0.987
PheLys: 0.597 ± 0.341
PheLeu: 1.193 ± 0.746
PheMet: 0.0 ± 0.0
PheAsn: 3.58 ± 1.008
PhePro: 0.0 ± 0.0
PheGln: 0.597 ± 0.341
PheArg: 1.193 ± 0.746
PheSer: 1.193 ± 0.746
PheThr: 2.983 ± 1.104
PheVal: 1.193 ± 1.67
PheTrp: 0.0 ± 0.0
PheTyr: 0.597 ± 0.341
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.16 ± 3.155
GlyCys: 0.597 ± 0.835
GlyAsp: 4.177 ± 0.323
GlyGlu: 5.37 ± 1.349
GlyPhe: 2.387 ± 0.83
GlyGly: 5.37 ± 1.883
GlyHis: 1.79 ± 1.198
GlyIle: 4.177 ± 1.828
GlyLys: 1.79 ± 1.023
GlyLeu: 4.177 ± 1.049
GlyMet: 1.79 ± 0.806
GlyAsn: 4.773 ± 0.422
GlyPro: 5.967 ± 2.131
GlyGln: 1.193 ± 1.0
GlyArg: 5.967 ± 0.89
GlySer: 3.58 ± 0.916
GlyThr: 8.353 ± 2.932
GlyVal: 4.177 ± 1.675
GlyTrp: 1.193 ± 0.571
GlyTyr: 5.967 ± 2.152
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.983 ± 1.161
HisCys: 0.597 ± 0.657
HisAsp: 0.0 ± 0.0
HisGlu: 2.387 ± 0.83
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.193 ± 0.682
HisMet: 0.597 ± 0.341
HisAsn: 2.387 ± 1.272
HisPro: 2.387 ± 1.172
HisGln: 1.193 ± 0.571
HisArg: 4.177 ± 1.416
HisSer: 2.387 ± 0.662
HisThr: 0.0 ± 0.0
HisVal: 2.387 ± 1.492
HisTrp: 0.597 ± 0.341
HisTyr: 1.79 ± 0.686
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.177 ± 1.43
IleCys: 0.0 ± 0.0
IleAsp: 0.0 ± 0.0
IleGlu: 4.773 ± 1.607
IlePhe: 1.193 ± 0.682
IleGly: 2.387 ± 1.172
IleHis: 0.0 ± 0.0
IleIle: 1.79 ± 0.806
IleLys: 3.58 ± 1.408
IleLeu: 1.79 ± 0.686
IleMet: 0.0 ± 0.0
IleAsn: 1.79 ± 1.023
IlePro: 6.563 ± 1.691
IleGln: 1.79 ± 1.023
IleArg: 2.983 ± 0.815
IleSer: 1.193 ± 0.746
IleThr: 2.983 ± 2.28
IleVal: 2.387 ± 1.492
IleTrp: 0.0 ± 0.0
IleTyr: 1.193 ± 0.571
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.387 ± 1.492
LysCys: 0.0 ± 0.0
LysAsp: 1.193 ± 0.682
LysGlu: 0.597 ± 0.835
LysPhe: 0.597 ± 0.341
LysGly: 4.177 ± 1.814
LysHis: 1.79 ± 0.698
LysIle: 1.79 ± 0.626
LysLys: 2.387 ± 1.364
LysLeu: 2.983 ± 0.815
LysMet: 1.193 ± 0.571
LysAsn: 1.193 ± 0.746
LysPro: 2.983 ± 1.214
LysGln: 1.193 ± 0.586
LysArg: 2.387 ± 1.841
LysSer: 0.597 ± 0.657
LysThr: 2.387 ± 0.93
LysVal: 4.177 ± 1.43
LysTrp: 1.193 ± 0.682
LysTyr: 0.597 ± 0.341
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.967 ± 1.242
LeuCys: 1.193 ± 0.586
LeuAsp: 4.177 ± 0.323
LeuGlu: 5.37 ± 1.924
LeuPhe: 4.773 ± 1.796
LeuGly: 8.353 ± 1.592
LeuHis: 0.0 ± 0.0
LeuIle: 1.193 ± 0.586
LeuLys: 3.58 ± 1.161
LeuLeu: 10.143 ± 4.367
LeuMet: 2.983 ± 1.383
LeuAsn: 4.177 ± 1.168
LeuPro: 8.353 ± 3.116
LeuGln: 2.983 ± 1.257
LeuArg: 3.58 ± 0.187
LeuSer: 5.967 ± 0.535
LeuThr: 3.58 ± 0.926
LeuVal: 5.967 ± 1.833
LeuTrp: 2.387 ± 1.537
LeuTyr: 2.983 ± 1.705
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.193 ± 0.571
MetCys: 0.597 ± 0.835
MetAsp: 1.79 ± 1.023
MetGlu: 2.387 ± 0.83
MetPhe: 0.0 ± 0.0
MetGly: 2.387 ± 0.529
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.597 ± 0.341
MetLeu: 0.0 ± 0.0
MetMet: 0.597 ± 0.341
MetAsn: 1.193 ± 0.571
MetPro: 0.0 ± 0.0
MetGln: 1.193 ± 0.682
MetArg: 1.79 ± 1.546
MetSer: 2.387 ± 0.987
MetThr: 2.983 ± 0.729
MetVal: 5.37 ± 1.273
MetTrp: 1.193 ± 0.571
MetTyr: 0.597 ± 0.341
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.177 ± 0.323
AsnCys: 0.597 ± 0.341
AsnAsp: 0.597 ± 0.835
AsnGlu: 1.79 ± 0.626
AsnPhe: 1.193 ± 0.682
AsnGly: 1.79 ± 1.546
AsnHis: 1.193 ± 0.586
AsnIle: 1.193 ± 0.746
AsnLys: 0.597 ± 0.341
AsnLeu: 2.983 ± 1.104
AsnMet: 0.0 ± 0.0
AsnAsn: 2.983 ± 0.729
AsnPro: 2.983 ± 1.161
AsnGln: 0.597 ± 0.835
AsnArg: 1.79 ± 0.806
AsnSer: 2.983 ± 0.567
AsnThr: 0.597 ± 0.835
AsnVal: 6.563 ± 1.99
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.597 ± 0.835
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.37 ± 2.442
ProCys: 2.983 ± 1.148
ProAsp: 1.79 ± 0.81
ProGlu: 2.983 ± 1.141
ProPhe: 1.193 ± 0.571
ProGly: 4.773 ± 0.997
ProHis: 3.58 ± 1.161
ProIle: 2.983 ± 1.412
ProLys: 1.193 ± 1.314
ProLeu: 10.74 ± 1.143
ProMet: 0.0 ± 0.0
ProAsn: 1.193 ± 1.67
ProPro: 2.983 ± 1.141
ProGln: 5.967 ± 1.934
ProArg: 6.563 ± 1.316
ProSer: 3.58 ± 1.112
ProThr: 7.16 ± 2.496
ProVal: 9.547 ± 0.844
ProTrp: 2.387 ± 0.83
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.773 ± 0.422
GlnCys: 0.597 ± 0.835
GlnAsp: 0.597 ± 0.657
GlnGlu: 0.597 ± 0.835
GlnPhe: 0.597 ± 0.835
GlnGly: 0.597 ± 0.657
GlnHis: 1.193 ± 0.586
GlnIle: 1.193 ± 0.746
GlnLys: 0.597 ± 0.341
GlnLeu: 7.757 ± 2.986
GlnMet: 0.0 ± 0.0
GlnAsn: 0.0 ± 0.0
GlnPro: 5.37 ± 0.758
GlnGln: 2.387 ± 0.529
GlnArg: 2.983 ± 1.416
GlnSer: 0.597 ± 0.657
GlnThr: 2.983 ± 0.567
GlnVal: 2.387 ± 0.83
GlnTrp: 0.597 ± 0.341
GlnTyr: 0.597 ± 0.341
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.95 ± 1.833
ArgCys: 2.387 ± 0.529
ArgAsp: 3.58 ± 1.104
ArgGlu: 2.983 ± 1.416
ArgPhe: 4.773 ± 1.66
ArgGly: 5.37 ± 1.309
ArgHis: 0.0 ± 0.0
ArgIle: 0.597 ± 0.341
ArgLys: 3.58 ± 3.294
ArgLeu: 4.773 ± 1.433
ArgMet: 2.983 ± 1.104
ArgAsn: 0.597 ± 0.835
ArgPro: 5.37 ± 1.642
ArgGln: 2.387 ± 1.492
ArgArg: 4.177 ± 2.026
ArgSer: 5.37 ± 1.769
ArgThr: 2.983 ± 0.815
ArgVal: 5.37 ± 0.97
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.387 ± 0.987
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.177 ± 1.62
SerCys: 1.193 ± 1.0
SerAsp: 2.387 ± 1.417
SerGlu: 2.983 ± 1.172
SerPhe: 1.193 ± 0.682
SerGly: 8.353 ± 0.773
SerHis: 2.983 ± 0.445
SerIle: 1.193 ± 0.746
SerLys: 1.193 ± 1.314
SerLeu: 5.967 ± 1.936
SerMet: 1.79 ± 0.626
SerAsn: 1.193 ± 1.314
SerPro: 2.983 ± 1.161
SerGln: 1.193 ± 1.0
SerArg: 2.387 ± 0.529
SerSer: 2.387 ± 0.662
SerThr: 3.58 ± 0.916
SerVal: 4.177 ± 1.189
SerTrp: 0.597 ± 0.341
SerTyr: 1.193 ± 0.682
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.773 ± 1.511
ThrCys: 0.597 ± 0.835
ThrAsp: 3.58 ± 0.809
ThrGlu: 1.193 ± 0.682
ThrPhe: 1.193 ± 0.682
ThrGly: 7.16 ± 0.976
ThrHis: 1.193 ± 0.586
ThrIle: 3.58 ± 0.916
ThrLys: 4.177 ± 1.43
ThrLeu: 7.757 ± 1.344
ThrMet: 2.983 ± 0.729
ThrAsn: 2.387 ± 0.662
ThrPro: 7.16 ± 1.506
ThrGln: 0.597 ± 0.341
ThrArg: 4.177 ± 0.966
ThrSer: 5.967 ± 1.354
ThrThr: 4.177 ± 1.168
ThrVal: 2.983 ± 0.947
ThrTrp: 0.0 ± 0.0
ThrTyr: 1.193 ± 0.571
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.95 ± 2.265
ValCys: 3.58 ± 1.161
ValAsp: 4.177 ± 1.43
ValGlu: 7.757 ± 2.27
ValPhe: 3.58 ± 0.187
ValGly: 4.773 ± 1.323
ValHis: 2.983 ± 0.445
ValIle: 5.37 ± 1.877
ValLys: 1.79 ± 1.023
ValLeu: 2.983 ± 1.237
ValMet: 0.0 ± 0.0
ValAsn: 1.79 ± 0.806
ValPro: 4.773 ± 1.873
ValGln: 1.193 ± 1.67
ValArg: 7.16 ± 2.502
ValSer: 4.177 ± 1.975
ValThr: 0.0 ± 0.0
ValVal: 4.177 ± 0.966
ValTrp: 0.0 ± 0.0
ValTyr: 3.58 ± 1.112
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.193 ± 0.571
TrpCys: 0.597 ± 0.341
TrpAsp: 0.597 ± 0.657
TrpGlu: 2.387 ± 0.529
TrpPhe: 0.597 ± 0.341
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.597 ± 0.835
TrpLys: 0.0 ± 0.0
TrpLeu: 1.79 ± 0.626
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.193 ± 0.571
TrpArg: 1.79 ± 0.698
TrpSer: 0.597 ± 0.341
TrpThr: 0.597 ± 0.341
TrpVal: 0.0 ± 0.0
TrpTrp: 0.597 ± 0.341
TrpTyr: 2.387 ± 1.142
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.79 ± 0.806
TyrCys: 2.387 ± 1.142
TyrAsp: 0.597 ± 0.341
TyrGlu: 1.193 ± 0.571
TyrPhe: 1.193 ± 0.682
TyrGly: 2.387 ± 0.662
TyrHis: 0.0 ± 0.0
TyrIle: 1.79 ± 0.626
TyrLys: 1.79 ± 1.023
TyrLeu: 2.983 ± 0.947
TyrMet: 1.79 ± 0.682
TyrAsn: 0.0 ± 0.0
TyrPro: 1.79 ± 0.626
TyrGln: 0.597 ± 0.341
TyrArg: 1.79 ± 1.023
TyrSer: 1.193 ± 0.746
TyrThr: 4.177 ± 1.107
TyrVal: 0.597 ± 0.341
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1677 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
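A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: it assumes that whole proteins are resampled with replacement 100 times, that the frequency is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The function names, the seed, the choice of population standard deviation, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation of the resampled estimates.
    return statistics.pstdev(estimates)


toy = ["MKKLA", "AKKAG", "GGGKK", "MLLAA"]  # made-up sequences
print(pair_per_mille(toy, "KK"), bootstrap_error(toy, "KK"))
```

With only 4 proteins, each resample can differ a lot from the original set, which is consistent with the relatively large errors in the table. A dipeptide that occurs in none of the proteins yields 0.0 in every resample, which matches the "0.0 ± 0.0" entries.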

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski