Amino acid dipeptide frequency for Sweet potato vein clearing virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
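As an illustration of what a dipeptide frequency is, the following minimal Python sketch counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the normalization (dividing by the total number of counted pairs) are assumptions made for this example; the page does not state the exact procedure used to build the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs.

    Assumption: pairs are counted with a sliding window of length 2 inside
    each protein, and never across two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter amino acid codes).
freqs = dipeptide_per_mille(["MKKLE", "AKK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The toy input contains six overlapping pairs in total, two of which are `KK`, so `KK` accounts for one third of all pairs.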

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
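The expected value under a uniform model can be worked out directly: 1000/400 = 2.5‰ for 20 amino acids, or 1000/441 ≈ 2.268‰ when Xaa is included. The sketch below compares an observed value with that expectation. The function names and the simple greater-than/less-than rule are assumptions for illustration; the page does not state which threshold it uses for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation in per mille for one dipeptide."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table on this page.
print(classify(17.234))  # LysIle
print(classify(0.821))   # AlaAla
```

With the 20-letter alphabet, LysIle (17.234‰) lies above the 2.5‰ expectation and AlaAla (0.821‰) lies below it.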

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions, or by 0.1 to obtain percent.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.821 ± 0.647
AlaCys: 0.41 ± 0.422
AlaAsp: 1.641 ± 0.88
AlaGlu: 3.283 ± 1.394
AlaPhe: 2.052 ± 0.995
AlaGly: 0.41 ± 0.304
AlaHis: 0.0 ± 0.0
AlaIle: 2.872 ± 0.919
AlaLys: 4.924 ± 1.401
AlaLeu: 1.231 ± 0.758
AlaMet: 0.821 ± 0.687
AlaAsn: 2.052 ± 1.014
AlaPro: 1.231 ± 0.663
AlaGln: 0.41 ± 0.626
AlaArg: 1.231 ± 0.579
AlaSer: 1.641 ± 0.845
AlaThr: 1.641 ± 0.92
AlaVal: 2.052 ± 1.538
AlaTrp: 0.41 ± 0.315
AlaTyr: 0.41 ± 0.422
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.41 ± 0.315
CysCys: 0.821 ± 0.63
CysAsp: 0.41 ± 0.422
CysGlu: 1.641 ± 0.929
CysPhe: 1.231 ± 0.698
CysGly: 1.231 ± 0.392
CysHis: 0.0 ± 0.0
CysIle: 0.41 ± 0.304
CysLys: 2.872 ± 0.652
CysLeu: 2.462 ± 1.532
CysMet: 0.0 ± 0.0
CysAsn: 0.821 ± 0.63
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.821 ± 0.39
CysSer: 0.821 ± 0.39
CysThr: 0.41 ± 0.473
CysVal: 0.821 ± 0.608
CysTrp: 0.0 ± 0.0
CysTyr: 2.462 ± 0.711
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.052 ± 1.052
AspCys: 1.641 ± 0.532
AspAsp: 3.283 ± 1.384
AspGlu: 4.924 ± 1.178
AspPhe: 2.872 ± 0.759
AspGly: 0.41 ± 0.473
AspHis: 1.231 ± 0.771
AspIle: 5.334 ± 1.931
AspLys: 7.386 ± 1.724
AspLeu: 3.283 ± 1.145
AspMet: 1.641 ± 0.762
AspAsn: 6.565 ± 1.681
AspPro: 0.821 ± 0.858
AspGln: 1.231 ± 0.698
AspArg: 4.103 ± 1.929
AspSer: 4.514 ± 1.379
AspThr: 2.462 ± 1.052
AspVal: 1.641 ± 0.744
AspTrp: 0.821 ± 0.39
AspTyr: 1.641 ± 0.691
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.052 ± 0.784
GluCys: 0.821 ± 0.606
GluAsp: 4.103 ± 1.249
GluGlu: 4.924 ± 1.037
GluPhe: 4.103 ± 1.044
GluGly: 2.872 ± 1.298
GluHis: 1.231 ± 0.361
GluIle: 9.027 ± 0.856
GluLys: 7.386 ± 1.944
GluLeu: 12.31 ± 2.044
GluMet: 1.641 ± 1.192
GluAsn: 5.745 ± 2.18
GluPro: 2.462 ± 0.631
GluGln: 3.693 ± 1.36
GluArg: 1.641 ± 0.56
GluSer: 4.924 ± 1.888
GluThr: 4.514 ± 1.278
GluVal: 4.924 ± 1.176
GluTrp: 0.41 ± 0.304
GluTyr: 4.924 ± 0.897
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.821 ± 0.47
PheCys: 0.0 ± 0.0
PheAsp: 3.283 ± 1.062
PheGlu: 3.283 ± 1.141
PhePhe: 0.41 ± 0.473
PheGly: 1.641 ± 0.532
PheHis: 0.0 ± 0.0
PheIle: 4.103 ± 0.832
PheLys: 4.514 ± 1.024
PheLeu: 4.514 ± 1.242
PheMet: 0.41 ± 0.622
PheAsn: 2.872 ± 0.888
PhePro: 0.41 ± 0.304
PheGln: 1.641 ± 1.379
PheArg: 0.821 ± 0.843
PheSer: 4.924 ± 1.433
PheThr: 0.0 ± 0.0
PheVal: 0.821 ± 0.722
PheTrp: 1.231 ± 0.694
PheTyr: 1.641 ± 0.876
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.641 ± 0.522
GlyCys: 0.41 ± 0.315
GlyAsp: 0.821 ± 0.419
GlyGlu: 0.821 ± 0.63
GlyPhe: 2.872 ± 1.007
GlyGly: 1.231 ± 0.634
GlyHis: 1.641 ± 0.92
GlyIle: 2.462 ± 1.303
GlyLys: 3.693 ± 1.173
GlyLeu: 0.41 ± 0.304
GlyMet: 2.052 ± 1.069
GlyAsn: 2.052 ± 1.168
GlyPro: 0.0 ± 0.0
GlyGln: 2.052 ± 1.29
GlyArg: 0.821 ± 0.47
GlySer: 2.462 ± 0.647
GlyThr: 2.052 ± 1.132
GlyVal: 2.052 ± 1.229
GlyTrp: 0.0 ± 0.0
GlyTyr: 3.283 ± 0.721
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.41 ± 0.304
HisAsp: 0.41 ± 0.473
HisGlu: 0.41 ± 0.304
HisPhe: 1.231 ± 0.601
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 3.283 ± 1.354
HisLys: 0.821 ± 0.419
HisLeu: 1.231 ± 0.634
HisMet: 0.0 ± 0.0
HisAsn: 0.821 ± 0.443
HisPro: 0.41 ± 0.422
HisGln: 0.41 ± 0.304
HisArg: 0.821 ± 0.419
HisSer: 0.821 ± 0.607
HisThr: 0.0 ± 0.0
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.641 ± 0.592
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.693 ± 1.487
IleCys: 2.052 ± 0.726
IleAsp: 8.617 ± 2.046
IleGlu: 9.027 ± 1.762
IlePhe: 3.283 ± 2.036
IleGly: 2.872 ± 0.495
IleHis: 0.821 ± 0.47
IleIle: 6.155 ± 1.397
IleLys: 10.669 ± 2.166
IleLeu: 7.796 ± 1.935
IleMet: 0.821 ± 0.705
IleAsn: 4.514 ± 1.131
IlePro: 4.103 ± 1.411
IleGln: 3.283 ± 1.293
IleArg: 4.924 ± 1.132
IleSer: 7.796 ± 1.865
IleThr: 2.462 ± 1.416
IleVal: 8.207 ± 2.928
IleTrp: 0.821 ± 0.47
IleTyr: 1.641 ± 0.777
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.283 ± 1.41
LysCys: 2.462 ± 1.169
LysAsp: 9.438 ± 1.69
LysGlu: 11.49 ± 0.993
LysPhe: 4.103 ± 1.102
LysGly: 4.103 ± 1.363
LysHis: 3.283 ± 1.045
LysIle: 17.234 ± 1.294
LysLys: 9.848 ± 2.147
LysLeu: 10.669 ± 2.444
LysMet: 4.103 ± 1.413
LysAsn: 8.207 ± 2.201
LysPro: 3.283 ± 1.271
LysGln: 2.052 ± 0.625
LysArg: 4.514 ± 1.689
LysSer: 8.207 ± 1.409
LysThr: 4.103 ± 1.404
LysVal: 1.231 ± 0.532
LysTrp: 0.0 ± 0.0
LysTyr: 4.514 ± 1.718
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.283 ± 2.301
LeuCys: 1.641 ± 0.8
LeuAsp: 4.514 ± 1.764
LeuGlu: 9.848 ± 4.0
LeuPhe: 2.872 ± 1.313
LeuGly: 3.283 ± 0.99
LeuHis: 0.41 ± 0.304
LeuIle: 6.976 ± 1.697
LeuLys: 7.386 ± 1.378
LeuLeu: 6.976 ± 1.466
LeuMet: 2.052 ± 1.408
LeuAsn: 6.976 ± 1.747
LeuPro: 4.103 ± 1.527
LeuGln: 2.462 ± 0.686
LeuArg: 5.745 ± 1.497
LeuSer: 5.334 ± 1.657
LeuThr: 4.924 ± 1.189
LeuVal: 6.976 ± 1.993
LeuTrp: 1.231 ± 0.606
LeuTyr: 2.462 ± 1.132
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.821 ± 0.987
MetCys: 0.821 ± 0.606
MetAsp: 1.641 ± 0.56
MetGlu: 2.052 ± 1.463
MetPhe: 0.821 ± 0.63
MetGly: 0.41 ± 0.473
MetHis: 0.41 ± 0.622
MetIle: 2.462 ± 0.728
MetLys: 1.231 ± 0.663
MetLeu: 2.052 ± 1.149
MetMet: 0.41 ± 0.626
MetAsn: 2.052 ± 0.831
MetPro: 0.41 ± 0.622
MetGln: 0.41 ± 0.304
MetArg: 0.821 ± 0.682
MetSer: 1.231 ± 0.799
MetThr: 2.462 ± 1.283
MetVal: 0.41 ± 0.304
MetTrp: 0.0 ± 0.0
MetTyr: 1.231 ± 0.698
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.052 ± 1.301
AsnCys: 2.462 ± 1.167
AsnAsp: 1.231 ± 0.584
AsnGlu: 5.745 ± 1.522
AsnPhe: 1.641 ± 0.9
AsnGly: 0.41 ± 0.315
AsnHis: 0.41 ± 0.304
AsnIle: 5.745 ± 1.515
AsnLys: 8.207 ± 1.47
AsnLeu: 6.155 ± 1.704
AsnMet: 0.821 ± 0.414
AsnAsn: 4.103 ± 1.354
AsnPro: 4.103 ± 1.304
AsnGln: 5.745 ± 0.722
AsnArg: 5.334 ± 1.605
AsnSer: 4.924 ± 1.403
AsnThr: 3.693 ± 1.334
AsnVal: 4.103 ± 0.671
AsnTrp: 1.641 ± 0.696
AsnTyr: 3.283 ± 0.876
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.821 ± 0.47
ProCys: 0.41 ± 0.304
ProAsp: 2.052 ± 1.073
ProGlu: 2.872 ± 2.175
ProPhe: 1.641 ± 0.795
ProGly: 1.641 ± 0.532
ProHis: 0.41 ± 0.304
ProIle: 2.872 ± 1.498
ProLys: 3.693 ± 1.338
ProLeu: 2.872 ± 0.732
ProMet: 0.41 ± 0.304
ProAsn: 2.052 ± 0.951
ProPro: 0.41 ± 0.315
ProGln: 0.821 ± 0.47
ProArg: 0.821 ± 0.414
ProSer: 1.641 ± 0.951
ProThr: 0.821 ± 0.516
ProVal: 2.052 ± 0.986
ProTrp: 0.0 ± 0.0
ProTyr: 1.641 ± 0.507
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.821 ± 0.516
GlnCys: 0.821 ± 0.607
GlnAsp: 2.462 ± 0.75
GlnGlu: 1.231 ± 0.784
GlnPhe: 1.231 ± 0.598
GlnGly: 2.462 ± 1.034
GlnHis: 0.0 ± 0.0
GlnIle: 3.283 ± 0.944
GlnLys: 3.693 ± 1.142
GlnLeu: 5.334 ± 1.716
GlnMet: 1.231 ± 1.233
GlnAsn: 2.052 ± 1.456
GlnPro: 0.821 ± 0.946
GlnGln: 1.231 ± 0.915
GlnArg: 1.231 ± 0.668
GlnSer: 1.231 ± 0.532
GlnThr: 1.231 ± 0.656
GlnVal: 3.693 ± 1.578
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.641 ± 1.161
ArgCys: 0.41 ± 0.473
ArgAsp: 1.641 ± 1.052
ArgGlu: 2.462 ± 0.622
ArgPhe: 2.052 ± 0.556
ArgGly: 1.231 ± 0.598
ArgHis: 0.0 ± 0.0
ArgIle: 3.283 ± 1.381
ArgLys: 6.565 ± 2.694
ArgLeu: 3.283 ± 0.671
ArgMet: 1.641 ± 0.759
ArgAsn: 4.103 ± 1.308
ArgPro: 1.231 ± 0.64
ArgGln: 0.821 ± 0.419
ArgArg: 1.231 ± 0.612
ArgSer: 3.283 ± 1.61
ArgThr: 2.872 ± 0.731
ArgVal: 2.052 ± 0.556
ArgTrp: 0.821 ± 0.485
ArgTyr: 4.103 ± 0.917
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.052 ± 0.556
SerCys: 0.41 ± 0.473
SerAsp: 4.514 ± 1.924
SerGlu: 9.027 ± 3.072
SerPhe: 2.052 ± 0.573
SerGly: 2.052 ± 0.775
SerHis: 0.41 ± 0.422
SerIle: 5.334 ± 2.509
SerLys: 10.259 ± 2.105
SerLeu: 7.386 ± 1.972
SerMet: 1.231 ± 0.974
SerAsn: 6.155 ± 1.205
SerPro: 0.821 ± 0.47
SerGln: 1.231 ± 0.663
SerArg: 3.283 ± 1.144
SerSer: 7.386 ± 2.693
SerThr: 1.641 ± 0.806
SerVal: 4.514 ± 1.347
SerTrp: 0.41 ± 0.422
SerTyr: 2.462 ± 1.345
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 0.0 ± 0.0
ThrCys: 0.41 ± 0.521
ThrAsp: 4.514 ± 1.729
ThrGlu: 3.693 ± 1.78
ThrPhe: 0.41 ± 0.304
ThrGly: 1.641 ± 0.678
ThrHis: 0.0 ± 0.0
ThrIle: 0.821 ± 0.722
ThrLys: 6.155 ± 1.426
ThrLeu: 4.514 ± 1.352
ThrMet: 1.231 ± 0.534
ThrAsn: 3.283 ± 0.925
ThrPro: 2.462 ± 1.132
ThrGln: 2.052 ± 0.68
ThrArg: 2.052 ± 0.921
ThrSer: 3.283 ± 1.894
ThrThr: 2.462 ± 1.492
ThrVal: 1.641 ± 0.92
ThrTrp: 0.41 ± 0.315
ThrTyr: 2.052 ± 0.573
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.231 ± 0.94
ValCys: 0.41 ± 0.521
ValAsp: 1.641 ± 1.178
ValGlu: 4.514 ± 0.908
ValPhe: 0.821 ± 0.419
ValGly: 2.052 ± 0.92
ValHis: 0.0 ± 0.0
ValIle: 4.924 ± 2.157
ValLys: 11.079 ± 2.263
ValLeu: 3.283 ± 1.334
ValMet: 0.41 ± 0.568
ValAsn: 2.462 ± 0.78
ValPro: 1.641 ± 0.638
ValGln: 1.641 ± 0.678
ValArg: 4.103 ± 0.885
ValSer: 3.693 ± 1.221
ValThr: 2.872 ± 1.037
ValVal: 5.334 ± 2.169
ValTrp: 0.0 ± 0.0
ValTyr: 3.283 ± 1.117
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.821 ± 0.607
TrpGlu: 0.821 ± 0.607
TrpPhe: 0.821 ± 0.39
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.641 ± 0.777
TrpLys: 1.231 ± 0.612
TrpLeu: 0.821 ± 0.39
TrpMet: 0.0 ± 0.0
TrpAsn: 1.231 ± 0.361
TrpPro: 0.41 ± 0.423
TrpGln: 0.41 ± 0.315
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.41 ± 0.423
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.821 ± 0.39
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.462 ± 0.86
TyrCys: 0.821 ± 0.443
TyrAsp: 0.821 ± 0.419
TyrGlu: 1.641 ± 0.728
TyrPhe: 0.821 ± 0.533
TyrGly: 2.872 ± 0.697
TyrHis: 2.462 ± 1.432
TyrIle: 5.334 ± 1.226
TyrLys: 4.514 ± 1.697
TyrLeu: 2.872 ± 1.196
TyrMet: 0.821 ± 0.47
TyrAsn: 3.693 ± 1.094
TyrPro: 0.821 ± 0.47
TyrGln: 2.462 ± 1.302
TyrArg: 0.41 ± 0.304
TyrSer: 4.514 ± 0.982
TyrThr: 2.052 ± 0.837
TyrVal: 2.872 ± 1.189
TyrTrp: 1.231 ± 0.598
TyrTyr: 2.462 ± 1.168
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (2438 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
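To make the error estimate more concrete, here is a minimal Python sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation are all assumptions for illustration; the page only states that bootstrapping with 100 rounds at the protein level was used.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter amino acid codes).
proteins = ["MKKLEIK", "AKKDE", "MSLKI"]
error = bootstrap_error(proteins, "KK")
print(f"KK: {pair_per_mille(proteins, 'KK'):.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the statistic depends on which proteins happen to be in the proteome.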

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski