Amino acid dipeptide frequency for Coccinia mosaic Tamil Nadu virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins with equal probability, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones are marked in blue in the table.
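As a rough illustration of the calculation described above, the following sketch counts overlapping dipeptides in a set of protein sequences, reports them in per mille, and compares each value with the uniform expectation of 1/400 (2.5‰). The function names `dipeptide_per_mille` and `classify` are hypothetical, and counting pairs only within a single protein (never across protein boundaries) is an assumption; the page does not state how the database handles this.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(sequences, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Assumption: pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in sequences:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in alphabet else "X" for c in seq.upper())
        # Overlapping pairs: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    letters = alphabet + "X"
    # Always return all 21 x 21 = 441 combinations, including zeros.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a value relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKVLAAGT", "MSSRRPSV"])  # toy sequences
    for pair in ("AA", "SS", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Note that the 2.5‰ baseline ignores the amino acid composition of the proteome; a composition-aware expectation would multiply the two single amino acid frequencies instead.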

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
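The unit conversion can be sketched as follows, using the AlaAla value from the table as input. The helper names are hypothetical and serve only to make the arithmetic explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


if __name__ == "__main__":
    ala_ala = 3.722  # AlaAla value from the table, in per mille
    print(per_mille_to_fraction(ala_ala))
    print(per_mille_to_percent(ala_ala))
    # The uniform expectation of 0.25 % corresponds to 2.5 per mille.
    print(per_mille_to_percent(2.5))
```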

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.722AlaAla: 3.722 ± 2.601
1.861AlaCys: 1.861 ± 0.802
2.481AlaAsp: 2.481 ± 0.936
1.861AlaGlu: 1.861 ± 0.767
1.241AlaPhe: 1.241 ± 0.826
1.241AlaGly: 1.241 ± 0.745
0.62AlaHis: 0.62 ± 0.81
1.861AlaIle: 1.861 ± 0.997
2.481AlaLys: 2.481 ± 1.057
6.203AlaLeu: 6.203 ± 1.538
0.62AlaMet: 0.62 ± 0.546
1.241AlaAsn: 1.241 ± 0.679
3.722AlaPro: 3.722 ± 1.606
2.481AlaGln: 2.481 ± 1.66
3.102AlaArg: 3.102 ± 1.387
4.963AlaSer: 4.963 ± 1.514
3.102AlaThr: 3.102 ± 1.238
3.722AlaVal: 3.722 ± 1.56
1.241AlaTrp: 1.241 ± 0.592
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.62CysCys: 0.62 ± 0.81
0.62CysAsp: 0.62 ± 0.526
0.62CysGlu: 0.62 ± 0.486
0.62CysPhe: 0.62 ± 0.589
2.481CysGly: 2.481 ± 1.148
0.62CysHis: 0.62 ± 0.562
1.241CysIle: 1.241 ± 0.682
1.241CysLys: 1.241 ± 0.695
0.0CysLeu: 0.0 ± 0.0
1.241CysMet: 1.241 ± 0.944
1.861CysAsn: 1.861 ± 0.997
3.722CysPro: 3.722 ± 2.616
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
2.481CysSer: 2.481 ± 1.091
2.481CysThr: 2.481 ± 1.093
1.241CysVal: 1.241 ± 0.972
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.481AspAla: 2.481 ± 1.057
0.0AspCys: 0.0 ± 0.0
1.861AspAsp: 1.861 ± 0.823
3.102AspGlu: 3.102 ± 0.867
1.241AspPhe: 1.241 ± 0.695
3.102AspGly: 3.102 ± 1.881
2.481AspHis: 2.481 ± 0.942
3.102AspIle: 3.102 ± 1.461
2.481AspLys: 2.481 ± 1.208
6.203AspLeu: 6.203 ± 2.272
0.62AspMet: 0.62 ± 0.537
0.0AspAsn: 0.0 ± 0.0
1.861AspPro: 1.861 ± 0.875
0.62AspGln: 0.62 ± 0.589
3.102AspArg: 3.102 ± 1.165
3.722AspSer: 3.722 ± 1.417
4.963AspThr: 4.963 ± 1.634
7.444AspVal: 7.444 ± 1.366
2.481AspTrp: 2.481 ± 1.671
1.861AspTyr: 1.861 ± 1.024
0.0AspXaa: 0.0 ± 0.0
Glu
2.481GluAla: 2.481 ± 1.091
1.241GluCys: 1.241 ± 0.597
1.861GluAsp: 1.861 ± 0.568
2.481GluGlu: 2.481 ± 1.167
4.342GluPhe: 4.342 ± 1.768
4.963GluGly: 4.963 ± 1.205
0.62GluHis: 0.62 ± 0.526
2.481GluIle: 2.481 ± 1.675
1.241GluLys: 1.241 ± 1.091
3.102GluLeu: 3.102 ± 1.113
0.0GluMet: 0.0 ± 0.0
3.102GluAsn: 3.102 ± 1.49
1.241GluPro: 1.241 ± 0.972
4.342GluGln: 4.342 ± 1.147
1.861GluArg: 1.861 ± 0.875
4.963GluSer: 4.963 ± 0.914
1.861GluThr: 1.861 ± 0.91
2.481GluVal: 2.481 ± 1.138
0.62GluTrp: 0.62 ± 0.562
2.481GluTyr: 2.481 ± 1.675
0.0GluXaa: 0.0 ± 0.0
Phe
1.241PheAla: 1.241 ± 0.604
1.241PheCys: 1.241 ± 0.695
3.102PheAsp: 3.102 ± 1.228
1.241PheGlu: 1.241 ± 0.695
0.62PhePhe: 0.62 ± 0.486
2.481PheGly: 2.481 ± 1.604
0.62PheHis: 0.62 ± 0.546
1.241PheIle: 1.241 ± 0.759
3.102PheLys: 3.102 ± 0.921
4.963PheLeu: 4.963 ± 2.196
0.62PheMet: 0.62 ± 0.546
1.861PheAsn: 1.861 ± 1.031
1.241PhePro: 1.241 ± 0.944
2.481PheGln: 2.481 ± 1.671
4.342PheArg: 4.342 ± 1.42
2.481PheSer: 2.481 ± 0.67
2.481PheThr: 2.481 ± 1.361
1.861PheVal: 1.861 ± 1.114
0.62PheTrp: 0.62 ± 0.537
1.861PheTyr: 1.861 ± 0.605
0.0PheXaa: 0.0 ± 0.0
Gly
3.722GlyAla: 3.722 ± 0.963
1.241GlyCys: 1.241 ± 0.676
3.102GlyAsp: 3.102 ± 0.914
1.861GlyGlu: 1.861 ± 0.766
1.861GlyPhe: 1.861 ± 1.347
2.481GlyGly: 2.481 ± 0.921
1.241GlyHis: 1.241 ± 0.597
2.481GlyIle: 2.481 ± 1.208
6.203GlyLys: 6.203 ± 1.612
5.583GlyLeu: 5.583 ± 2.114
1.241GlyMet: 1.241 ± 0.999
1.241GlyAsn: 1.241 ± 0.745
4.342GlyPro: 4.342 ± 1.665
2.481GlyGln: 2.481 ± 1.093
3.102GlyArg: 3.102 ± 1.214
4.963GlySer: 4.963 ± 0.903
4.963GlyThr: 4.963 ± 1.96
2.481GlyVal: 2.481 ± 1.002
0.0GlyTrp: 0.0 ± 0.0
1.241GlyTyr: 1.241 ± 0.682
0.0GlyXaa: 0.0 ± 0.0
His
2.481HisAla: 2.481 ± 0.715
0.62HisCys: 0.62 ± 0.526
2.481HisAsp: 2.481 ± 0.947
1.241HisGlu: 1.241 ± 1.62
2.481HisPhe: 2.481 ± 1.112
1.861HisGly: 1.861 ± 1.015
1.861HisHis: 1.861 ± 1.005
1.861HisIle: 1.861 ± 0.938
1.241HisLys: 1.241 ± 0.976
0.62HisLeu: 0.62 ± 0.546
0.62HisMet: 0.62 ± 0.526
1.861HisAsn: 1.861 ± 1.184
1.241HisPro: 1.241 ± 1.091
1.861HisGln: 1.861 ± 1.195
2.481HisArg: 2.481 ± 1.097
1.241HisSer: 1.241 ± 0.597
1.241HisThr: 1.241 ± 0.972
1.861HisVal: 1.861 ± 1.109
0.0HisTrp: 0.0 ± 0.0
1.861HisTyr: 1.861 ± 0.722
0.0HisXaa: 0.0 ± 0.0
Ile
1.241IleAla: 1.241 ± 0.99
0.0IleCys: 0.0 ± 0.0
3.722IleAsp: 3.722 ± 1.53
1.861IleGlu: 1.861 ± 1.01
3.722IlePhe: 3.722 ± 2.601
3.722IleGly: 3.722 ± 0.907
1.861IleHis: 1.861 ± 0.767
2.481IleIle: 2.481 ± 1.105
4.963IleLys: 4.963 ± 1.913
3.102IleLeu: 3.102 ± 1.043
0.62IleMet: 0.62 ± 0.52
1.861IleAsn: 1.861 ± 0.997
3.102IlePro: 3.102 ± 1.043
1.241IleGln: 1.241 ± 0.739
4.963IleArg: 4.963 ± 1.134
1.861IleSer: 1.861 ± 0.99
4.342IleThr: 4.342 ± 1.689
1.861IleVal: 1.861 ± 1.03
1.861IleTrp: 1.861 ± 1.24
1.861IleTyr: 1.861 ± 0.605
0.0IleXaa: 0.0 ± 0.0
Lys
2.481LysAla: 2.481 ± 1.117
2.481LysCys: 2.481 ± 0.879
3.722LysAsp: 3.722 ± 1.028
4.342LysGlu: 4.342 ± 1.697
1.861LysPhe: 1.861 ± 1.114
3.722LysGly: 3.722 ± 1.506
1.241LysHis: 1.241 ± 0.597
4.342LysIle: 4.342 ± 1.481
2.481LysLys: 2.481 ± 1.134
3.102LysLeu: 3.102 ± 1.243
1.241LysMet: 1.241 ± 0.682
4.963LysAsn: 4.963 ± 1.662
2.481LysPro: 2.481 ± 0.898
1.241LysGln: 1.241 ± 0.682
2.481LysArg: 2.481 ± 1.352
5.583LysSer: 5.583 ± 1.699
5.583LysThr: 5.583 ± 1.174
3.102LysVal: 3.102 ± 1.729
0.62LysTrp: 0.62 ± 0.486
2.481LysTyr: 2.481 ± 0.957
0.0LysXaa: 0.0 ± 0.0
Leu
1.861LeuAla: 1.861 ± 1.005
2.481LeuCys: 2.481 ± 1.23
4.342LeuAsp: 4.342 ± 2.165
4.342LeuGlu: 4.342 ± 1.345
3.722LeuPhe: 3.722 ± 1.329
2.481LeuGly: 2.481 ± 0.833
1.861LeuHis: 1.861 ± 0.875
3.102LeuIle: 3.102 ± 1.698
5.583LeuLys: 5.583 ± 2.044
3.102LeuLeu: 3.102 ± 1.342
1.861LeuMet: 1.861 ± 1.045
3.722LeuAsn: 3.722 ± 0.656
1.861LeuPro: 1.861 ± 0.722
1.861LeuGln: 1.861 ± 1.393
4.963LeuArg: 4.963 ± 1.579
6.824LeuSer: 6.824 ± 2.366
4.342LeuThr: 4.342 ± 1.254
4.963LeuVal: 4.963 ± 1.33
0.0LeuTrp: 0.0 ± 0.0
2.481LeuTyr: 2.481 ± 0.715
0.0LeuXaa: 0.0 ± 0.0
Met
0.62MetAla: 0.62 ± 0.486
0.0MetCys: 0.0 ± 0.0
2.481MetAsp: 2.481 ± 1.13
0.62MetGlu: 0.62 ± 0.526
1.241MetPhe: 1.241 ± 0.695
1.861MetGly: 1.861 ± 0.823
0.0MetHis: 0.0 ± 0.0
0.62MetIle: 0.62 ± 0.546
1.861MetLys: 1.861 ± 1.093
2.481MetLeu: 2.481 ± 1.138
1.241MetMet: 1.241 ± 0.829
1.861MetAsn: 1.861 ± 0.605
1.241MetPro: 1.241 ± 0.739
0.0MetGln: 0.0 ± 0.0
1.241MetArg: 1.241 ± 0.829
1.241MetSer: 1.241 ± 0.695
1.241MetThr: 1.241 ± 0.682
0.62MetVal: 0.62 ± 0.526
1.241MetTrp: 1.241 ± 0.872
1.241MetTyr: 1.241 ± 0.972
0.0MetXaa: 0.0 ± 0.0
Asn
3.102AsnAla: 3.102 ± 0.881
1.241AsnCys: 1.241 ± 0.682
2.481AsnAsp: 2.481 ± 1.194
1.241AsnGlu: 1.241 ± 0.597
0.62AsnPhe: 0.62 ± 0.486
1.861AsnGly: 1.861 ± 1.093
2.481AsnHis: 2.481 ± 1.369
4.963AsnIle: 4.963 ± 1.666
1.241AsnLys: 1.241 ± 1.125
2.481AsnLeu: 2.481 ± 0.879
1.861AsnMet: 1.861 ± 0.923
3.102AsnAsn: 3.102 ± 1.235
3.102AsnPro: 3.102 ± 0.985
1.241AsnGln: 1.241 ± 0.695
4.342AsnArg: 4.342 ± 1.52
4.963AsnSer: 4.963 ± 1.074
2.481AsnThr: 2.481 ± 0.881
3.722AsnVal: 3.722 ± 1.558
0.62AsnTrp: 0.62 ± 0.526
3.102AsnTyr: 3.102 ± 0.744
0.0AsnXaa: 0.0 ± 0.0
Pro
3.102ProAla: 3.102 ± 1.574
1.241ProCys: 1.241 ± 0.9
3.102ProAsp: 3.102 ± 1.876
0.0ProGlu: 0.0 ± 0.0
1.861ProPhe: 1.861 ± 0.743
3.102ProGly: 3.102 ± 0.881
2.481ProHis: 2.481 ± 1.525
3.102ProIle: 3.102 ± 1.101
3.722ProLys: 3.722 ± 1.863
3.722ProLeu: 3.722 ± 1.277
0.62ProMet: 0.62 ± 0.486
2.481ProAsn: 2.481 ± 1.086
1.241ProPro: 1.241 ± 1.03
1.241ProGln: 1.241 ± 0.676
6.203ProArg: 6.203 ± 1.157
8.685ProSer: 8.685 ± 1.544
3.102ProThr: 3.102 ± 1.018
3.722ProVal: 3.722 ± 1.365
0.62ProTrp: 0.62 ± 0.537
1.861ProTyr: 1.861 ± 1.001
0.0ProXaa: 0.0 ± 0.0
Gln
2.481GlnAla: 2.481 ± 1.1
0.62GlnCys: 0.62 ± 0.537
1.241GlnAsp: 1.241 ± 0.592
3.722GlnGlu: 3.722 ± 1.524
2.481GlnPhe: 2.481 ± 1.525
0.62GlnGly: 0.62 ± 0.546
0.62GlnHis: 0.62 ± 0.562
3.102GlnIle: 3.102 ± 1.429
1.241GlnLys: 1.241 ± 0.944
0.62GlnLeu: 0.62 ± 0.793
0.62GlnMet: 0.62 ± 0.546
3.102GlnAsn: 3.102 ± 1.566
1.861GlnPro: 1.861 ± 1.111
2.481GlnGln: 2.481 ± 1.117
1.861GlnArg: 1.861 ± 1.29
1.861GlnSer: 1.861 ± 0.938
2.481GlnThr: 2.481 ± 0.67
3.102GlnVal: 3.102 ± 1.261
0.0GlnTrp: 0.0 ± 0.0
1.861GlnTyr: 1.861 ± 0.993
0.0GlnXaa: 0.0 ± 0.0
Arg
1.861ArgAla: 1.861 ± 1.072
1.241ArgCys: 1.241 ± 0.944
3.722ArgAsp: 3.722 ± 1.183
3.102ArgGlu: 3.102 ± 1.481
3.102ArgPhe: 3.102 ± 1.614
3.722ArgGly: 3.722 ± 1.307
3.722ArgHis: 3.722 ± 0.959
3.102ArgIle: 3.102 ± 0.803
1.861ArgLys: 1.861 ± 1.057
4.342ArgLeu: 4.342 ± 2.178
1.861ArgMet: 1.861 ± 1.199
2.481ArgAsn: 2.481 ± 0.927
4.963ArgPro: 4.963 ± 1.337
3.722ArgGln: 3.722 ± 1.814
9.305ArgArg: 9.305 ± 3.426
8.685ArgSer: 8.685 ± 1.384
5.583ArgThr: 5.583 ± 1.704
5.583ArgVal: 5.583 ± 1.462
0.62ArgTrp: 0.62 ± 0.526
1.861ArgTyr: 1.861 ± 1.001
0.0ArgXaa: 0.0 ± 0.0
Ser
4.342SerAla: 4.342 ± 1.598
1.861SerCys: 1.861 ± 0.837
4.342SerAsp: 4.342 ± 1.369
3.722SerGlu: 3.722 ± 0.868
3.722SerPhe: 3.722 ± 1.096
3.722SerGly: 3.722 ± 1.034
1.861SerHis: 1.861 ± 1.114
1.241SerIle: 1.241 ± 0.745
8.065SerLys: 8.065 ± 1.818
2.481SerLeu: 2.481 ± 1.057
3.102SerMet: 3.102 ± 1.803
6.824SerAsn: 6.824 ± 1.039
8.065SerPro: 8.065 ± 1.605
3.102SerGln: 3.102 ± 0.949
6.824SerArg: 6.824 ± 2.238
9.926SerSer: 9.926 ± 2.924
9.305SerThr: 9.305 ± 2.668
6.203SerVal: 6.203 ± 2.766
1.241SerTrp: 1.241 ± 0.597
3.722SerTyr: 3.722 ± 1.024
0.0SerXaa: 0.0 ± 0.0
Thr
1.861ThrAla: 1.861 ± 0.883
0.0ThrCys: 0.0 ± 0.0
3.102ThrAsp: 3.102 ± 1.39
4.342ThrGlu: 4.342 ± 0.877
1.241ThrPhe: 1.241 ± 1.053
6.203ThrGly: 6.203 ± 1.768
2.481ThrHis: 2.481 ± 1.162
3.722ThrIle: 3.722 ± 1.398
4.342ThrLys: 4.342 ± 1.529
3.102ThrLeu: 3.102 ± 0.977
1.241ThrMet: 1.241 ± 0.604
3.722ThrAsn: 3.722 ± 0.904
4.342ThrPro: 4.342 ± 2.073
1.861ThrGln: 1.861 ± 0.767
3.722ThrArg: 3.722 ± 1.621
9.305ThrSer: 9.305 ± 4.29
2.481ThrThr: 2.481 ± 1.162
4.963ThrVal: 4.963 ± 2.202
0.62ThrTrp: 0.62 ± 0.526
4.963ThrTyr: 4.963 ± 1.033
0.0ThrXaa: 0.0 ± 0.0
Val
0.62ValAla: 0.62 ± 0.546
1.861ValCys: 1.861 ± 1.208
3.722ValAsp: 3.722 ± 2.048
6.824ValGlu: 6.824 ± 3.9
2.481ValPhe: 2.481 ± 0.936
3.722ValGly: 3.722 ± 1.763
3.102ValHis: 3.102 ± 0.842
2.481ValIle: 2.481 ± 1.096
4.342ValLys: 4.342 ± 1.517
7.444ValLeu: 7.444 ± 2.016
1.241ValMet: 1.241 ± 0.72
3.102ValAsn: 3.102 ± 0.881
4.342ValPro: 4.342 ± 1.047
1.861ValGln: 1.861 ± 0.855
6.203ValArg: 6.203 ± 2.255
4.963ValSer: 4.963 ± 2.274
2.481ValThr: 2.481 ± 1.358
8.065ValVal: 8.065 ± 2.964
0.62ValTrp: 0.62 ± 0.526
3.722ValTyr: 3.722 ± 1.696
0.0ValXaa: 0.0 ± 0.0
Trp
4.963TrpAla: 4.963 ± 1.124
0.0TrpCys: 0.0 ± 0.0
0.62TrpAsp: 0.62 ± 0.81
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.62TrpGly: 0.62 ± 0.546
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.62TrpMet: 0.62 ± 0.486
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.62TrpGln: 0.62 ± 0.546
1.861TrpArg: 1.861 ± 0.812
0.0TrpSer: 0.0 ± 0.0
1.241TrpThr: 1.241 ± 1.177
1.241TrpVal: 1.241 ± 1.053
0.0TrpTrp: 0.0 ± 0.0
1.241TrpTyr: 1.241 ± 0.592
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.102TyrAla: 3.102 ± 1.74
1.241TyrCys: 1.241 ± 0.959
0.62TyrAsp: 0.62 ± 0.486
1.241TyrGlu: 1.241 ± 0.695
1.241TyrPhe: 1.241 ± 0.735
1.861TyrGly: 1.861 ± 0.993
1.241TyrHis: 1.241 ± 0.872
3.722TyrIle: 3.722 ± 1.034
1.861TyrLys: 1.861 ± 1.01
3.102TyrLeu: 3.102 ± 1.329
1.241TyrMet: 1.241 ± 0.679
1.861TyrAsn: 1.861 ± 0.605
1.241TyrPro: 1.241 ± 0.604
1.241TyrGln: 1.241 ± 0.735
3.102TyrArg: 3.102 ± 1.379
4.963TyrSer: 4.963 ± 0.895
1.861TyrThr: 1.861 ± 0.758
4.963TyrVal: 4.963 ± 1.432
0.0TyrTrp: 0.0 ± 0.0
2.481TyrTyr: 2.481 ± 1.219
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (1613 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
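One plausible reading of this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. This is an assumption about the procedure, not a description of the actual Proteome-pI implementation; the function names, the fixed seed, and the use of the population standard deviation are all illustrative choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_sd(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumption: the resampling unit is the whole protein, drawn with
    replacement, and the spread is the standard deviation across rounds.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKVLAAGT", "MSSRRPSV", "MAAAKL"]  # toy sequences
    print(round(bootstrap_sd(toy_proteome, "AA"), 3))
```

With only a handful of proteins, as here, each resample can differ substantially from the original set, which is consistent with errors that are large relative to the values themselves.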

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski