Analysis of xDeepFM researcher's mental process - pure personal understanding

Background: DCN is PointWise crossover, and it has been proved in this paper that DCN is essentially a scalar multiplication of input vector X0, which cannot achieve the effect of feature crossover.

Q: How do we use vector for feature crossing

A: Following the idea of FM, A vector can be constructed for each feature and represented by vector dot product.

Q: But a vector becomes a scalar after dot multiplication. How about n-order eigencross?

A: The dot product can be viewed as 1. HadaMard dot product 2. Sum. After the vector completes the HadaMard product, the vector is still a vector, so we lag the sum operation, can’t we do the n-order feature crossing?

Q: But there’s a problem: suppose you have M vectors, you cross them and you get M(m-1)/2 vectors. M of M minus one over two vectors crossed by two, that’s too big a magnitude.

A: Then let’s set K group weights and sum the M(m-1)/2 vectors into K group vectors. Can’t we complete k-order feature cross?

mo4tech.com (Moment For Technology) is a global community with thousands techies from across the global hang out!Passionate technologists, be it gadget freaks, tech enthusiasts, coders, technopreneurs, or CIOs, you would find them all here.

Analysis of xDeepFM researcher’s mental process — pure personal understanding

Q: How do we use vector for feature crossing

Q: But a vector becomes a scalar after dot multiplication. How about n-order eigencross?

Q: But there’s a problem: suppose you have M vectors, you cross them and you get M(m-1)/2 vectors. M of M minus one over two vectors crossed by two, that’s too big a magnitude.

Q: Good idea, let’s sum all the vectors of each layer together, then we get the feature intersection of order 1-H!

Let’s parse CIN with a bad sketch

Analysis of xDeepFM researcher’s mental process — pure personal understanding

Q: How do we use vector for feature crossing

Q: But a vector becomes a scalar after dot multiplication. How about n-order eigencross?

Q: But there’s a problem: suppose you have M vectors, you cross them and you get M(m-1)/2 vectors. M of M minus one over two vectors crossed by two, that’s too big a magnitude.

Q: Good idea, let’s sum all the vectors of each layer together, then we get the feature intersection of order 1-H!

Let’s parse CIN with a bad sketch

Related Posts

How do I run Python code on the iPad?

What you didn’t know about Python! (Suggested collection)

Python applications – A full set of custom sorting solutions